SWE-bench: Can Language Models Resolve Real-World GitHub Issues?International Conference on Learning Representations (ICLR), 2023 |
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of
Large Language Models for Code GenerationNeural Information Processing Systems (NeurIPS), 2023 |
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual
Benchmarking on HumanEval-XKnowledge Discovery and Data Mining (KDD), 2023 |
SPoC: Search-based Pseudocode to CodeNeural Information Processing Systems (NeurIPS), 2019 |