Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of
Large Language Models for Code GenerationNeural Information Processing Systems (NeurIPS), 2023 |
Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsNeural Information Processing Systems (NeurIPS), 2022 |