All Papers
0 / 0 papers shown
Title |
|---|
Title |
|---|

Title |
|---|
![]() LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for
Enhanced Following of Instructions with Multiple ConstraintsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |
![]() Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024 |
![]() MAmmoTH2: Scaling Instructions from the WebNeural Information Processing Systems (NeurIPS), 2024 |
![]() Easy-to-Hard Generalization: Scalable Alignment Beyond Human SupervisionNeural Information Processing Systems (NeurIPS), 2024 |
![]() MetaMath: Bootstrap Your Own Mathematical Questions for Large Language
ModelsInternational Conference on Learning Representations (ICLR), 2023 |
![]() Let Me Teach You: Pedagogical Foundations of Feedback for Language
ModelsConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
![]() TheoremQA: A Theorem-driven Question Answering datasetConference on Empirical Methods in Natural Language Processing (EMNLP), 2023 |
![]() CRITIC: Large Language Models Can Self-Correct with Tool-Interactive
CritiquingInternational Conference on Learning Representations (ICLR), 2023 |
![]() Teaching Large Language Models to Self-DebugInternational Conference on Learning Representations (ICLR), 2023 |
![]() REFINER: Reasoning Feedback on Intermediate RepresentationsConference of the European Chapter of the Association for Computational Linguistics (EACL), 2023 |
![]() Self-Refine: Iterative Refinement with Self-FeedbackNeural Information Processing Systems (NeurIPS), 2023 |
![]() Language Models can Solve Computer TasksNeural Information Processing Systems (NeurIPS), 2023 |
![]() Reflexion: Language Agents with Verbal Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023 |
![]() Distilling Reasoning Capabilities into Smaller Language ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 |
![]() Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve ThemAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 |
![]() RARR: Researching and Revising What Language Models Say, Using Language
ModelsAnnual Meeting of the Association for Computational Linguistics (ACL), 2022 |
![]() CodeT: Code Generation with Generated TestsInternational Conference on Learning Representations (ICLR), 2022 |
![]() Training language models to follow instructions with human feedbackNeural Information Processing Systems (NeurIPS), 2022 |
![]() Chain-of-Thought Prompting Elicits Reasoning in Large Language ModelsNeural Information Processing Systems (NeurIPS), 2022 |