Evaluating Software Development Agents: Patch Patterns, Code Quality, and Issue Complexity in Real-World GitHub Scenarios | IEEE International Conference on Software Analysis, Evolution, and Reengineering (SANER), 2024 |
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code | International Conference on Learning Representations (ICLR), 2024 |
Automating the Correctness Assessment of AI-generated Code for Security Contexts | Journal of Systems and Software (JSS), 2023 |
LLM for SoC Security: A Paradigm Shift | IEEE Access, 2023 |
Security Weaknesses of Copilot-Generated Code in GitHub Projects: An Empirical Study | ACM Transactions on Software Engineering and Methodology (TOSEM), 2023 |
Vulnerabilities in AI Code Generators: Exploring Targeted Data Poisoning Attacks | IEEE International Conference on Program Comprehension (ICPC), 2023 |