ResearchTrend.AI
SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation

10 August 2021
Xin Wang
Yasheng Wang
Fei Mi
Pingyi Zhou
Yao Wan
Xiao Liu
Li Li
Hao Wu
Jin Liu
Xin Jiang

Papers citing "SynCoBERT: Syntax-Guided Multi-Modal Contrastive Pre-Training for Code Representation"

50 / 59 papers shown
SPENCER: Self-Adaptive Model Distillation for Efficient Code Retrieval
ACM Transactions on Software Engineering and Methodology (TOSEM), 2025
Wenchao Gu
Zongyi Lyu
Yanlin Wang
Hongyu Zhang
Cuiyun Gao
Michael R. Lyu
256
3
0
01 Aug 2025
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey
Meishan Zhang
Xin Zhang
X. Zhao
Shouzheng Huang
Baotian Hu
Min Zhang
370
4
0
28 Jul 2025
LEANCODE: Understanding Models Better for Code Simplification of Pre-trained Large Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Yan Wang
Ling Ding
Tien N Nguyen
Shaohua Wang
Yanan Zheng
434
1
0
20 May 2025
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation Grounding
International Conference on Learning Representations (ICLR), 2025
Indraneil Paul
Haoyi Yang
Goran Glavaš
Kristian Kersting
Iryna Gurevych
AAML, SyDa
282
3
0
27 Mar 2025
Speculative Decoding for Verilog: Speed and Quality, All in One
Design Automation Conference (DAC), 2025
Changran Xu
Yi Liu
Yunhao Zhou
Shan Huang
Ningyi Xu
Qiang Xu
250
1
0
18 Mar 2025
Grammar-Based Code Representation: Is It a Worthy Pursuit for LLMs?
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
Qingyuan Liang
Zhao Zhang
Zeyu Sun
Zheng Lin
Qi Luo
...
Bin Chen
Haotian Zhang
Jun Liu
Haotian Zhang
Y. Xiong
379
12
0
07 Mar 2025
GNN-Coder: Boosting Semantic Code Retrieval with Combined GNNs and Transformer
Yufan Ye
Pu Pang
Ting Zhang
Hua Huang
527
1
0
24 Feb 2025
Code LLMs: A Taxonomy-based Survey
BigData Congress [Services Society] (BSS), 2024
Nishat Raihan
Christian D. Newman
Marcos Zampieri
425
9
0
11 Dec 2024
GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Ziyin Zhang
Hang Yu
Shijie Li
Peng Di
Jianguo Li
Rui Wang
622
9
0
06 Sep 2024
Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking
Zhi-Cun Lyu
Xin-Ye Li
Zheng Xie
Ming Li
289
19
0
11 Aug 2024
Towards Better Code Understanding in Decoder-Only Models with Contrastive Learning
Jiayi Lin
Yutao Xie
Yue Yu
Yibiao Yang
Lei Zhang
SyDa
200
1
0
18 Jun 2024
Uncovering LLM-Generated Code: A Zero-Shot Synthetic Code Detector via Code Rewriting
Tong Ye
Yangkai Du
Tengfei Ma
Lingfei Wu
Xuhong Zhang
R. Beyah
Wenhai Wang
DeLMO
360
20
0
25 May 2024
On the Limitations of Embedding Based Methods for Measuring Functional Correctness for Code Generation
Atharva Naik
299
10
0
26 Apr 2024
Analyzing the Performance of Large Language Models on Code Summarization
International Conference on Language Resources and Evaluation (LREC), 2024
Rajarshi Haldar
Anjali Narayan-Chen
241
41
0
10 Apr 2024
CSEPrompts: A Benchmark of Introductory Computer Science Prompts
International Symposium on Methodologies for Intelligent Systems (ISMIS), 2024
Md. Nishat Raihan
Dhiman Goswami
Sadiya Sayara Chowdhury Puspo
Christian D. Newman
Tharindu Ranasinghe
Marcos Zampieri
ELM
263
4
0
03 Apr 2024
ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search
Zehan Li
Jianfei Zhang
Chuantao Yin
Y. Ouyang
Wenge Rong
191
16
0
25 Mar 2024
Beyond Self-learned Attention: Mitigating Attention Bias in Transformer-based Models Using Attention Guidance
Jiri Gesi
Iftekhar Ahmed
265
1
0
26 Feb 2024
Code Needs Comments: Enhancing Code LLMs with Comment Augmentation
Demin Song
Honglin Guo
Yunhua Zhou
Shuhao Xing
Yudong Wang
...
Wenwei Zhang
Qipeng Guo
Hang Yan
Xipeng Qiu
Dahua Lin
SyDa
226
19
0
20 Feb 2024
Code Representation Learning At Scale
Dejiao Zhang
W. Ahmad
Ming Tan
Hantian Ding
Ramesh Nallapati
Dan Roth
Xiaofei Ma
Bing Xiang
OffRL
254
28
0
02 Feb 2024
Investigating the Efficacy of Large Language Models for Code Clone Detection
IEEE International Conference on Program Comprehension (ICPC), 2024
Mohamad Khajezade
Jie Wu
Fatemeh H. Fard
Gema Rodríguez-Pérez
Mohamed Sami Shehata
241
35
0
24 Jan 2024
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit
ACM Computing Surveys (ACM Comput. Surv.), 2023
Yao Wan
Yang He
Zhangqian Bi
Jianguo Zhang
Hongyu Zhang
Yulei Sui
Guandong Xu
Hai Jin
Philip S. Yu
307
47
0
30 Dec 2023
Language Agnostic Code Embeddings
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Saiteja Utpala
Alex Gu
Pin-Yu Chen
259
2
0
25 Oct 2023
Rethinking Negative Pairs in Code Search
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Haochen Li
Xin Zhou
Anh Tuan Luu
Chunyan Miao
410
18
0
12 Oct 2023
Contrastive Prompt Learning-based Code Search based on Interaction Matrix
Yubo Zhang
Yanfang Liu
Xinxin Fan
Yunfeng Lu
243
3
0
10 Oct 2023
Laminar: A New Serverless Stream-based Framework with Semantic Code Search and Code Completion
Zaynab Zahra
Zihao Li
Rosa Filgueira
164
6
0
01 Sep 2023
Large Language Models for Software Engineering: A Systematic Literature Review
ACM Transactions on Software Engineering and Methodology (TOSEM), 2023
Xinying Hou
Yanjie Zhao
Yue Liu
Zhou Yang
Kailong Wang
Li Li
Xiapu Luo
David Lo
John C. Grundy
Haoyu Wang
484
912
0
21 Aug 2023
Evaluating Instruction-Tuned Large Language Models on Code Comprehension and Generation
Zhiqiang Yuan
Junwei Liu
Qiancheng Zi
Wentai Deng
Xin Peng
ALM, ELM, LRM
257
108
0
02 Aug 2023
Contrastive Learning for API Aspect Analysis
International Conference on Automated Software Engineering (ASE), 2023
G. M. Shahariar
Tahmid Hasan
Anindya Iqbal
Gias Uddin
231
2
0
31 Jul 2023
Natural Language Generation and Understanding of Big Code for AI-Assisted Programming: A Review
Entropy (Entropy), 2023
M. Wong
Shangxin Guo
Ching Nam Hang
Siu-Wai Ho
C. Tan
288
142
0
04 Jul 2023
Exploring the Robustness of Large Language Models for Solving Programming Problems
Atsushi Shirafuji
Yutaka Watanobe
Takumi Ito
Makoto Morishita
Yuki Nakamura
Yusuke Oda
Jun Suzuki
ELM
374
28
0
26 Jun 2023
Multi-target Backdoor Attacks for Code Pre-trained Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Yanzhou Li
Shangqing Liu
Kangjie Chen
Xiaofei Xie
Tianwei Zhang
Yang Liu
AAML, SILM
302
34
0
14 Jun 2023
Understanding Programs by Exploiting (Fuzzing) Test Cases
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Jianyu Zhao
Yuyang Rong
Yiwen Guo
Yifeng He
Hao Chen
258
20
0
23 May 2023
Neural Machine Translation for Code Generation
K. Dharma
Clayton T. Morrison
384
7
0
22 May 2023
CCT-Code: Cross-Consistency Training for Multilingual Clone Detection and Code Search
North American Chapter of the Association for Computational Linguistics (NAACL), 2023
Nikita Sorokin
Dmitry Abulkhanov
Sergey I. Nikolenko
Valentin Malykh
253
7
0
19 May 2023
Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets
International Conference on Language Resources and Evaluation (LREC), 2023
I. Sedykh
Dmitry Abulkhanov
Nikita Sorokin
Sergey I. Nikolenko
Valentin Malykh
273
3
0
19 May 2023
CodeT5+: Open Code Large Language Models for Code Understanding and Generation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Yue Wang
Hung Le
Akhilesh Deepak Gotmare
Nghi D. Q. Bui
Junnan Li
Steven C. H. Hoi
ALM
487
686
0
13 May 2023
Code Execution with Pre-trained Language Models
Annual Meeting of the Association for Computational Linguistics (ACL), 2023
Chenxiao Liu
Shuai Lu
Weizhu Chen
Daxin Jiang
Alexey Svyatkovskiy
Shengyu Fu
Neel Sundaresan
Nan Duan
ELM
288
46
0
08 May 2023
Neuro-symbolic Zero-Shot Code Cloning with Cross-Language Intermediate Representation
Krishnam Hasija
Shrishti Pradhan
Manasi Patwardhan
Raveendra Kumar Medicherla
Lovekesh Vig
Ravindra Naik
177
2
0
26 Apr 2023
An Unbiased Transformer Source Code Learning with Semantic Vulnerability Graph
European Symposium on Security and Privacy (Euro S&P), 2023
Nafis Tanveer Islam
G. Parra
Dylan Manuel
E. Bou-Harb
Peyman Najafirad
261
14
0
17 Apr 2023
MultiCoder: Multi-Programming-Lingual Pre-Training for Low-Resource Code Completion
Zi Gong
Yinpeng Guo
Pingyi Zhou
Cuiyun Gao
Yasheng Wang
Zenglin Xu
275
9
0
19 Dec 2022
An Empirical Study of Deep Learning Models for Vulnerability Detection
International Conference on Software Engineering (ICSE), 2022
Benjamin Steenhoek
Md. Mahbubur Rahman
Richard Jiles
Wei Le
ELM, AAML
450
140
0
15 Dec 2022
CLAWSAT: Towards Both Robust and Accurate Code Models
IEEE International Conference on Software Analysis, Evolution, and Reengineering (SANER), 2022
Jinghan Jia
Shashank Srikant
Tamara Mitrovska
Chuang Gan
Shiyu Chang
Sijia Liu
Una-May O’Reilly
AAML
423
15
0
21 Nov 2022
Exploring Representation-Level Augmentation for Code Search
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Haochen Li
Steven C. H. Hoi
Cyril Leung
Yanxian Huang
Yuan Huang
Hongyu Zhang
Yanlin Wang
234
25
0
21 Oct 2022
Soft-Labeled Contrastive Pre-training for Function-level Code Representation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Xiaonan Li
Daya Guo
Yeyun Gong
Yun Lin
Yelong Shen
Xipeng Qiu
Daxin Jiang
Weizhu Chen
Nan Duan
213
20
0
18 Oct 2022
CAT-probing: A Metric-based Approach to Interpret How Pre-trained Models for Programming Language Attend Code Structure
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Nuo Chen
Qiushi Sun
Renyu Zhu
Xiang Li
Xuesong Lu
Ming Gao
350
11
0
07 Oct 2022
Semantic-Preserving Adversarial Code Comprehension
International Conference on Computational Linguistics (COLING), 2022
Yiyang Li
Hongqiu Wu
Hai Zhao
AAML
195
10
0
12 Sep 2022
CommitBART: A Large Pre-trained Model for GitHub Commits
Shangqing Liu
Yanzhou Li
Xiaofei Xie
Yang Liu
VLM, AI4TS
262
26
0
17 Aug 2022
Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines
European Conference on Software Architecture (ECSA), 2022
Patrick Flynn
T. Vanderbruggen
C. Liao
Pei-Hung Lin
M. Emani
Xipeng Shen
259
5
0
11 Aug 2022
CoditT5: Pretraining for Source Code and Natural Language Editing
International Conference on Automated Software Engineering (ASE), 2022
Jiyang Zhang
Sheena Panthaplackel
Pengyu Nie
Junyi Jessy Li
Miloš Gligorić
KELM
343
124
0
10 Aug 2022
Deep Learning Meets Software Engineering: A Survey on Pre-Trained Models of Source Code
International Joint Conference on Artificial Intelligence (IJCAI), 2022
Changan Niu
Chuanyi Li
Bin Luo
Vincent Ng
SyDa, VLM
320
62
0
24 May 2022