1905.03197
Unified Language Model Pre-training for Natural Language Understanding and Generation
8 May 2019
Li Dong
Nan Yang
Wenhui Wang
Furu Wei
Xiaodong Liu
Yu-Chiang Frank Wang
Jianfeng Gao
M. Zhou
H. Hon
ELM
AI4CE
Papers citing
"Unified Language Model Pre-training for Natural Language Understanding and Generation"
50 / 845 papers shown
Title
CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges
Y. Li
Qizhi Pei
Mengyuan Sun
Honglin Lin
Chenlin Ming
Xin Gao
Jiang Wu
C. He
Lijun Wu
ELM
LRM
40
0
0
27 Apr 2025
Unified Molecule Generation and Property Prediction
Adam Izdebski
Jan Olszewski
Pankhil Gawade
Krzysztof Koras
Serra Korkmaz
Valentin Rauscher
Jakub M. Tomczak
E. Szczurek
29
0
0
23 Apr 2025
QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform
Movina Moses
Mohab Elkaref
James Barry
Shinnosuke Tanaka
Vishnudev Kuruvanthodi
Nathan Herr
Campbell D Watson
Geeth de Mel
LM&MA
ELM
52
0
0
08 Apr 2025
Pay More Attention to the Robustness of Prompt for Instruction Data Mining
Qiang Wang
Dawei Feng
Xu Zhang
Ao Shen
Yang Xu
Bo Ding
H. Wang
AAML
41
0
0
31 Mar 2025
UFM: Unified Feature Matching Pre-training with Multi-Modal Image Assistants
Yide Di
Yun Liao
Hao Zhou
Kaijun Zhu
Qing Duan
Junhui Liu
Mingyu Lu
34
0
0
26 Mar 2025
Quantum EigenGame for excited state calculation
David Quiroga
Jason Han
Anastasios Kyrillidis
48
1
0
17 Mar 2025
TuneNSearch: a hybrid transfer learning and local search approach for solving vehicle routing problems
Arthur Corrêa
Cristóvão Silva
Liming Xu
Alexandra Brintrup
Samuel Moniz
48
0
0
16 Mar 2025
OpeNLGauge: An Explainable Metric for NLG Evaluation with Open-Weights LLMs
Ivan Kartáč
Mateusz Lango
Ondrej Dusek
ELM
46
1
0
14 Mar 2025
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Yingfeng Luo
Tong Zheng
Yongyu Mu
B. Li
Qinghong Zhang
...
Ziqiang Xu
Peinan Feng
Xiaoqian Liu
Tong Xiao
Jingbo Zhu
AI4CE
82
0
0
09 Mar 2025
PAIR: A Novel Large Language Model-Guided Selection Strategy for Evolutionary Algorithms
Shady Ali
Mahmoud Ashraf
Seif Hegazy
Fatty Salem
Hoda Mokhtar
Mohamed Medhat Gaber
M. Alrefaie
LLMAG
50
0
0
05 Mar 2025
ReaderLM-v2: Small Language Model for HTML to Markdown and JSON
Feng Wang
Zesheng Shi
Bo Wang
Nan Wang
Han Xiao
RALM
72
1
0
03 Mar 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
48
1
0
02 Mar 2025
Retrieval Backward Attention without Additional Training: Enhance Embeddings of Large Language Models via Repetition
Yifei Duan
Raphael Shang
Deng Liang
Yongqiang Cai
80
0
0
28 Feb 2025
UniASM: Binary Code Similarity Detection without Fine-tuning
Yeming Gu
Hui Shu
Fei Kang
Fan Hu
53
10
0
21 Feb 2025
Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models
Ranjan Sapkota
Shaina Raza
Manoj Karkee
40
4
0
21 Feb 2025
Note-Level Singing Melody Transcription for Time-Aligned Musical Score Generation
Leekyung Kim
Sungwook Jeon
Wan Heo
Jonghun Park
80
0
0
18 Feb 2025
LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM
Zhi-Hua Zhou
Kun-Yang Yu
Shi-Yu Tian
Jiang-Xin Shi
Xiao-Wen Yang
Pengxiao Song
Yi-Xuan Jin
Lan-Zhe Guo
Yu-Feng Li
ELM
AILaw
50
1
0
10 Feb 2025
Multi-Domain Graph Foundation Models: Robust Knowledge Transfer via Topology Alignment
Shuo Wang
Bokui Wang
Zhixiang Shen
Boyan Deng
Zhao Kang
90
1
0
04 Feb 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Erik Cambria
LM&MA
AILaw
90
151
0
28 Jan 2025
CrisisSense-LLM: Instruction Fine-Tuned Large Language Model for Multi-label Social Media Text Classification in Disaster Informatics
Kai Yin
Chengkai Liu
Ali Mostafavi
Xia Hu
49
8
0
17 Jan 2025
A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving
Yi Zhang
Guangyou Zhou
Zhiwen Xie
Jinjin Ma
Jimmy Xiangji Huang
AIMat
35
3
0
08 Jan 2025
Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation
Zhi Qu
Yiran Wang
Jiannan Mao
Chenchen Ding
Hideki Tanaka
Masao Utiyama
Taro Watanabe
LRM
40
0
0
06 Jan 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Yulei Qin
Yuncheng Yang
Pengcheng Guo
Gang Li
Hang Shao
Yuchen Shi
Zihan Xu
Yun Gu
Ke Li
Xing Sun
ALM
88
11
0
31 Dec 2024
GPT or BERT: why not both?
Lucas Georges Gabriel Charpentier
David Samuel
47
5
0
31 Dec 2024
Segment-Based Attention Masking for GPTs
Shahar Katz
Liran Ringel
Yaniv Romano
Lior Wolf
CLL
40
1
0
24 Dec 2024
AntLM: Bridging Causal and Masked Language Models
Xinru Yu
Bin Guo
Shiwei Luo
J. Wang
Tao Ji
Yuanbin Wu
CLL
74
1
0
04 Dec 2024
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey
Longxuan Ma
Mingda Li
Weinan Zhang
Jiapeng Li
Ting Liu
40
16
0
14 Nov 2024
TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models
Jonathan Fhima
Elad Ben Avraham
Oren Nuriel
Yair Kittenplon
Roy Ganz
Aviad Aberdam
Ron Litman
VLM
26
1
0
07 Nov 2024
TrajGPT: Controlled Synthetic Trajectory Generation Using a Multitask Transformer-Based Spatiotemporal Model
Shang-Ling Hsu
Emmanuel Tung
John Krumm
Cyrus Shahabi
Khurram Hassan-Shafique
19
4
0
07 Nov 2024
No Culture Left Behind: ArtELingo-28, a Benchmark of WikiArt with Captions in 28 Languages
Youssef Mohamed
Runjia Li
Ibrahim Said Ahmad
Kilichbek Haydarov
Philip H. S. Torr
Kenneth Ward Church
Mohamed Elhoseiny
VLM
23
6
0
06 Nov 2024
HG-Adapter: Improving Pre-Trained Heterogeneous Graph Neural Networks with Dual Adapters
Yujie Mo
Runpeng Yu
Xiaofeng Zhu
Xinchao Wang
33
1
0
02 Nov 2024
A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation
Haoyu Song
W. Zhang
Kaiyan Zhang
Ting Liu
32
3
0
26 Oct 2024
Croc: Pretraining Large Multimodal Models with Cross-Modal Comprehension
Yin Xie
Kaicheng Yang
Ninghua Yang
Weimo Deng
Xiangzi Dai
...
Yumeng Wang
Xiang An
Yongle Zhao
Ziyong Feng
Jiankang Deng
MLLM
VLM
40
1
0
18 Oct 2024
ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos
Arpan Phukan
Manish Gupta
Asif Ekbal
VGen
37
0
0
13 Oct 2024
Optimized Biomedical Question-Answering Services with LLM and Multi-BERT Integration
Cheng Qian
Xianglong Shi
Shanshan Yao
Yichen Liu
Fengming Zhou
Zishu Zhang
Junaid Akram
Ali Braytee
Ali Anaissi
AI4MH
21
2
0
11 Oct 2024
Deep Transfer Learning Based Peer Review Aggregation and Meta-review Generation for Scientific Articles
Md. Tarek Hasan
Mohammad Nazmush Shamael
H. M. Mutasim Billah
Arifa Akter
M. Hossain
Sumayra Islam
Salekul Islam
Swakkhar Shatabda
26
0
0
05 Oct 2024
Team MTS @ AutoMin 2021: An Overview of Existing Summarization Approaches and Comparison to Unsupervised Summarization Techniques
Olga Iakovenko
Anna Andreeva
Anna Lapidus
Liana Mikaelyan
26
2
0
04 Oct 2024
Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions
Zeyneb N. Kaya
Souvick Ghosh
40
0
0
25 Sep 2024
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts
X. Shi
Shiyu Wang
Yuqi Nie
Dianqi Li
Zhou Ye
Qingsong Wen
Ming Jin
AI4TS
34
26
0
24 Sep 2024
NEVLP: Noise-Robust Framework for Efficient Vision-Language Pre-training
Yiyi Tao
Zhuoyue Wang
Hang Zhang
Lun Wang
VLM
38
13
0
15 Sep 2024
SongCreator: Lyrics-based Universal Song Generation
Shun Lei
Yixuan Zhou
Boshi Tang
Max W. Y. Lam
Feng Liu
Hangyu Liu
Jingcheng Wu
Shiyin Kang
Zhiyong Wu
Helen Meng
38
4
0
09 Sep 2024
Leveraging Large Language Models for Wireless Symbol Detection via In-Context Learning
Momin Abbas
Koushik Kar
Tianyi Chen
21
5
0
28 Aug 2024
Legilimens: Practical and Unified Content Moderation for Large Language Model Services
Jialin Wu
Jiangyi Deng
Shengyuan Pang
Yanjiao Chen
Jiayang Xu
Xinfeng Li
Wenyuan Xu
32
6
0
28 Aug 2024
A Survey of Large Language Models for European Languages
Wazir Ali
S. Pyysalo
39
2
0
27 Aug 2024
Actra: Optimized Transformer Architecture for Vision-Language-Action Models in Robot Learning
Yueen Ma
Dafeng Chi
Shiguang Wu
Yuecheng Liu
Yuzheng Zhuang
Jianye Hao
Irwin King
29
5
0
02 Aug 2024
Intermittent Semi-working Mask: A New Masking Paradigm for LLMs
Mingcong Lu
Jiangcai Zhu
Wang Hao
Zheng Li
Shusheng Zhang
Kailai Shao
Chao Chen
Nan Li
Feng Wang
Xin Lu
38
0
0
01 Aug 2024
MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts
Zhenpeng Su
Zijia Lin
Xue Bai
Xing Wu
Yizhe Xiong
...
Guangyuan Ma
Hui Chen
Guiguang Ding
Wei Zhou
Songlin Hu
MoE
31
4
0
13 Jul 2024
Explainable Natural Language Processing for Corporate Sustainability Analysis
Keane Ong
Rui Mao
Ranjan Satapathy
Ricardo Shirota Filho
Erik Cambria
Johan Sulaeman
G. Mengaldo
26
7
0
03 Jul 2024
Efficient Fusion and Task Guided Embedding for End-to-end Autonomous Driving
Yipin Guo
Yilin Lang
Qinyuan Ren
42
0
0
03 Jul 2024
PromptIntern: Saving Inference Costs by Internalizing Recurrent Prompt during Large Language Model Fine-tuning
Jiaru Zou
Mengyu Zhou
Tao Li
Shi Han
Dongmei Zhang
46
6
0
02 Jul 2024