Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2309.16609
Cited By
Qwen Technical Report
28 September 2023
Jinze Bai
Shuai Bai
Yunfei Chu
Zeyu Cui
Kai Dang
Xiaodong Deng
Yang Fan
Wenbin Ge
Yu Han
Fei Huang
Binyuan Hui
Luo Ji
Mei Li
Junyang Lin
Runji Lin
Dayiheng Liu
Gao Liu
Chengqiang Lu
Keming Lu
Jianxin Ma
Rui Men
Xingzhang Ren
Xuancheng Ren
Chuanqi Tan
Sinan Tan
Jianhong Tu
Peng Wang
Shijie Wang
Wei Wang
Shengguang Wu
Benfeng Xu
Jin Xu
An Yang
Hao Yang
Jian Yang
Shusheng Yang
Yang Yao
Bowen Yu
Hongyi Yuan
Zheng Yuan
Jianwei Zhang
Xinyu Zhang
Yichang Zhang
Zhenru Zhang
Chang Zhou
Jingren Zhou
Xiaohuan Zhou
Tianhang Zhu
OSLM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (36 upvotes)
Papers citing
"Qwen Technical Report"
50 / 1,888 papers shown
MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning
Tajamul Ashraf
Umair Nawaz
Abdelrahman M. Shaker
Rao Muhammad Anwer
Philip Torr
Fahad Shahbaz Khan
Salman Khan
230
0
0
09 Oct 2025
In-Context Clustering with Large Language Models
Ying Wang
Mengye Ren
Andrew Gordon Wilson
159
0
0
09 Oct 2025
JAI-1: A Thai-Centric Large Language Model
Attapol T. Rutherford
Jullajak Karnjanaekarin
Narongkorn Panitsrisit
Pontakorn Trakuekul
Sumana Sumanakul
Natchanon Pollertlam
83
0
0
08 Oct 2025
Inconsistent Affective Reaction: Sentiment of Perception and Opinion in Urban Environments
CAADRIA proceedings (CAADRIA), 2025
Jingfei Huang
Han Tu
222
0
0
08 Oct 2025
Sunflower: A New Approach To Expanding Coverage of African Languages in Large Language Models
Benjamin Akera
Evelyn Nafula Ouma
Gilbert Yiga
Patrick Walukagga
Phionah Natukunda
...
Imran Sekalala
Nimpamya Janat Namara
Engineer Bainomugisha
Ernest Mwebaze
John Quinn
195
0
0
08 Oct 2025
Populism Meets AI: Advancing Populism Research with LLMs
Eduardo Ryô Tamaki
Eduardo Ryô Tamaki
Julia Chatterley
Grant Mitchell
Semir Dzebo
Cristóbal Sandoval
Levente Littvay
Kirk Hawkins
220
0
0
08 Oct 2025
Latent Representation Learning in Heavy-Ion Collisions with MaskPoint Transformer
Jing-Zong Zhang
Shuang Guo
Li-Lin Zhu
Lingxiao Wang
Guo-Liang Ma
163
10
0
08 Oct 2025
Learning to Rewrite Prompts for Bootstrapping LLMs on Downstream Tasks
Yuwen Tan
Xiang Xiang
Kun He
John E. Hopcroft
123
0
0
08 Oct 2025
Auto-Stega: An Agent-Driven System for Lifelong Strategy Evolution in LLM-Based Text Steganography
Jiuan Zhou
Yu Cheng
Yuan Xie
Z. Yin
128
4
0
08 Oct 2025
From Principles to Practice: A Systematic Study of LLM Serving on Multi-core NPUs
Tianhao Zhu
Dahu Feng
Erhu Feng
Yubin Xia
142
0
0
07 Oct 2025
CDTP: A Large-Scale Chinese Data-Text Pair Dataset for Comprehensive Evaluation of Chinese LLMs
Chengwei Wu
Jiapu Wang
Mingyang Gao
Xingrui Zhuo
Jipeng Guo
...
Haoran Luo
Tianyu Chen
Haoyi Zhou
Shirui Pan
Zechao Li
136
0
0
07 Oct 2025
Automated Repeatable Adversary Threat Emulation with Effects Language (EL)
Suresh Damodaran
Paul D. Rowe
AAML
141
10
0
07 Oct 2025
Diversity Is All You Need for Contrastive Learning: Spectral Bounds on Gradient Magnitudes
Peter Ochieng
94
1
0
07 Oct 2025
EmbodiedCoder: Parameterized Embodied Mobile Manipulation via Modern Coding Model
Zefu Lin
Rongxu Cui
Chen Hanning
Xiangyu Wang
Junjia Xu
...
Chen Wenbo
Hui Zhou
Lue Fan
W. Li
Zhaoxiang Zhang
LM&Ro
180
1
0
07 Oct 2025
UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models
Wenhao Guan
Zhikang Niu
Ziyue Jiang
Kaidi Wang
Peijie Chen
Q. Hong
Lin Li
Xie Chen
AuLLM
344
0
0
06 Oct 2025
Retrieval-Augmented Code Generation: A Survey with Focus on Repository-Level Approaches
Yicheng Tao
Yao Qin
Yepang Liu
3DV
183
6
0
06 Oct 2025
LEAML: Label-Efficient Adaptation to Out-of-Distribution Visual Tasks for Multimodal Large Language Models
Ci-Siang Lin
Min-Hung Chen
Yu-Yang Sheng
Y. Wang
VLM
153
0
0
03 Oct 2025
TokenFlow: Responsive LLM Text Streaming Serving under Request Burst via Preemptive Scheduling
Junyi Chen
Chuheng Du
Renyuan Liu
Shuochao Yao
Dingtian Yan
Jiang Liao
Shengzhong Liu
Fan Wu
Guihai Chen
182
3
0
03 Oct 2025
Distributed Low-Communication Training with Decoupled Momentum Optimization
S. Nedelkoski
Alexander Acker
O. Kao
Soeren Becker
Dominik Scheinert
104
0
0
03 Oct 2025
Growing Visual Generative Capacity for Pre-Trained MLLMs
Hanyu Wang
Jiaming Han
Ziyan Yang
Qi Zhao
Shanchuan Lin
Xiangyu Yue
Abhinav Shrivastava
Zhenheng Yang
Hao Chen
VLM
217
1
0
02 Oct 2025
Litespark Technical Report: High-Throughput, Energy-Efficient LLM Training Framework
Nii Osae Osae Dade
Moinul Hossain Rahat
146
0
0
02 Oct 2025
Demystifying the Roles of LLM Layers in Retrieval, Knowledge, and Reasoning
Xinyuan Song
Keyu Wang
Pengxiang Li
L. Yin
Shiwei Liu
302
2
0
02 Oct 2025
VLA-R1: Enhancing Reasoning in Vision-Language-Action Models
Angen Ye
Zeyu Zhang
Boyuan Wang
Xiaofeng Wang
Dapeng Zhang
Zheng Hua Zhu
LRM
VLM
158
10
0
02 Oct 2025
Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Yurun Chen
Xavier Hu
Y. Liu
Ziqi Wang
Zeyi Liao
...
Feng Wei
Yuxi Qian
Bo Zheng
Keting Yin
Shengyu Zhang
LLMAG
244
1
0
01 Oct 2025
Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs
Leyla Mirvakhabova
B. Bejnordi
Gaurav Kumar
Hanxue Liang
Wanru Zhao
Paul N. Whatmough
MoE
99
1
0
01 Oct 2025
NLD-LLM: A systematic framework for evaluating small language transformer models on natural language description
Hamed Jelodar
Mohammad Meymani
Parisa Hamedi
Tochukwu Emmanuel Nwankwo
Samita Bai
Roozbeh Razavi-Far
Ali Ghorbani
122
2
0
01 Oct 2025
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
Zichen Wen
Shaobo Wang
Yufa Zhou
J. Zhang
Qintong Zhang
...
Zhaorun Chen
Bin Wang
W. Li
Conghui He
Linfeng Zhang
VLM
182
8
0
01 Oct 2025
Learning a Zeroth-Order Optimizer for Fine-Tuning LLMs
Kairun Zhang
Xue Yang
Yanjun Zhao
Yifan Sun
Huan Zhang
183
0
0
01 Oct 2025
Automated Structured Radiology Report Generation with Rich Clinical Context
Seongjae Kang
Dong Bok Lee
Juho Jung
Dongseop Kim
Won Hwa Kim
Sunghoon Joo
135
0
0
01 Oct 2025
CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts Generation
Mingzhe Zheng
Dingjie Song
Guanyu Zhou
Jun You
Jiahao Zhan
Xuran Ma
Xinyuan Song
Ser-Nam Lim
Qifeng Chen
Harry Yang
176
2
0
01 Oct 2025
Semantics-Aligned, Curriculum-Driven, and Reasoning-Enhanced Vulnerability Repair Framework
Chengran Yang
Ting Zhang
Jinfeng Jiang
Xin Zhou
Haoye Tian
...
Junkai Chen
Yikun Li
Eng Lieh Ouh
Lwin Khin Shar
David Lo
140
1
0
01 Oct 2025
LongCodeZip: Compress Long Context for Code Language Models
Yuling Shi
Yichun Qian
Hongyu Zhang
Beijun Shen
Xiaodong Gu
152
5
0
01 Oct 2025
OntoLogX: Ontology-Guided Knowledge Graph Extraction from Cybersecurity Logs with Large Language Models
Luca Cotti
Idilio Drago
Anisa Rula
Devis Bianchini
Federico Cerutti
95
0
0
01 Oct 2025
Curiosity-Driven LLM-as-a-judge for Personalized Creative Judgment
Vanya Bannihatti Kumar
Divyanshu Goyal
Akhil Eppa
Neel Bhandari
ELM
LRM
109
0
0
01 Oct 2025
LoRAFusion: Efficient LoRA Fine-Tuning for LLMs
Zhanda Zhu
Qidong Su
Yaoyao Ding
Kevin Song
Shang Wang
Gennady Pekhimenko
MoMe
194
0
0
30 Sep 2025
Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation
Miao Rang
Zhenni Bi
Hang Zhou
Hanting Chen
An Xiao
Tianyu Guo
Kai Han
Xinghao Chen
Yunhe Wang
168
2
0
30 Sep 2025
Latent Thinking Optimization: Your Latent Reasoning Language Model Secretly Encodes Reward Signals in Its Latent Thoughts
Hanwen Du
Yuxin Dong
Xia Ning
LRM
AI4CE
179
4
0
30 Sep 2025
OWL: Geometry-Aware Spatial Reasoning for Audio Large Language Models
Subrata Biswas
Mohammad Nur Hossain Khan
Bashima Islam
VLM
LRM
123
1
0
30 Sep 2025
dParallel: Learnable Parallel Decoding for dLLMs
Zigeng Chen
Gongfan Fang
Xinyin Ma
Ruonan Yu
Xinchao Wang
126
12
0
30 Sep 2025
Effective Model Pruning: Measure The Redundancy of Model Components
Yixuan Wang
Dan Guralnik
Saiedeh Akbari
Warren E. Dixon
63
0
0
30 Sep 2025
BiasBusters: Uncovering and Mitigating Tool Selection Bias in Large Language Models
Thierry Blankenstein
Jialin Yu
Ruoyao Xiao
Vassilis Plachouras
Sunando Sengupta
Philip Torr
Y. Gal
Alasdair Paren
Adel Bibi
117
1
0
30 Sep 2025
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
Peng Liu
H. Shen
Chunxin Fang
Zhicheng Sun
Jiajia Liao
T. Zhao
MLLM
ObjD
VLM
LRM
228
2
0
30 Sep 2025
Adaptive Planning for Multi-Attribute Controllable Summarization with Monte Carlo Tree Search
Sangwon Ryu
Heejin Do
Yunsu Kim
G. G. Lee
Jungseul Ok
169
0
0
30 Sep 2025
FedPOB: Sample-Efficient Federated Prompt Optimization via Bandits
Pingchen Lu
Zhi Hong
Zhiwei Shang
Zhiyong Wang
Yikun Ban
Yao Shu
Min Zhang
Shuang Qiu
Zhongxiang Dai
FedML
159
1
0
29 Sep 2025
Speculative Verification: Exploiting Information Gain to Refine Speculative Decoding
Sungkyun Kim
Jaemin Kim
Dogyung Yoon
Jiho Shin
Junyeol Lee
Jiwon Seo
140
0
0
29 Sep 2025
Multimodal Large Language Models Meet Multimodal Emotion Recognition and Reasoning: A Survey
Yuntao Shou
Tao Meng
Wei Ai
Keqin Li
LRM
215
7
0
29 Sep 2025
Toward a Vision-Language Foundation Model for Medical Data: Multimodal Dataset and Benchmarks for Vietnamese PET/CT Report Generation
H. Nguyen
D. Nguyen
Minh Nguyen
T. Nguyen
Thao Nguyen Truong
...
Quoc Viet Hung Nguyen
Quynh Anh Chau
Hong Son Mai
T. Nguyen
Phi Le Nguyen
VLM
231
4
0
29 Sep 2025
World-Env: Leveraging World Model as a Virtual Environment for VLA Post-Training
Junjin Xiao
Y. Yang
Xinyuan Chang
Ronghan Chen
Feng Xiong
Mu Xu
Wei-Shi Zheng
Qing Zhang
VLM
311
8
0
29 Sep 2025
From Ambiguity to Verdict: A Semiotic-Grounded Multi-Perspective Agent for LLM Logical Reasoning
Yunyao Zhang
Xinglang Zhang
Junxi Sheng
Wenbing Li
Junqing Yu
Wei Yang
Zikai Song
Zikai Song
LRM
288
2
0
29 Sep 2025
LLaDA-MoE: A Sparse MoE Diffusion Language Model
Fengqi Zhu
Zebin You
Yipeng Xing
Zenan Huang
Lin Liu
...
Junbo Zhao
Da Zheng
Chongxuan Li
Jianguo Li
J. Wen
MoE
264
15
0
29 Sep 2025
Previous
1
2
3
4
5
6
...
36
37
38
Next
Page 5 of 38
Page
of 38
Go