Qwen Technical Report

28 September 2023
Jinze Bai
Shuai Bai
Yunfei Chu
Zeyu Cui
Kai Dang
Xiaodong Deng
Yang Fan
Wenbin Ge
Yu Han
Fei Huang
Binyuan Hui
Luo Ji
Mei Li
Junyang Lin
Runji Lin
Dayiheng Liu
Gao Liu
Chengqiang Lu
Keming Lu
Jianxin Ma
Rui Men
Xingzhang Ren
Xuancheng Ren
Chuanqi Tan
Sinan Tan
Jianhong Tu
Peng Wang
Shijie Wang
Wei Wang
Shengguang Wu
Benfeng Xu
Jin Xu
An Yang
Hao Yang
Jian Yang
Shusheng Yang
Yang Yao
Bowen Yu
Hongyi Yuan
Zheng Yuan
Jianwei Zhang
Xinyu Zhang
Yichang Zhang
Zhenru Zhang
Chang Zhou
Jingren Zhou
Xiaohuan Zhou
Tianhang Zhu
    OSLM
ArXiv (abs) | PDF | HTML | HuggingFace (36 upvotes)

Papers citing "Qwen Technical Report"

50 / 1,893 papers shown
Active Model Selection for Large Language Models
Yavuz Durmazkeser
Patrik Okanovic
Andreas Kirsch
Torsten Hoefler
Nezihe Merve Gürel
138
1
0
10 Oct 2025
SeCon-RAG: A Two-Stage Semantic Filtering and Conflict-Free Framework for Trustworthy RAG
Xiaonan Si
Meilin Zhu
Simeng Qin
Lijia Yu
Lijun Zhang
Shuaitong Liu
Xinfeng Li
Ranjie Duan
Yang Liu
Xiaojun Jia
162
2
0
10 Oct 2025
VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation
Shaoqi Dong
Chaoyou Fu
Haihan Gao
Y. Zhang
Chi Yan
...
H. Cao
Yang Gao
Xing Sun
Ran He
Caifeng Shan
VLM
205
2
0
10 Oct 2025
VisuoAlign: Safety Alignment of LVLMs with Multimodal Tree Search
MingSheng Li
Guangze Zhao
Sichen Liu
136
0
0
10 Oct 2025
MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning
Tajamul Ashraf
Umair Nawaz
Abdelrahman M. Shaker
Rao Muhammad Anwer
Philip Torr
Fahad Shahbaz Khan
Salman Khan
232
0
0
09 Oct 2025
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints
Changyao Tian
Hao Li
Gen Luo
Xizhou Zhu
Weijie Su
...
Y. Liu
Lewei Lu
Wenhai Wang
Hongsheng Li
Jifeng Dai
165
2
0
09 Oct 2025
In-Context Clustering with Large Language Models
Ying Wang
Mengye Ren
Andrew Gordon Wilson
173
1
0
09 Oct 2025
JAI-1: A Thai-Centric Large Language Model
Attapol T. Rutherford
Jullajak Karnjanaekarin
Narongkorn Panitsrisit
Pontakorn Trakuekul
Sumana Sumanakul
Natchanon Pollertlam
88
0
0
08 Oct 2025
Learning to Rewrite Prompts for Bootstrapping LLMs on Downstream Tasks
Yuwen Tan
Xiang Xiang
Kun He
John E. Hopcroft
131
0
0
08 Oct 2025
Auto-Stega: An Agent-Driven System for Lifelong Strategy Evolution in LLM-Based Text Steganography
Jiuan Zhou
Yu Cheng
Yuan Xie
Z. Yin
144
7
0
08 Oct 2025
Sunflower: A New Approach To Expanding Coverage of African Languages in Large Language Models
Benjamin Akera
Evelyn Nafula Ouma
Gilbert Yiga
Patrick Walukagga
Phionah Natukunda
...
Imran Sekalala
Nimpamya Janat Namara
Engineer Bainomugisha
Ernest Mwebaze
John Quinn
201
0
0
08 Oct 2025
Inconsistent Affective Reaction: Sentiment of Perception and Opinion in Urban Environments
CAADRIA proceedings (CAADRIA), 2025
Jingfei Huang
Han Tu
253
0
0
08 Oct 2025
Latent Representation Learning in Heavy-Ion Collisions with MaskPoint Transformer
Jing-Zong Zhang
Shuang Guo
Li-Lin Zhu
Lingxiao Wang
Guo-Liang Ma
183
10
0
08 Oct 2025
Populism Meets AI: Advancing Populism Research with LLMs
Eduardo Ryô Tamaki
Julia Chatterley
Grant Mitchell
Semir Dzebo
Cristóbal Sandoval
Levente Littvay
Kirk Hawkins
233
0
0
08 Oct 2025
Automated Repeatable Adversary Threat Emulation with Effects Language (EL)
Suresh Damodaran
Paul D. Rowe
AAML
170
12
0
07 Oct 2025
CDTP: A Large-Scale Chinese Data-Text Pair Dataset for Comprehensive Evaluation of Chinese LLMs
Chengwei Wu
Jiapu Wang
Mingyang Gao
Xingrui Zhuo
Jipeng Guo
...
Haoran Luo
Tianyu Chen
Haoyi Zhou
Shirui Pan
Zechao Li
152
0
0
07 Oct 2025
From Principles to Practice: A Systematic Study of LLM Serving on Multi-core NPUs
Tianhao Zhu
Dahu Feng
Erhu Feng
Yubin Xia
154
1
0
07 Oct 2025
Diversity Is All You Need for Contrastive Learning: Spectral Bounds on Gradient Magnitudes
Peter Ochieng
99
1
0
07 Oct 2025
EmbodiedCoder: Parameterized Embodied Mobile Manipulation via Modern Coding Model
Zefu Lin
Rongxu Cui
Chen Hanning
Xiangyu Wang
Junjia Xu
...
Chen Wenbo
Hui Zhou
Lue Fan
W. Li
Zhaoxiang Zhang
LM&Ro
201
1
0
07 Oct 2025
UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models
Wenhao Guan
Zhikang Niu
Ziyue Jiang
Kaidi Wang
Peijie Chen
Q. Hong
Lin Li
Xie Chen
AuLLM
389
0
0
06 Oct 2025
Retrieval-Augmented Code Generation: A Survey with Focus on Repository-Level Approaches
Yicheng Tao
Yao Qin
Yepang Liu
3DV
193
10
0
06 Oct 2025
TokenFlow: Responsive LLM Text Streaming Serving under Request Burst via Preemptive Scheduling
Junyi Chen
Chuheng Du
Renyuan Liu
Shuochao Yao
Dingtian Yan
Jiang Liao
Shengzhong Liu
Fan Wu
Guihai Chen
196
3
0
03 Oct 2025
LEAML: Label-Efficient Adaptation to Out-of-Distribution Visual Tasks for Multimodal Large Language Models
Ci-Siang Lin
Min-Hung Chen
Yu-Yang Sheng
Y. Wang
VLM
160
0
0
03 Oct 2025
Distributed Low-Communication Training with Decoupled Momentum Optimization
S. Nedelkoski
Alexander Acker
O. Kao
Soeren Becker
Dominik Scheinert
118
0
0
03 Oct 2025
Growing Visual Generative Capacity for Pre-Trained MLLMs
Hanyu Wang
Jiaming Han
Ziyan Yang
Qi Zhao
Shanchuan Lin
Xiangyu Yue
Abhinav Shrivastava
Zhenheng Yang
Hao Chen
VLM
243
1
0
02 Oct 2025
Litespark Technical Report: High-Throughput, Energy-Efficient LLM Training Framework
Nii Osae Osae Dade
Moinul Hossain Rahat
150
0
0
02 Oct 2025
VLA-R1: Enhancing Reasoning in Vision-Language-Action Models
Angen Ye
Zeyu Zhang
Boyuan Wang
Xiaofeng Wang
Dapeng Zhang
Zheng Hua Zhu
LRM VLM
165
15
0
02 Oct 2025
Demystifying the Roles of LLM Layers in Retrieval, Knowledge, and Reasoning
Xinyuan Song
Keyu Wang
Pengxiang Li
L. Yin
Shiwei Liu
334
4
0
02 Oct 2025
LongCodeZip: Compress Long Context for Code Language Models
Yuling Shi
Yichun Qian
Hongyu Zhang
Beijun Shen
Xiaodong Gu
158
11
0
01 Oct 2025
Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs
Yurun Chen
Xavier Hu
Y. Liu
Ziqi Wang
Zeyi Liao
...
Feng Wei
Yuxi Qian
Bo Zheng
Keting Yin
Shengyu Zhang
LLMAG
257
2
0
01 Oct 2025
Semantics-Aligned, Curriculum-Driven, and Reasoning-Enhanced Vulnerability Repair Framework
Chengran Yang
Ting Zhang
Jinfeng Jiang
Xin Zhou
Haoye Tian
...
Junkai Chen
Yikun Li
Eng Lieh Ouh
Lwin Khin Shar
David Lo
151
3
0
01 Oct 2025
NLD-LLM: A systematic framework for evaluating small language transformer models on natural language description
Hamed Jelodar
Mohammad Meymani
Parisa Hamedi
Tochukwu Emmanuel Nwankwo
Samita Bai
Roozbeh Razavi-Far
Ali Ghorbani
134
2
0
01 Oct 2025
CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts Generation
Mingzhe Zheng
Dingjie Song
Guanyu Zhou
Jun You
Jiahao Zhan
Xuran Ma
Xinyuan Song
Ser-Nam Lim
Qifeng Chen
Harry Yang
199
2
0
01 Oct 2025
Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs
Leyla Mirvakhabova
B. Bejnordi
Gaurav Kumar
Hanxue Liang
Wanru Zhao
Paul N. Whatmough
MoE
102
1
0
01 Oct 2025
Automated Structured Radiology Report Generation with Rich Clinical Context
Seongjae Kang
Dong Bok Lee
Juho Jung
Dongseop Kim
Won Hwa Kim
Sunghoon Joo
160
0
0
01 Oct 2025
OntoLogX: Ontology-Guided Knowledge Graph Extraction from Cybersecurity Logs with Large Language Models
Luca Cotti
Idilio Drago
Anisa Rula
Devis Bianchini
Federico Cerutti
95
0
0
01 Oct 2025
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation
Zichen Wen
Shaobo Wang
Yufa Zhou
J. Zhang
Qintong Zhang
...
Zhaorun Chen
Bin Wang
W. Li
Conghui He
Linfeng Zhang
VLM
197
11
0
01 Oct 2025
Curiosity-Driven LLM-as-a-judge for Personalized Creative Judgment
Vanya Bannihatti Kumar
Divyanshu Goyal
Akhil Eppa
Neel Bhandari
ELM LRM
115
0
0
01 Oct 2025
Learning a Zeroth-Order Optimizer for Fine-Tuning LLMs
Kairun Zhang
Xue Yang
Yanjun Zhao
Yifan Sun
Huan Zhang
185
0
0
01 Oct 2025
dParallel: Learnable Parallel Decoding for dLLMs
Zigeng Chen
Gongfan Fang
Xinyin Ma
Ruonan Yu
Xinchao Wang
158
19
0
30 Sep 2025
Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation
Miao Rang
Zhenni Bi
Hang Zhou
Hanting Chen
An Xiao
Tianyu Guo
Kai Han
Xinghao Chen
Yunhe Wang
178
3
0
30 Sep 2025
OWL: Geometry-Aware Spatial Reasoning for Audio Large Language Models
Subrata Biswas
Mohammad Nur Hossain Khan
Bashima Islam
VLM LRM
140
3
0
30 Sep 2025
LoRAFusion: Efficient LoRA Fine-Tuning for LLMs
Zhanda Zhu
Qidong Su
Yaoyao Ding
Kevin Song
Shang Wang
Gennady Pekhimenko
MoMe
203
0
0
30 Sep 2025
Adaptive Planning for Multi-Attribute Controllable Summarization with Monte Carlo Tree Search
Sangwon Ryu
Heejin Do
Yunsu Kim
G. G. Lee
Jungseul Ok
188
0
0
30 Sep 2025
Effective Model Pruning: Measure The Redundancy of Model Components
Yixuan Wang
Dan Guralnik
Saiedeh Akbari
Warren E. Dixon
78
0
0
30 Sep 2025
VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
Peng Liu
H. Shen
Chunxin Fang
Zhicheng Sun
Jiajia Liao
T. Zhao
MLLM ObjD VLM LRM
298
4
0
30 Sep 2025
BiasBusters: Uncovering and Mitigating Tool Selection Bias in Large Language Models
Thierry Blankenstein
Jialin Yu
Ruoyao Xiao
Vassilis Plachouras
Sunando Sengupta
Philip Torr
Y. Gal
Alasdair Paren
Adel Bibi
129
1
0
30 Sep 2025
Latent Thinking Optimization: Your Latent Reasoning Language Model Secretly Encodes Reward Signals in Its Latent Thoughts
Hanwen Du
Yuxin Dong
Xia Ning
LRM AI4CE
214
4
0
30 Sep 2025
FedPOB: Sample-Efficient Federated Prompt Optimization via Bandits
Pingchen Lu
Zhi Hong
Zhiwei Shang
Zhiyong Wang
Yikun Ban
Yao Shu
Min Zhang
Shuang Qiu
Zhongxiang Dai
FedML
196
1
0
29 Sep 2025
ZOO-Prune: Training-Free Token Pruning via Zeroth-Order Gradient Estimation in Vision-Language Models
Youngeun Kim
Youjia Zhang
Huiling Liu
Aecheon Jung
Sunwoo Lee
Sungeun Hong
VLM
183
1
0
29 Sep 2025