Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales

Terms and Conditions

Twitter GitHub LinkedIn Bluesky Youtube

© 2026 ResearchTrend.AI, All rights reserved.

Home
Papers
2309.16609
Cited By

Qwen Technical Report

Qwen Technical Report

28 September 2023

Jinze Bai

Shuai Bai

Yunfei Chu

Zeyu Cui

Xiaodong Deng

Yang Fan

Yu Han

Fei Huang

Luo Ji

Mei Li

Runji Lin

Dayiheng Liu

Gao Liu

Jianxin Ma

Rui Men

Xingzhang Ren

Xuancheng Ren

Chuanqi Tan

Sinan Tan

Jianhong Tu

Peng Wang

Shijie Wang

Shengguang Wu

Benfeng Xu

Jin Xu

Hao Yang

Shusheng Yang

Yang Yao

Bowen Yu

Hongyi Yuan

Jianwei Zhang

Yichang Zhang

Zhenru Zhang

Chang Zhou

Jingren Zhou

Xiaohuan Zhou

Tianhang Zhu

ArXiv (abs)PDF HTML HuggingFace (36 upvotes)

Papers citing "Qwen Technical Report"

50 / 1,893 papers shown

Active Model Selection for Large Language Models

Active Model Selection for Large Language Models

Yavuz Durmazkeser

Patrik Okanovic

Torsten Hoefler

Nezihe Merve Gürel

138

1

0

10 Oct 2025

SeCon-RAG: A Two-Stage Semantic Filtering and Conflict-Free Framework for Trustworthy RAG

SeCon-RAG: A Two-Stage Semantic Filtering and Conflict-Free Framework for Trustworthy RAG

162

2

0

10 Oct 2025

VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation

VITA-VLA: Efficiently Teaching Vision-Language Models to Act via Action Expert Distillation

...

205

2

0

10 Oct 2025

VisuoAlign: Safety Alignment of LVLMs with Multimodal Tree Search

VisuoAlign: Safety Alignment of LVLMs with Multimodal Tree Search

136

0

0

10 Oct 2025

MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning

MATRIX: Multimodal Agent Tuning for Robust Tool-Use Reasoning

Abdelrahman M. Shaker

Rao Muhammad Anwer

Fahad Shahbaz Khan

232

0

0

09 Oct 2025

NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints

NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints

...

165

2

0

09 Oct 2025

In-Context Clustering with Large Language Models

In-Context Clustering with Large Language Models

Andrew Gordon Wilson

173

1

0

09 Oct 2025

JAI-1: A Thai-Centric Large Language Model

JAI-1: A Thai-Centric Large Language Model

Attapol T. Rutherford

Jullajak Karnjanaekarin

Narongkorn Panitsrisit

Pontakorn Trakuekul

Sumana Sumanakul

Natchanon Pollertlam

88

0

0

08 Oct 2025

Learning to Rewrite Prompts for Bootstrapping LLMs on Downstream Tasks

Learning to Rewrite Prompts for Bootstrapping LLMs on Downstream Tasks

John E. Hopcroft

131

0

0

08 Oct 2025

Auto-Stega: An Agent-Driven System for Lifelong Strategy Evolution in LLM-Based Text Steganography

Auto-Stega: An Agent-Driven System for Lifelong Strategy Evolution in LLM-Based Text Steganography

144

7

0

08 Oct 2025

Sunflower: A New Approach To Expanding Coverage of African Languages in Large Language Models

Sunflower: A New Approach To Expanding Coverage of African Languages in Large Language Models

Evelyn Nafula Ouma

Patrick Walukagga

Phionah Natukunda

...

Nimpamya Janat Namara

Engineer Bainomugisha

201

0

0

08 Oct 2025

Inconsistent Affective Reaction: Sentiment of Perception and Opinion in Urban Environments

Inconsistent Affective Reaction: Sentiment of Perception and Opinion in Urban EnvironmentsCAADRIA proceedings (CAADRIA), 2025

253

0

0

08 Oct 2025

Latent Representation Learning in Heavy-Ion Collisions with MaskPoint Transformer

Latent Representation Learning in Heavy-Ion Collisions with MaskPoint Transformer

Jing-Zong Zhang

183

10

0

08 Oct 2025

Populism Meets AI: Advancing Populism Research with LLMs

Populism Meets AI: Advancing Populism Research with LLMs

Eduardo Ryô Tamaki

Eduardo Ryô Tamaki

Julia Chatterley

Cristóbal Sandoval

Levente Littvay

233

0

0

08 Oct 2025

Automated Repeatable Adversary Threat Emulation with Effects Language (EL)

Automated Repeatable Adversary Threat Emulation with Effects Language (EL)

Suresh Damodaran

170

12

0

07 Oct 2025

CDTP: A Large-Scale Chinese Data-Text Pair Dataset for Comprehensive Evaluation of Chinese LLMs

...

152

0

0

07 Oct 2025

From Principles to Practice: A Systematic Study of LLM Serving on Multi-core NPUs

From Principles to Practice: A Systematic Study of LLM Serving on Multi-core NPUs

154

1

0

07 Oct 2025

Diversity Is All You Need for Contrastive Learning: Spectral Bounds on Gradient Magnitudes

Diversity Is All You Need for Contrastive Learning: Spectral Bounds on Gradient Magnitudes

99

1

0

07 Oct 2025

EmbodiedCoder: Parameterized Embodied Mobile Manipulation via Modern Coding Model

EmbodiedCoder: Parameterized Embodied Mobile Manipulation via Modern Coding Model

...

Zhaoxiang Zhang

201

1

0

07 Oct 2025

UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models

UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models

389

0

0

06 Oct 2025

Retrieval-Augmented Code Generation: A Survey with Focus on Repository-Level Approaches

Retrieval-Augmented Code Generation: A Survey with Focus on Repository-Level Approaches

193

10

0

06 Oct 2025

TokenFlow: Responsive LLM Text Streaming Serving under Request Burst via Preemptive Scheduling

TokenFlow: Responsive LLM Text Streaming Serving under Request Burst via Preemptive Scheduling

196

3

0

03 Oct 2025

LEAML: Label-Efficient Adaptation to Out-of-Distribution Visual Tasks for Multimodal Large Language Models

LEAML: Label-Efficient Adaptation to Out-of-Distribution Visual Tasks for Multimodal Large Language Models

160

0

0

03 Oct 2025

Distributed Low-Communication Training with Decoupled Momentum Optimization

Distributed Low-Communication Training with Decoupled Momentum Optimization

Alexander Acker

Dominik Scheinert

118

0

0

03 Oct 2025

Growing Visual Generative Capacity for Pre-Trained MLLMs

Growing Visual Generative Capacity for Pre-Trained MLLMs

Abhinav Shrivastava

243

1

0

02 Oct 2025

Litespark Technical Report: High-Throughput, Energy-Efficient LLM Training Framework

Litespark Technical Report: High-Throughput, Energy-Efficient LLM Training Framework

Nii Osae Osae Dade

Moinul Hossain Rahat

150

0

0

02 Oct 2025

VLA-R1: Enhancing Reasoning in Vision-Language-Action Models

VLA-R1: Enhancing Reasoning in Vision-Language-Action Models

165

15

0

02 Oct 2025

Demystifying the Roles of LLM Layers in Retrieval, Knowledge, and Reasoning

Demystifying the Roles of LLM Layers in Retrieval, Knowledge, and Reasoning

334

4

0

02 Oct 2025

LongCodeZip: Compress Long Context for Code Language Models

LongCodeZip: Compress Long Context for Code Language Models

158

11

0

01 Oct 2025

Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

...

257

2

0

01 Oct 2025

Semantics-Aligned, Curriculum-Driven, and Reasoning-Enhanced Vulnerability Repair Framework

Semantics-Aligned, Curriculum-Driven, and Reasoning-Enhanced Vulnerability Repair Framework

...

151

3

0

01 Oct 2025

NLD-LLM: A systematic framework for evaluating small language transformer models on natural language description

NLD-LLM: A systematic framework for evaluating small language transformer models on natural language description

Mohammad Meymani

Tochukwu Emmanuel Nwankwo

Roozbeh Razavi-Far

134

2

0

01 Oct 2025

CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts Generation

CML-Bench: A Framework for Evaluating and Enhancing LLM-Powered Movie Scripts Generation

199

2

0

01 Oct 2025

Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs

Dirichlet-Prior Shaping: Guiding Expert Specialization in Upcycled MoEs

Leyla Mirvakhabova

Paul N. Whatmough

102

1

0

01 Oct 2025

Automated Structured Radiology Report Generation with Rich Clinical Context

Automated Structured Radiology Report Generation with Rich Clinical Context

160

0

0

01 Oct 2025

OntoLogX: Ontology-Guided Knowledge Graph Extraction from Cybersecurity Logs with Large Language Models

OntoLogX: Ontology-Guided Knowledge Graph Extraction from Cybersecurity Logs with Large Language Models

Devis Bianchini

Federico Cerutti

95

0

0

01 Oct 2025

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

...

197

11

0

01 Oct 2025

Curiosity-Driven LLM-as-a-judge for Personalized Creative Judgment

Curiosity-Driven LLM-as-a-judge for Personalized Creative Judgment

Vanya Bannihatti Kumar

Divyanshu Goyal

115

0

0

01 Oct 2025

Learning a Zeroth-Order Optimizer for Fine-Tuning LLMs

Learning a Zeroth-Order Optimizer for Fine-Tuning LLMs

185

0

0

01 Oct 2025

dParallel: Learnable Parallel Decoding for dLLMs

dParallel: Learnable Parallel Decoding for dLLMs

158

19

0

30 Sep 2025

Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation

Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation

178

3

0

30 Sep 2025

OWL: Geometry-Aware Spatial Reasoning for Audio Large Language Models

OWL: Geometry-Aware Spatial Reasoning for Audio Large Language Models

Mohammad Nur Hossain Khan

140

3

0

30 Sep 2025

LoRAFusion: Efficient LoRA Fine-Tuning for LLMs

LoRAFusion: Efficient LoRA Fine-Tuning for LLMs

Gennady Pekhimenko

203

0

0

30 Sep 2025

Adaptive Planning for Multi-Attribute Controllable Summarization with Monte Carlo Tree Search

Adaptive Planning for Multi-Attribute Controllable Summarization with Monte Carlo Tree Search

188

0

0

30 Sep 2025

Effective Model Pruning: Measure The Redundancy of Model Components

Effective Model Pruning: Measure The Redundancy of Model Components

Warren E. Dixon

78

0

0

30 Sep 2025

VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs

VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs

MLLM ObjD VLM LRM

298

4

0

30 Sep 2025

BiasBusters: Uncovering and Mitigating Tool Selection Bias in Large Language Models

BiasBusters: Uncovering and Mitigating Tool Selection Bias in Large Language Models

Thierry Blankenstein

Vassilis Plachouras

Sunando Sengupta

129

1

0

30 Sep 2025

Latent Thinking Optimization: Your Latent Reasoning Language Model Secretly Encodes Reward Signals in Its Latent Thoughts

Latent Thinking Optimization: Your Latent Reasoning Language Model Secretly Encodes Reward Signals in Its Latent Thoughts

214

4

0

30 Sep 2025

FedPOB: Sample-Efficient Federated Prompt Optimization via Bandits

FedPOB: Sample-Efficient Federated Prompt Optimization via Bandits

196

1

0

29 Sep 2025

ZOO-Prune: Training-Free Token Pruning via Zeroth-Order Gradient Estimation in Vision-Language Models

ZOO-Prune: Training-Free Token Pruning via Zeroth-Order Gradient Estimation in Vision-Language Models

183

1

0

29 Sep 2025

1 2 3 4 5 6...36 37 38

Page 5 of 38

Pageof 38