ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.16609
  4. Cited By
Qwen Technical Report

Qwen Technical Report

28 September 2023
Jinze Bai
Shuai Bai
Yunfei Chu
Zeyu Cui
Kai Dang
Xiaodong Deng
Yang Fan
Wenbin Ge
Yu Han
Fei Huang
Binyuan Hui
Luo Ji
Mei Li
Junyang Lin
Runji Lin
Dayiheng Liu
Gao Liu
Chengqiang Lu
Keming Lu
Jianxin Ma
Rui Men
Xingzhang Ren
Xuancheng Ren
Chuanqi Tan
Sinan Tan
Jianhong Tu
Peng Wang
Shijie Wang
Wei Wang
Shengguang Wu
Benfeng Xu
Jin Xu
An Yang
Hao Yang
Jian Yang
Shusheng Yang
Yang Yao
Bowen Yu
Hongyi Yuan
Zheng Yuan
Jianwei Zhang
Xinyu Zhang
Yichang Zhang
Zhenru Zhang
Chang Zhou
Jingren Zhou
Xiaohuan Zhou
Tianhang Zhu
    OSLM
ArXiv (abs)PDFHTMLHuggingFace (36 upvotes)

Papers citing "Qwen Technical Report"

50 / 1,888 papers shown
DeCo: Decoupling Token Compression from Semantic Abstraction in
  Multimodal Large Language Models
DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models
Linli Yao
Lei Li
Shuhuai Ren
Lean Wang
Yuanxin Liu
Xu Sun
Lu Hou
216
59
0
31 May 2024
Enhancing Noise Robustness of Retrieval-Augmented Language Models with
  Adaptive Adversarial Training
Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training
Feiteng Fang
Yuelin Bai
Shiwen Ni
Min Yang
Xiaojun Chen
Ruifeng Xu
AAMLRALM
354
74
0
31 May 2024
Provably Efficient Interactive-Grounded Learning with Personalized
  Reward
Provably Efficient Interactive-Grounded Learning with Personalized Reward
Mengxiao Zhang
Yuheng Zhang
Haipeng Luo
Paul Mineiro
208
1
0
31 May 2024
OR-Bench: An Over-Refusal Benchmark for Large Language Models
OR-Bench: An Over-Refusal Benchmark for Large Language Models
Justin Cui
Wei-Lin Chiang
Ion Stoica
Cho-Jui Hsieh
ALM
738
97
0
31 May 2024
Mind the Inconspicuous: Revealing the Hidden Weakness in Aligned LLMs' Refusal Boundaries
Mind the Inconspicuous: Revealing the Hidden Weakness in Aligned LLMs' Refusal Boundaries
Jiahao Yu
Haozheng Luo
Jerry Yao-Chieh Hu
Wenbo Guo
Han Liu
Xinyu Xing
329
21
0
31 May 2024
Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large
  Language Models Reasoning
Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning
Xinlu Zhang
Zhi Chen
Xi Ye
Xianjun Yang
Lichang Chen
William Y. Wang
Linda R. Petzold
LRM
340
30
0
30 May 2024
Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles
  and Committee Discussions
Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions
Ruochen Zhao
Wenxuan Zhang
Yew Ken Chia
Deli Zhao
Lidong Bing
268
9
0
30 May 2024
TAIA: Large Language Models are Out-of-Distribution Data Learners
TAIA: Large Language Models are Out-of-Distribution Data Learners
Shuyang Jiang
Yusheng Liao
Ya Zhang
Yu Wang
Yanfeng Wang
229
7
0
30 May 2024
Would I Lie To You? Inference Time Alignment of Language Models using
  Direct Preference Heads
Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads
Avelina Asada Hadji-Kyriacou
Ognjen Arandjelović
154
3
0
30 May 2024
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model
  Series
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series
Ge Zhang
Scott Qu
Jiaheng Liu
Chenchen Zhang
Chenghua Lin
...
Zi-Kai Zhao
Jiajun Zhang
Wanli Ouyang
Wenhao Huang
Lei Ma
ELM
318
72
0
29 May 2024
PediatricsGPT: Large Language Models as Chinese Medical Assistants for
  Pediatric Applications
PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications
Dingkang Yang
Jinjie Wei
Dongling Xiao
Shunli Wang
Tong Wu
...
Yue Jiang
Qingyao Xu
Ke Li
Peng Zhai
Lihua Zhang
LM&MA
322
30
0
29 May 2024
Evaluating the External and Parametric Knowledge Fusion of Large
  Language Models
Evaluating the External and Parametric Knowledge Fusion of Large Language Models
Hao Zhang
Yuyang Zhang
Xiaoguang Li
Wenxuan Shi
Haonan Xu
...
Yasheng Wang
Lifeng Shang
Qun Liu
Yong Liu
Ruiming Tang
KELM
246
7
0
29 May 2024
ViG: Linear-complexity Visual Sequence Learning with Gated Linear
  Attention
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention
Bencheng Liao
Xinggang Wang
Lianghui Zhu
Qian Zhang
Chang Huang
310
8
0
28 May 2024
fMRI predictors based on language models of increasing complexity
  recover brain left lateralization
fMRI predictors based on language models of increasing complexity recover brain left lateralization
Laurent Bonnasse-Gahot
Christophe Pallier
154
11
0
28 May 2024
Exploring Activation Patterns of Parameters in Language Models
Exploring Activation Patterns of Parameters in Language Models
Yudong Wang
Damai Dai
Zhifang Sui
173
5
0
28 May 2024
C$^{3}$Bench: A Comprehensive Classical Chinese Understanding Benchmark
  for Large Language Models
C3^{3}3Bench: A Comprehensive Classical Chinese Understanding Benchmark for Large Language Models
Jiahuan Cao
Yongxin Shi
Dezhi Peng
Yang Liu
Lianwen Jin
ELM
232
0
0
28 May 2024
Recent advances in text embedding: A Comprehensive Review of
  Top-Performing Methods on the MTEB Benchmark
Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark
Hongliu Cao
AI4TS
331
31
0
27 May 2024
Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization
Tokenization Matters! Degrading Large Language Models through Challenging Their Tokenization
Dixuan Wang
Yanda Li
Junyuan Jiang
Zepeng Ding
Ziqin Luo
Guochao Jiang
Jiaqing Liang
Deqing Yang
491
34
0
27 May 2024
ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation
ReflectionCoder: Learning from Reflection Sequence for Enhanced One-off Code Generation
Houxing Ren
Mingjie Zhan
Zhongyuan Wu
Aojun Zhou
Junting Pan
Jiaming Song
SyDa
415
12
0
27 May 2024
TokenUnify: Scaling Up Autoregressive Pretraining for Neuron Segmentation
TokenUnify: Scaling Up Autoregressive Pretraining for Neuron Segmentation
Yinda Chen
Haoyuan Shi
Xiaoyu Liu
Te Shi
Ruobing Zhang
Dong Liu
Zhiwei Xiong
Feng Wu
419
12
0
27 May 2024
Automatically Generating Numerous Context-Driven SFT Data for LLMs
  across Diverse Granularity
Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
Shanghaoran Quan
244
6
0
26 May 2024
SED: Self-Evaluation Decoding Enhances Large Language Models for Better
  Generation
SED: Self-Evaluation Decoding Enhances Large Language Models for Better Generation
Ziqin Luo
Haixia Han
Haokun Zhao
Guochao Jiang
Chengyu Du
Tingyun Li
Jiaqing Liang
Deqing Yang
Yanghua Xiao
166
6
0
26 May 2024
ConStat: Performance-Based Contamination Detection in Large Language
  Models
ConStat: Performance-Based Contamination Detection in Large Language Models
Jasper Dekoninck
Mark Niklas Muller
Martin Vechev
167
17
0
25 May 2024
Streaming Long Video Understanding with Large Language Models
Streaming Long Video Understanding with Large Language Models
Rui Qian
Xiao-wen Dong
Pan Zhang
Yuhang Zang
Shuangrui Ding
Dahua Lin
Yuan Liu
VLM
257
113
0
25 May 2024
GECKO: Generative Language Model for English, Code and Korean
GECKO: Generative Language Model for English, Code and Korean
Sungwoo Oh
Donggyu Kim
VLM
181
0
0
24 May 2024
Continuously Learning, Adapting, and Improving: A Dual-Process Approach
  to Autonomous Driving
Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving
Jianbiao Mei
Yukai Ma
Xuemeng Yang
Licheng Wen
Xinyu Cai
...
Min Dou
Ding Wang
Xiaoling Wang
Yong-Jin Liu
Yu Qiao
196
25
0
24 May 2024
Linearly Controlled Language Generation with Performative Guarantees
Linearly Controlled Language Generation with Performative Guarantees
Emily Cheng
Marco Baroni
378
13
0
24 May 2024
Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models
Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models
Jingcheng Deng
Zihao Wei
Liang Pang
Hanxing Ding
Huawei Shen
Xueqi Cheng
KELM
238
2
0
24 May 2024
M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models
M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models
Hongyu Wang
Jiayu Xu
Senwei Xie
Ruiping Wang
Jialin Li
Zhaojie Xie
Bin Zhang
Chuyan Xiong
Xilin Chen
ELMVLMLRM
411
9
0
24 May 2024
Bayesian WeakS-to-Strong from Text Classification to Generation
Bayesian WeakS-to-Strong from Text Classification to Generation
Ziyun Cui
Ziyang Zhang
Wen Wu
Wen Wu
Chao Zhang
367
5
0
24 May 2024
AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings
AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings
R. Reddy
Omar Attia
Yunyao Li
Heng Ji
Saloni Potdar
146
1
0
23 May 2024
Extracting Prompts by Inverting LLM Outputs
Extracting Prompts by Inverting LLM Outputs
Collin Zhang
John X. Morris
Vitaly Shmatikov
233
39
0
23 May 2024
Linking In-context Learning in Transformers to Human Episodic Memory
Linking In-context Learning in Transformers to Human Episodic Memory
Ji-An Li
Corey Y. Zhou
M. Benna
Marcelo G. Mattar
171
13
0
23 May 2024
AnalogCoder: Analog Circuit Design via Training-Free Code Generation
AnalogCoder: Analog Circuit Design via Training-Free Code GenerationAAAI Conference on Artificial Intelligence (AAAI), 2024
Yao Lai
Sungyoung Lee
Guojin Chen
Souradip Poddar
Mengkang Hu
Yao Lai
Ping Luo
335
79
0
23 May 2024
Base of RoPE Bounds Context Length
Base of RoPE Bounds Context LengthNeural Information Processing Systems (NeurIPS), 2024
Xin Men
Mingyu Xu
Bingning Wang
Qingyu Zhang
Hongyu Lin
Xianpei Han
Weipeng Chen
239
41
0
23 May 2024
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training
  Small Data Synthesis Models
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis ModelsNeural Information Processing Systems (NeurIPS), 2024
Kun Zhou
Beichen Zhang
Jiapeng Wang
Zhipeng Chen
Wayne Xin Zhao
Jing Sha
Zhichao Sheng
Shijin Wang
Ji-Rong Wen
SyDaLRM
329
48
0
23 May 2024
Focus Anywhere for Fine-grained Multi-page Document Understanding
Focus Anywhere for Fine-grained Multi-page Document Understanding
Chenglong Liu
Haoran Wei
Jinyue Chen
Lingyu Kong
Zheng Ge
Zining Zhu
Liang Zhao
Jian‐Yuan Sun
Chunrui Han
Xiangyu Zhang
177
44
0
23 May 2024
Federated Domain-Specific Knowledge Transfer on Large Language Models
  Using Synthetic Data
Federated Domain-Specific Knowledge Transfer on Large Language Models Using Synthetic Data
Haoran Li
Xinyuan Zhao
Dadi Guo
Hanlin Gu
Huiping Zhuang
Yuxing Han
Yangqiu Song
Lixin Fan
Qiang Yang
195
4
0
23 May 2024
Super Tiny Language Models
Super Tiny Language Models
Dylan Hillier
Leon Guertler
Cheston Tan
Palaash Agrawal
Ruirui Chen
Bobby Cheng
294
10
0
23 May 2024
Unveiling the Tapestry of Consistency in Large Vision-Language Models
Unveiling the Tapestry of Consistency in Large Vision-Language ModelsNeural Information Processing Systems (NeurIPS), 2024
Yuan Zhang
Fei Xiao
Tao Huang
Chun-Kai Fan
Hongyuan Dong
Jiawen Li
Jiacong Wang
Kuan Cheng
Shanghang Zhang
Haoyuan Guo
341
21
0
23 May 2024
Getting More from Less: Large Language Models are Good Spontaneous
  Multilingual Learners
Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners
Shimao Zhang
Changjiang Gao
Wenhao Zhu
Jiajun Chen
Xue Han
Xue Han
Junlan Feng
Chao Deng
Shujian Huang
246
13
0
22 May 2024
Dense Connector for MLLMs
Dense Connector for MLLMs
Huanjin Yao
Wenhao Wu
Taojiannan Yang
Yuxin Song
Mengxi Zhang
Haocheng Feng
Yifan Sun
Zhiheng Li
Wanli Ouyang
Jingdong Wang
MLLMVLM
224
39
0
22 May 2024
ECLIPSE: Semantic Entropy-LCS for Cross-Lingual Industrial Log Parsing
ECLIPSE: Semantic Entropy-LCS for Cross-Lingual Industrial Log Parsing
Wei Zhang
Xianfu Cheng
Yi Zhang
Zhiqiang Wang
Hongcheng Guo
...
Xi Yin
Xiangyuan Guan
Xu Shi
Liangfan Zheng
Bo Zhang
238
9
0
22 May 2024
360Zhinao Technical Report
360Zhinao Technical Report
360Zhinao Team
221
0
0
22 May 2024
CG-FedLLM: How to Compress Gradients in Federated Fune-tuning for Large Language Models
CG-FedLLM: How to Compress Gradients in Federated Fune-tuning for Large Language Models
Huiwen Wu
Xiaohan Li
Deyi Zhang
Xiaohan Li
Yan Han
Puning Zhao
FedML
278
2
0
22 May 2024
MathBench: Evaluating the Theory and Application Proficiency of LLMs
  with a Hierarchical Mathematics Benchmark
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark
Hongwei Liu
Zilong Zheng
Yuxuan Qiao
Haodong Duan
Zhiwei Fei
Fengzhe Zhou
Wenwei Zhang
Songyang Zhang
Dahua Lin
Kai-xiang Chen
231
116
0
20 May 2024
Imp: Highly Capable Large Multimodal Models for Mobile Devices
Imp: Highly Capable Large Multimodal Models for Mobile Devices
Zhenwei Shao
Zhou Yu
Jun Yu
Xuecheng Ouyang
Lihao Zheng
Zhenbiao Gai
Mingyang Wang
Jiajun Ding
272
23
0
20 May 2024
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering
Jingqun Tang
Qi-dong Liu
Yongjie Ye
Jinghui Lu
Shubo Wei
...
Hao Liu
Xiang Bai
Can Huang
Xiang Bai
Can Huang
793
50
0
20 May 2024
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
Minghao Wu
Jiahao Xu
Yulin Yuan
Gholamreza Haffari
Longyue Wang
Weihua Luo
Kaifu Zhang
LLMAG
600
43
0
20 May 2024
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of ExpertsIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Yunxin Li
Shenyuan Jiang
Baotian Hu
Longyue Wang
Wanqi Zhong
Tong Lu
Lin Ma
Min Zhang
MoE
238
100
0
18 May 2024
Previous
123...313233...363738
Next
Page 32 of 38
Pageof 38