ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.04652
  4. Cited By
Yi: Open Foundation Models by 01.AI

Yi: Open Foundation Models by 01.AI

7 March 2024
01. AI
Alex Young
01.AI Alex Young
Bei Chen
Chao Li
Chengen Huang
Guanwei Zhang
Heng Li
Jiangcheng Zhu
Jianqun Chen
Jing Chang
Kaidong Yu
Peng Liu
Qiang Liu
Shawn Yue
Senbin Yang
Shiming Yang
Tao Yu
Wen Xie
Wenhao Huang
Wenhao Huang
Xiaohui Hu
Xiaoyi Ren
Xinyao Niu
Pengcheng Nie
Yuchi Xu
Yudong Liu
Yue Wang
Yuxuan Cai
Zhenyu Gu
Zhiyuan Liu
Zonghong Dai
    OSLM
    LRM
ArXivPDFHTML

Papers citing "Yi: Open Foundation Models by 01.AI"

50 / 389 papers shown
Title
FinDVer: Explainable Claim Verification over Long and Hybrid-Content
  Financial Documents
FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents
Yilun Zhao
Yitao Long
Yuru Jiang
Chengye Wang
Weiyuan Chen
Hongjun Liu
Yiming Zhang
Xiangru Tang
Chen Zhao
Arman Cohan
VLM
23
1
0
08 Nov 2024
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
D. Song
Sicheng Lai
Shunian Chen
Lichao Sun
Benyou Wang
46
0
0
06 Nov 2024
Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under
  Misleading Scenarios
Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios
Yunkai Dang
Mengxi Gao
Yibo Yan
Xin Zou
Yanggan Gu
Aiwei Liu
Xuming Hu
37
4
0
05 Nov 2024
A Multi-Task Role-Playing Agent Capable of Imitating Character
  Linguistic Styles
A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Styles
Siyuan Chen
Q. Si
Chenxu Yang
Yunzhi Liang
Zheng-Shen Lin
Huan Liu
Weiping Wang
30
1
0
04 Nov 2024
What is Wrong with Perplexity for Long-context Language Modeling?
What is Wrong with Perplexity for Long-context Language Modeling?
Lizhe Fang
Yifei Wang
Zhaoyang Liu
Chenheng Zhang
Stefanie Jegelka
Jinyang Gao
Bolin Ding
Yisen Wang
41
4
0
31 Oct 2024
NeuZip: Memory-Efficient Training and Inference with Dynamic Compression
  of Neural Networks
NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks
Yongchang Hao
Yanshuai Cao
Lili Mou
MQ
28
2
0
28 Oct 2024
Improving Model Evaluation using SMART Filtering of Benchmark Datasets
Improving Model Evaluation using SMART Filtering of Benchmark Datasets
Vipul Gupta
Candace Ross
David Pantoja
R. Passonneau
Megan Ung
Adina Williams
34
1
0
26 Oct 2024
MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained
  Visual Document Understanding
MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding
Fengbin Zhu
Ziyang Liu
Xiang Yao Ng
Haohui Wu
W. Wang
Fuli Feng
Chao Wang
Huanbo Luan
Tat-Seng Chua
VLM
27
3
0
25 Oct 2024
ChineseSafe: A Chinese Benchmark for Evaluating Safety in Large Language Models
ChineseSafe: A Chinese Benchmark for Evaluating Safety in Large Language Models
H. Zhang
Hongfu Gao
Qiang Hu
Guanhua Chen
L. Yang
Bingyi Jing
Hongxin Wei
Bing Wang
Haifeng Bai
Lei Yang
AILaw
ELM
38
1
0
24 Oct 2024
CLR-Bench: Evaluating Large Language Models in College-level Reasoning
CLR-Bench: Evaluating Large Language Models in College-level Reasoning
Junnan Dong
Zijin Hong
Yuanchen Bei
Feiran Huang
Xinrun Wang
Xiao Huang
ELM
LRM
20
2
0
23 Oct 2024
A Comprehensive Evaluation of Cognitive Biases in LLMs
A Comprehensive Evaluation of Cognitive Biases in LLMs
Simon Malberg
Roman Poletukhin
Carolin M. Schuster
Georg Groh
ELM
27
5
0
20 Oct 2024
EPIC: Efficient Position-Independent Context Caching for Serving Large Language Models
EPIC: Efficient Position-Independent Context Caching for Serving Large Language Models
Junhao Hu
Wenrui Huang
H. Wang
Weidong Wang
Tiancheng Hu
Qin Zhang
Hao Feng
Xusheng Chen
Yizhou Shan
Tao Xie
RALM
LLMAG
18
2
0
20 Oct 2024
SemiHVision: Enhancing Medical Multimodal Models with a Semi-Human
  Annotated Dataset and Fine-Tuned Instruction Generation
SemiHVision: Enhancing Medical Multimodal Models with a Semi-Human Annotated Dataset and Fine-Tuned Instruction Generation
Junda Wang
Yujan Ting
Eric Z. Chen
Hieu Tran
Hong-ye Yu
Weijing Huang
Terrence Chen
VLM
LM&MA
25
1
0
19 Oct 2024
LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems
LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems
Nan Xu
Xuezhe Ma
LRM
24
3
0
18 Oct 2024
Can MLLMs Understand the Deep Implication Behind Chinese Images?
Can MLLMs Understand the Deep Implication Behind Chinese Images?
Chenhao Zhang
Xi Feng
Yuelin Bai
Xinrun Du
Jinchang Hou
...
Min Yang
Wenhao Huang
Chenghua Lin
Ge Zhang
Shiwen Ni
ELM
VLM
23
3
0
17 Oct 2024
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model
Siwei Wu
Zhongyuan Peng
Xinrun Du
Tuney Zheng
Minghao Liu
...
Zhaoxiang Zhang
Wenhao Huang
Ge Zhang
Chenghua Lin
J. H. Liu
ELM
LLMAG
LRM
AI4CE
29
28
0
17 Oct 2024
Large Language Models as Narrative-Driven Recommenders
Large Language Models as Narrative-Driven Recommenders
Lukas Eberhard
Thorsten Ruprechter
Denis Helic
LRM
17
0
0
17 Oct 2024
When Does Perceptual Alignment Benefit Vision Representations?
When Does Perceptual Alignment Benefit Vision Representations?
Shobhita Sundaram
Stephanie Fu
Lukas Muttenthaler
Netanel Y. Tamir
Lucy Chai
Simon Kornblith
Trevor Darrell
Phillip Isola
41
6
1
14 Oct 2024
BlackDAN: A Black-Box Multi-Objective Approach for Effective and
  Contextual Jailbreaking of Large Language Models
BlackDAN: A Black-Box Multi-Objective Approach for Effective and Contextual Jailbreaking of Large Language Models
Xinyuan Wang
Victor Shea-Jay Huang
Renmiao Chen
Hao Wang
C. Pan
Lei Sha
Minlie Huang
AAML
18
2
0
13 Oct 2024
FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs' Responsiveness to Human Feedback
FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs' Responsiveness to Human Feedback
Y. Li
Miao Zheng
Fan Yang
Guosheng Dong
Bin Cui
Weipeng Chen
Zenan Zhou
Wentao Zhang
ALM
21
5
0
12 Oct 2024
PoisonBench: Assessing Large Language Model Vulnerability to Data
  Poisoning
PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning
Tingchen Fu
Mrinank Sharma
Philip H. S. Torr
Shay B. Cohen
David M. Krueger
Fazl Barez
AAML
29
0
0
11 Oct 2024
Dynamic Multimodal Evaluation with Flexible Complexity by
  Vision-Language Bootstrapping
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Yue Yang
S. Zhang
Wenqi Shao
Kaipeng Zhang
Yi Bin
Yu Wang
Ping Luo
23
0
0
11 Oct 2024
Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles
Exact Byte-Level Probabilities from Tokenized Language Models for FIM-Tasks and Model Ensembles
Buu Phan
Brandon Amos
Itai Gat
Marton Havasi
Matthew Muckley
Karen Ullrich
37
1
0
11 Oct 2024
Packing Analysis: Packing Is More Appropriate for Large Models or
  Datasets in Supervised Fine-tuning
Packing Analysis: Packing Is More Appropriate for Large Models or Datasets in Supervised Fine-tuning
Shuhe Wang
Guoyin Wang
Y. Wang
Jiwei Li
Eduard H. Hovy
Chen Guo
24
1
0
10 Oct 2024
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act
Philipp Guldimann
Alexander Spiridonov
Robin Staab
Nikola Jovanović
Mark Vero
...
Mislav Balunović
Nikola Konstantinov
Pavol Bielik
Petar Tsankov
Martin Vechev
ELM
35
4
0
10 Oct 2024
PositionID: LLMs can Control Lengths, Copy and Paste with Explicit
  Positional Awareness
PositionID: LLMs can Control Lengths, Copy and Paste with Explicit Positional Awareness
Zekun Wang
Feiyu Duan
Yibo Zhang
Wangchunshu Zhou
Ke Xu
Wenhao Huang
Jie Fu
LLMAG
21
1
0
09 Oct 2024
SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration
Heming Xia
Yongqi Li
Jun Zhang
Cunxiao Du
Wenjie Li
LRM
36
4
0
09 Oct 2024
CursorCore: Assist Programming through Aligning Anything
CursorCore: Assist Programming through Aligning Anything
Hao Jiang
Qi Liu
Rui Li
Shengyu Ye
Shijin Wang
34
1
0
09 Oct 2024
Copiloting Diagnosis of Autism in Real Clinical Scenarios via LLMs
Copiloting Diagnosis of Autism in Real Clinical Scenarios via LLMs
Yi Jiang
Qingyang Shen
Shuzhong Lai
Shunyu Qi
Qian Zheng
Lin Yao
Yueming Wang
Gang Pan
18
1
0
08 Oct 2024
Deeper Insights Without Updates: The Power of In-Context Learning Over
  Fine-Tuning
Deeper Insights Without Updates: The Power of In-Context Learning Over Fine-Tuning
Qingyu Yin
Xuzheng He
Luoao Deng
Chak Tou Leong
Fan Wang
Yanzhao Yan
Xiaoyu Shen
Qiang Zhang
19
2
0
07 Oct 2024
Plausibly Problematic Questions in Multiple-Choice Benchmarks for
  Commonsense Reasoning
Plausibly Problematic Questions in Multiple-Choice Benchmarks for Commonsense Reasoning
Shramay Palta
Nishant Balepur
Peter Rankel
Sarah Wiegreffe
Marine Carpuat
Rachel Rudinger
ELM
18
1
0
06 Oct 2024
ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal
  Large Language Models Via Error Detection
ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
Yibo Yan
Shen Wang
Jiahao Huo
Hang Li
B. Li
...
Kun Wang
Hui Xiong
Philip S. Yu
Xuming Hu
Qingsong Wen
LRM
25
13
0
06 Oct 2024
Towards a Benchmark for Large Language Models for Business Process
  Management Tasks
Towards a Benchmark for Large Language Models for Business Process Management Tasks
Kiran Busch
Henrik Leopold
38
0
0
04 Oct 2024
SynCo: Synthetic Hard Negatives in Contrastive Learning for Better
  Unsupervised Visual Representations
SynCo: Synthetic Hard Negatives in Contrastive Learning for Better Unsupervised Visual Representations
Nikolaos Giakoumoglou
Tania Stathaki
SSL
33
0
0
03 Oct 2024
HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
Howard Yen
Tianyu Gao
Minmin Hou
Ke Ding
Daniel Fleischer
Peter Izsak
Moshe Wasserblat
Danqi Chen
ALM
ELM
38
24
0
03 Oct 2024
Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model Ensembling
Determine-Then-Ensemble: Necessity of Top-k Union for Large Language Model Ensembling
Yuxuan Yao
Han Wu
Mingyang Liu
Sichun Luo
Xiongwei Han
Jie Liu
Zhijiang Guo
Linqi Song
37
4
0
03 Oct 2024
From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with
  LLM-Guided Knowledge
From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge
Xiefeng Wu
OffRL
19
1
0
02 Oct 2024
Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion?
Codev-Bench: How Do LLMs Understand Developer-Centric Code Completion?
Zhenyu Pan
Rongyu Cao
Yongchang Cao
Yingwei Ma
Binhua Li
Fei Huang
Han Liu
Yongbin Li
32
4
0
02 Oct 2024
LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models
LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models
Zhenyue Qin
Yu Yin
Dylan Campbell
Xuansheng Wu
Ke Zou
Yih-Chung Tham
Ninghao Liu
Xiuzhen Zhang
Qingyu Chen
36
1
0
02 Oct 2024
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models
Tung-Yu Wu
Pei-Yu Lo
ReLM
LRM
38
2
0
02 Oct 2024
The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs
The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs
Hong Li
Nanxi Li
Yuanjie Chen
Jianbin Zhu
Qinlu Guo
Cewu Lu
Yong-Lu Li
MLLM
34
1
0
02 Oct 2024
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
Zonghang Li
Wenjiao Feng
Mohsen Guizani
Hongfang Yu
26
2
0
01 Oct 2024
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV
  Cache Management
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management
Yi Xiong
Hao Wu
Changxu Shao
Ziqing Wang
Rui Zhang
Yuhong Guo
Junping Zhao
Ke Zhang
Zhenxuan Pan
19
1
0
01 Oct 2024
CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report
  Generation on CheXpert Plus Dataset
CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus Dataset
Xiao Wang
Fuling Wang
Yuehang Li
Qingchuan Ma
Shiao Wang
Bo Jiang
Chuanfu Li
Jin Tang
24
2
0
01 Oct 2024
From Seconds to Hours: Reviewing MultiModal Large Language Models on
  Comprehensive Long Video Understanding
From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding
Heqing Zou
Tianze Luo
Guiyang Xie
Victor
Zhang
...
Guangcong Wang
Juanyang Chen
Zhuochen Wang
Hansheng Zhang
Huaijian Zhang
VLM
21
6
0
27 Sep 2024
MIO: A Foundation Model on Multimodal Tokens
MIO: A Foundation Model on Multimodal Tokens
Zekun Wang
King Zhu
Chunpu Xu
Wangchunshu Zhou
Jiaheng Liu
...
Yuanxing Zhang
Ge Zhang
Ke Xu
Jie Fu
Wenhao Huang
MLLM
AuLLM
35
11
0
26 Sep 2024
HelloBench: Evaluating Long Text Generation Capabilities of Large
  Language Models
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Haoran Que
Feiyu Duan
Liqun He
Yutao Mou
Wangchunshu Zhou
...
Ge Zhang
Junran Peng
Zhaoxiang Zhang
Songyang Zhang
Kai Chen
LM&MA
ELM
VLM
43
11
0
24 Sep 2024
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining
  for Clinical LLMs
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs
Clément Christophe
Tathagata Raha
Svetlana Maslenkova
Muhammad Umar Salman
Praveen K Kanithi
Marco AF Pimentel
Shadab Khan
LM&MA
20
1
0
23 Sep 2024
Phantom of Latent for Large Language and Vision Models
Phantom of Latent for Large Language and Vision Models
Byung-Kwan Lee
Sangyun Chung
Chae Won Kim
Beomchan Park
Yong Man Ro
VLM
LRM
34
3
0
23 Sep 2024
Pareto-Optimized Open-Source LLMs for Healthcare via Context Retrieval
Pareto-Optimized Open-Source LLMs for Healthcare via Context Retrieval
Jordi Bayarri-Planas
Ashwin Kumar Gururajan
Dario Garcia-Gasulla
19
3
0
23 Sep 2024
Previous
12345678
Next