Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.05100
Cited By
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model
9 November 2022
BigScience Workshop
:
Teven Le Scao
Angela Fan
Christopher Akiki
Ellie Pavlick
Suzana Ilić
Daniel Hesslow
Roman Castagné
A. Luccioni
François Yvon
Matthias Gallé
J. Tow
Alexander M. Rush
Stella Biderman
Albert Webson
Pawan Sasanka Ammanamanchi
Thomas Wang
Benoît Sagot
Niklas Muennighoff
Albert Villanova del Moral
Olatunji Ruwase
Rachel Bawden
Stas Bekman
Angelina McMillan-Major
Iz Beltagy
Huu Nguyen
Lucile Saulnier
Samson Tan
Pedro Ortiz Suarez
Victor Sanh
Hugo Laurenccon
Yacine Jernite
Julien Launay
Margaret Mitchell
Colin Raffel
Aaron Gokaslan
Adi Simhi
Aitor Soroa Etxabe
Alham Fikri Aji
Amit Alfassy
Anna Rogers
Ariel Kreisberg Nitzav
Canwen Xu
Chenghao Mou
Chris C. Emezue
Christopher Klamm
Colin Leong
Daniel Alexander van Strien
David Ifeoluwa Adelani
Dragomir R. Radev
E. G. Ponferrada
Efrat Levkovizh
Ethan Kim
Eyal Natan
F. Toni
Gérard Dupont
Germán Kruszewski
Giada Pistilli
Hady ElSahar
Hamza Benyamina
H. Tran
Ian Yu
Idris Abdulmumin
Isaac Johnson
Itziar Gonzalez-Dios
Javier de la Rosa
Jenny Chim
Jesse Dodge
Jian Zhu
Jonathan Chang
Jorg Frohberg
Josephine Tobing
J. Bhattacharjee
Khalid Almubarak
Kimbo Chen
Kyle Lo
Leandro von Werra
Leon Weber
Long Phan
Loubna Ben Allal
Ludovic Tanguy
Manan Dey
M. Muñoz
Maraim Masoud
María Grandury
Mario vSavsko
Max Huang
Maximin Coavoux
Mayank Singh
Mike Tian-Jian Jiang
Minh Chien Vu
M. A. Jauhar
Mustafa Ghaleb
Nishant Subramani
Nora Kassner
Nurulaqilla Khamis
Olivier Nguyen
Omar Espejel
Ona de Gibert
Paulo Villegas
Peter Henderson
Pierre Colombo
Priscilla Amuok
Quentin Lhoest
Rheza Harliman
Rishi Bommasani
R. López
Rui Ribeiro
Salomey Osei
S. Pyysalo
Sebastian Nagel
Shamik Bose
Shamsuddeen Hassan Muhammad
Shanya Sharma
Shayne Longpre
Somaieh Nikpoor
S. Silberberg
S. Pai
S. Zink
Tiago Timponi Torrent
Timo Schick
Tristan Thrush
V. Danchev
Vassilina Nikoulina
Veronika Laippala
Violette Lepercq
V. Prabhu
Zaid Alyafeai
Zeerak Talat
Arun Raja
Benjamin Heinzerling
Chenglei Si
Davut Emre Taşar
Elizabeth Salesky
Sabrina J. Mielke
Wilson Y. Lee
Abheesht Sharma
Andrea Santilli
Antoine Chaffin
Arnaud Stiegler
Debajyoti Datta
Eliza Szczechla
Gunjan Chhablani
Han Wang
Harshit Pandey
Hendrik Strobelt
Jason Alan Fries
Jos Rozen
Leo Gao
Lintang Sutawika
M Saiful Bari
Maged S. Al-Shaibani
Matteo Manica
Nihal V. Nayak
Ryan Teehan
Samuel Albanie
Sheng Shen
Srulik Ben-David
Stephen H. Bach
Taewoon Kim
T. Bers
Thibault Févry
Trishala Neeraj
Urmish Thakker
Vikas Raunak
Xiang Tang
Zheng-Xin Yong
Zhiqing Sun
Shaked Brody
Y. Uri
Hadar Tojarieh
Adam Roberts
Hyung Won Chung
Jaesung Tae
Jason Phang
Ofir Press
Conglong Li
Deepak Narayanan
Hatim Bourfoune
Jared Casper
Jeff Rasley
Max Ryabinin
Mayank Mishra
Minjia Zhang
M. Shoeybi
Myriam Peyrounette
N. Patry
Nouamane Tazi
Omar Sanseviero
Patrick von Platen
Pierre Cornette
Pierre Franccois Lavallée
Rémi Lacroix
Samyam Rajbhandari
Sanchit Gandhi
Shaden Smith
S. Requena
Suraj Patil
Tim Dettmers
Ahmed Baruwa
Amanpreet Singh
Anastasia Cheveleva
Anne-Laure Ligozat
Arjun Subramonian
Aurélie Névéol
Charles Lovering
Daniel H Garrette
D. Tunuguntla
Ehud Reiter
Ekaterina Taktasheva
E. Voloshina
Eli Bogdanov
Genta Indra Winata
Hailey Schoelkopf
Jan-Christoph Kalo
Jekaterina Novikova
Jessica Zosa Forde
Zdenvek Kasner
Jungo Kasai
Ken Kawamura
Liam Hazan
Marine Carpuat
Miruna Clinciu
Najoung Kim
Newton Cheng
O. Serikov
Omer Antverg
Oskar van der Wal
Rui Zhang
Ruochen Zhang
Sebastian Gehrmann
Shachar Mirkin
S. Pais
Tatiana Shavrina
Thomas Scialom
Tian Yun
Tomasz Limisiewicz
Verena Rieser
Vitaly Protasov
Vladislav Mikhailov
Yada Pruksachatkun
Yonatan Belinkov
Zachary Bamberger
Zdeněk Kasner
Xiangru Tang
A. Pestana
A. Feizpour
Ammar Khan
Amy Faranak
A. Santos
Anthony Hevia
Antigona Unldreaj
Arash Aghagol
Arezoo Abdollahi
A. Tammour
A. HajiHosseini
Bahareh Behroozi
Benjamin Ayoade Ajibade
B. Saxena
Carlos Muñoz Ferrandis
Daniel McDuff
Danish Contractor
D. Lansky
Davis David
Douwe Kiela
D. A. Nguyen
Edward Tan
Emi Baylor
Ezinwanne Ozoani
F. Mirza
Frankline Ononiwu
Habib Rezanejad
H.A. Jones
Indrani Bhattacharya
Irene Solaiman
Irina Sedenko
I. Nejadgholi
J. Passmore
Joshua Seltzer
Julio Bonis Sanz
Lívia Dutra
Mairon Samagaio
Maraim Elbadri
Margot Mieskes
Marissa Gerchick
Martha Akinlolu
Michael McKenna
Mike Qiu
M. Ghauri
Mykola Burynok
Nafis Abrar
Nazneen Rajani
Nour Elkott
N. Fahmy
Olanrewaju Samuel
Ran An
R. Kromann
Ryan Hao
S. Alizadeh
Sarmad Shubber
Silas L. Wang
Sourav Roy
S. Viguier
Thanh-Cong Le
Tobi Oyebade
T. Le
Yoyo Yang
Zach Nguyen
Abhinav Ramesh Kashyap
Alfredo Palasciano
A. Callahan
Anima Shukla
Antonio Miranda-Escalada
A. Singh
Benjamin Beilharz
Bo Wang
C. Brito
Chenxi Zhou
Chirag Jain
Chuxin Xu
Clémentine Fourrier
Daniel León Perinán
Daniel Molano
Dian Yu
Enrique Manjavacas
Fabio Barth
Florian Fuhrimann
Gabriel Altay
Giyaseddin Bayrak
Gully Burns
Helena U. Vrabec
I. Bello
Isha Dash
J. Kang
John Giorgi
Jonas Golde
J. Posada
Karthi Sivaraman
Lokesh Bulchandani
Lu Liu
Luisa Shinzato
Madeleine Hahn de Bykhovetz
Maiko Takeuchi
Marc Pàmies
M. A. Castillo
Marianna Nezhurina
Mario Sanger
Matthias Samwald
Michael Cullan
Michael Weinberg
M. Wolf
Mina Mihaljcic
Minna Liu
M. Freidank
Myungsun Kang
Natasha Seelam
N. Dahlberg
N. Broad
N. Muellner
Pascale Fung
Patrick Haller
Patricia Haller
R. Eisenberg
Robert Martin
Rodrigo Canalli
Rosaline Su
Ruisi Su
Samuel Cahyawijaya
Samuele Garda
Shlok S Deshmukh
Shubhanshu Mishra
Sid Kiblawi
Simon Ott
Sinee Sang-aroonsiri
Srishti Kumar
Stefan Schweter
S. Bharati
Tanmay Laud
Théo Gigant
Tomoya Kainuma
Wojciech Kusa
Yanis Labrak
Yashasvi Bajaj
Y. Venkatraman
Yifan Xu
Ying Xu
Yu Xu
Z. Tan
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"BLOOM: A 176B-Parameter Open-Access Multilingual Language Model"
50 / 1,621 papers shown
Title
MoEMoE: Question Guided Dense and Scalable Sparse Mixture-of-Expert for Multi-source Multi-modal Answering
Vinay K. Verma
S. Kulkarni
Happy Mittal
D. Gupta
MoE
53
0
0
08 Mar 2025
Similarity-Based Domain Adaptation with LLMs
Jie He
Wendi Zhou
Xiang Li
Jeff Z. Pan
44
0
0
07 Mar 2025
Biases in Large Language Model-Elicited Text: A Case Study in Natural Language Inference
Grace Proebsting
Adam Poliak
45
0
0
06 Mar 2025
HeTGB: A Comprehensive Benchmark for Heterophilic Text-Attributed Graphs
Shujie Li
Yuxia Wu
Chuan Shi
Yuan Fang
39
0
0
05 Mar 2025
Zero-Shot Multi-Label Classification of Bangla Documents: Large Decoders Vs. Classic Encoders
Souvika Sarkar
M. Hasan
S. Karmaker
39
0
0
04 Mar 2025
In-context Learning vs. Instruction Tuning: The Case of Small and Multilingual Language Models
David Ponce
Thierry Etchegoyhen
65
1
0
03 Mar 2025
Enhancing Non-English Capabilities of English-Centric Large Language Models through Deep Supervision Fine-Tuning
Wenshuai Huo
Xiaocheng Feng
Yichong Huang
Chengpeng Fu
Baohang Li
...
Dandan Tu
Duyu Tang
Yunfei Lu
Hui Wang
Bing Qin
55
0
0
03 Mar 2025
Set-Theoretic Compositionality of Sentence Embeddings
Naman Bansal
Yash Mahajan
Sanjeev Kumar Sinha
S. Karmaker
CoGe
67
0
0
28 Feb 2025
AoECR: AI-ization of Elderly Care Robot
Linkun Zhou
Jian Li
Yadong Mo
X. Zhang
Ying Zhang
Shimin Wei
39
0
0
27 Feb 2025
FOReCAst: The Future Outcome Reasoning and Confidence Assessment Benchmark
Zhangdie Yuan
Zifeng Ding
Andreas Vlachos
AI4TS
77
0
0
27 Feb 2025
SCA3D: Enhancing Cross-modal 3D Retrieval via 3D Shape and Caption Paired Data Augmentation
Junlong Ren
Hao Wu
Hui Xiong
H. Wang
63
0
0
26 Feb 2025
Grandes modelos de lenguaje: de la predicción de palabras a la comprensión?
Carlos Gómez-Rodríguez
SyDa
AILaw
ELM
VLM
94
0
0
25 Feb 2025
Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference
Zhuo Chen
Xinyu Wang
Yong-feng Jiang
Zhen Zhang
Xinyu Geng
Pengjun Xie
Fei Huang
Kewei Tu
98
0
0
25 Feb 2025
Steering into New Embedding Spaces: Analyzing Cross-Lingual Alignment Induced by Model Interventions in Multilingual Language Models
Anirudh Sundar
Sinead Williamson
Katherine Metcalf
B. Theobald
Skyler Seto
Masha Fedzechkina
LLMSV
77
0
0
24 Feb 2025
TituLLMs: A Family of Bangla LLMs with Comprehensive Benchmarking
Shahriar Kabir Nahin
R. N. Nandi
Sagor Sarker
Quazi Sarwar Muhtaseem
Md. Kowsher
Apu Chandraw Shill
Md Ibrahim
Mehadi Hasan Menon
Tareq Al Muntasir
Firoj Alam
66
0
0
24 Feb 2025
Tokenization is Sensitive to Language Variation
Anna Wegmann
Dong Nguyen
David Jurgens
75
1
0
24 Feb 2025
Stealing Training Data from Large Language Models in Decentralized Training through Activation Inversion Attack
Chenxi Dai
Lin Lu
Pan Zhou
42
0
0
22 Feb 2025
Understanding the Emergence of Multimodal Representation Alignment
Megan Tjandrasuwita
Chanakya Ekbote
Liu Ziyin
Paul Pu Liang
45
1
0
22 Feb 2025
MedForge: Building Medical Foundation Models Like Open Source Software Development
Zheling Tan
Kexin Ding
Jin Gao
Mu Zhou
Dimitris N. Metaxas
Shaoting Zhang
Dequan Wang
AI4CE
45
1
0
22 Feb 2025
CVE-LLM : Ontology-Assisted Automatic Vulnerability Evaluation Using Large Language Models
Rikhiya Ghosh
H. V. Stockhausen
Martin Schmitt
George Marica Vasile
Sanjeev Kumar Karn
Oladimeji Farri
30
1
0
21 Feb 2025
Comprehensive Analysis of Transparency and Accessibility of ChatGPT, DeepSeek, And other SoTA Large Language Models
Ranjan Sapkota
Shaina Raza
Manoj Karkee
40
4
0
21 Feb 2025
A Close Look at Decomposition-based XAI-Methods for Transformer Language Models
L. Arras
Bruno Puri
Patrick Kahardipraja
Sebastian Lapuschkin
Wojciech Samek
35
0
0
21 Feb 2025
Multilingual Language Model Pretraining using Machine-translated Data
Jiayi Wang
Yao Lu
Maurice Weber
Max Ryabinin
David Ifeoluwa Adelani
Yihong Chen
Raphael Tang
Pontus Stenetorp
LRM
73
2
0
20 Feb 2025
Talking About the Assumption in the Room
Ramaravind Kommiya Mothilal
Faisal M. Lalani
Syed Ishtiaque Ahmed
Shion Guha
Sharifa Sultana
56
0
0
20 Feb 2025
Multilingual Non-Factoid Question Answering with Answer Paragraph Selection
Ritwik Mishra
Sreeram Vennam
R. Shah
Ponnurangam Kumaraguru
93
0
0
20 Feb 2025
EvoP: Robust LLM Inference via Evolutionary Pruning
Shangyu Wu
Hongchao Du
Ying Xiong
Shuai Chen
Tei-Wei Kuo
Nan Guan
Chun Jason Xue
32
1
0
19 Feb 2025
Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search
Yifan Ji
Zhipeng Xu
Zhenghao Liu
Yukun Yan
S. Yu
Y. Li
Zhiyuan Liu
Yu Gu
Ge Yu
Maosong Sun
RALM
56
0
0
18 Feb 2025
Factual Inconsistency in Data-to-Text Generation Scales Exponentially with LLM Size: A Statistical Validation
Joy Mahapatra
Soumyajit Roy
Utpal Garain
HILM
ALM
75
0
0
17 Feb 2025
User Profile with Large Language Models: Construction, Updating, and Benchmarking
Nusrat Jahan Prottasha
Md. Kowsher
Hafijur Raman
Israt Jahan Anny
Prakash Bhat
Ivan Garibay
O. Garibay
34
0
0
15 Feb 2025
Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages
Shreyan Biswas
Alexander Erlei
U. Gadiraju
103
4
0
13 Feb 2025
LLM-Enhanced Multiple Instance Learning for Joint Rumor and Stance Detection with Social Context Information
Ruichao Yang
Jing Ma
Wei Gao
Hongzhan Lin
63
0
0
13 Feb 2025
Matina: A Large-Scale 73B Token Persian Text Corpus
Sara Bourbour Hosseinbeigi
Fatemeh Taherinezhad
Heshaam Faili
Hamed Baghbani
Fatemeh Nadi
Mostafa Amiri
71
0
0
13 Feb 2025
EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models
Xingrun Xing
Zheng Liu
Shitao Xiao
Boyan Gao
Yiming Liang
Wanpeng Zhang
Haokun Lin
Guoqi Li
Jiajun Zhang
LRM
56
1
0
10 Feb 2025
Automated Consistency Analysis of LLMs
Aditya Patwardhan
Vivek Vaidya
Ashish Kundu
50
0
0
10 Feb 2025
Self-Training Large Language Models for Tool-Use Without Demonstrations
Ne Luo
Aryo Pradipta Gema
Xuanli He
Emile van Krieken
Pietro Lesci
Pasquale Minervini
LLMAG
64
1
0
09 Feb 2025
Evaluating Vision-Language Models for Emotion Recognition
Sree Bhattacharyya
James Z. Wang
VLM
53
0
0
08 Feb 2025
Enhancing Knowledge Graph Construction: Evaluating with Emphasis on Hallucination, Omission, and Graph Similarity Metrics
Hussam Ghanem
C. Cruz
56
0
0
07 Feb 2025
FinBloom: Knowledge Grounding Large Language Model with Real-time Financial Data
Ankur Sinha
Chaitanya Agarwal
P. Malo
AIFin
42
0
0
04 Feb 2025
LAST SToP For Modeling Asynchronous Time Series
Shubham Gupta
Thibaut Durand
Graham Taylor
Lilian W. Białokozowicz
AI4TS
42
0
0
04 Feb 2025
CE-LoRA: Computation-Efficient LoRA Fine-Tuning for Language Models
Guanduo Chen
Yutong He
Yipeng Hu
Kun Yuan
Binhang Yuan
41
0
0
03 Feb 2025
Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction
Kritarth Prasad
Mohammadi Zaki
Pratik Rakesh Singh
Pankaj Wasnik
31
0
0
28 Jan 2025
Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models
Jingwei Yi
Yueqi Xie
Bin Zhu
Emre Kiciman
Guangzhong Sun
Xing Xie
Fangzhao Wu
AAML
51
64
0
28 Jan 2025
Uncovering Latent Arguments in Social Media Messaging by Employing LLMs-in-the-Loop Strategy
Tunazzina Islam
Dan Goldwasser
79
2
0
28 Jan 2025
Parameter-Efficient Fine-Tuning for Foundation Models
Dan Zhang
Tao Feng
Lilong Xue
Yuandong Wang
Yuxiao Dong
J. Tang
37
7
0
23 Jan 2025
LLMs as Repositories of Factual Knowledge: Limitations and Solutions
Seyed Mahed Mousavi
Simone Alghisi
Giuseppe Riccardi
KELM
47
0
0
22 Jan 2025
NExtLong: Toward Effective Long-Context Training without Long Documents
Chaochen Gao
Xing Wu
Zijia Lin
Debing Zhang
Songlin Hu
SyDa
64
1
0
22 Jan 2025
A Survey on Memory-Efficient Large-Scale Model Training in AI for Science
Kaiyuan Tian
Linbo Qiao
Baihui Liu
Gongqingjian Jiang
Dongsheng Li
31
0
0
21 Jan 2025
A Comprehensive Survey of Foundation Models in Medicine
Wasif Khan
Seowung Leem
Kyle B. See
Joshua K. Wong
Shaoting Zhang
R. Fang
AI4CE
LM&MA
VLM
95
17
0
17 Jan 2025
From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs
Hrithik Majumdar Shibu
Shrestha Datta
Md. Sumon Miah
Nasrullah Sami
Mahruba Sharmin Chowdhury
Md. Saiful Islam
59
0
0
17 Jan 2025
A Simple Graph Contrastive Learning Framework for Short Text Classification
Yonghao Liu
Fausto Giunchiglia
Lan Huang
Ximing Li
Xiaoyue Feng
Renchu Guan
32
0
0
17 Jan 2025
Previous
1
2
3
4
5
...
31
32
33
Next