BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

9 November 2022
BigScience Workshop:
Teven Le Scao
Angela Fan
Christopher Akiki
Ellie Pavlick
Suzana Ilić
Daniel Hesslow
Roman Castagné
A. Luccioni
François Yvon
Matthias Gallé
J. Tow
Alexander M. Rush
Stella Biderman
Albert Webson
Pawan Sasanka Ammanamanchi
Thomas Wang
Benoît Sagot
Niklas Muennighoff
Albert Villanova del Moral
Olatunji Ruwase
Rachel Bawden
Stas Bekman
Angelina McMillan-Major
Iz Beltagy
Huu Nguyen
Lucile Saulnier
Samson Tan
Pedro Ortiz Suarez
Victor Sanh
Hugo Laurençon
Yacine Jernite
Julien Launay
Margaret Mitchell
Colin Raffel
Aaron Gokaslan
Adi Simhi
Aitor Soroa Etxabe
Alham Fikri Aji
Amit Alfassy
Anna Rogers
Ariel Kreisberg Nitzav
Canwen Xu
Chenghao Mou
Chris C. Emezue
Christopher Klamm
Colin Leong
Daniel Alexander van Strien
David Ifeoluwa Adelani
Dragomir R. Radev
E. G. Ponferrada
Efrat Levkovizh
Ethan Kim
Eyal Natan
F. Toni
Gérard Dupont
Germán Kruszewski
Giada Pistilli
Hady ElSahar
Hamza Benyamina
H. Tran
Ian Yu
Idris Abdulmumin
Isaac Johnson
Itziar Gonzalez-Dios
Javier de la Rosa
Jenny Chim
Jesse Dodge
Jian Zhu
Jonathan Chang
Jorg Frohberg
Josephine Tobing
J. Bhattacharjee
Khalid Almubarak
Kimbo Chen
Kyle Lo
Leandro von Werra
Leon Weber
Long Phan
Loubna Ben Allal
Ludovic Tanguy
Manan Dey
M. Muñoz
Maraim Masoud
María Grandury
Mario Šaško
Max Huang
Maximin Coavoux
Mayank Singh
Mike Tian-Jian Jiang
Minh Chien Vu
M. A. Jauhar
Mustafa Ghaleb
Nishant Subramani
Nora Kassner
Nurulaqilla Khamis
Olivier Nguyen
Omar Espejel
Ona de Gibert
Paulo Villegas
Peter Henderson
Pierre Colombo
Priscilla Amuok
Quentin Lhoest
Rheza Harliman
Rishi Bommasani
R. López
Rui Ribeiro
Salomey Osei
S. Pyysalo
Sebastian Nagel
Shamik Bose
Shamsuddeen Hassan Muhammad
Shanya Sharma
Shayne Longpre
Somaieh Nikpoor
S. Silberberg
S. Pai
S. Zink
Tiago Timponi Torrent
Timo Schick
Tristan Thrush
V. Danchev
Vassilina Nikoulina
Veronika Laippala
Violette Lepercq
V. Prabhu
Zaid Alyafeai
Zeerak Talat
Arun Raja
Benjamin Heinzerling
Chenglei Si
Davut Emre Taşar
Elizabeth Salesky
Sabrina J. Mielke
Wilson Y. Lee
Abheesht Sharma
Andrea Santilli
Antoine Chaffin
Arnaud Stiegler
Debajyoti Datta
Eliza Szczechla
Gunjan Chhablani
Han Wang
Harshit Pandey
Hendrik Strobelt
Jason Alan Fries
Jos Rozen
Leo Gao
Lintang Sutawika
M Saiful Bari
Maged S. Al-Shaibani
Matteo Manica
Nihal V. Nayak
Ryan Teehan
Samuel Albanie
Sheng Shen
Srulik Ben-David
Stephen H. Bach
Taewoon Kim
T. Bers
Thibault Févry
Trishala Neeraj
Urmish Thakker
Vikas Raunak
Xiang Tang
Zheng-Xin Yong
Zhiqing Sun
Shaked Brody
Y. Uri
Hadar Tojarieh
Adam Roberts
Hyung Won Chung
Jaesung Tae
Jason Phang
Ofir Press
Conglong Li
Deepak Narayanan
Hatim Bourfoune
Jared Casper
Jeff Rasley
Max Ryabinin
Mayank Mishra
Minjia Zhang
M. Shoeybi
Myriam Peyrounette
N. Patry
Nouamane Tazi
Omar Sanseviero
Patrick von Platen
Pierre Cornette
Pierre François Lavallée
Rémi Lacroix
Samyam Rajbhandari
Sanchit Gandhi
Shaden Smith
S. Requena
Suraj Patil
Tim Dettmers
Ahmed Baruwa
Amanpreet Singh
Anastasia Cheveleva
Anne-Laure Ligozat
Arjun Subramonian
Aurélie Névéol
Charles Lovering
Daniel H Garrette
D. Tunuguntla
Ehud Reiter
Ekaterina Taktasheva
E. Voloshina
Eli Bogdanov
Genta Indra Winata
Hailey Schoelkopf
Jan-Christoph Kalo
Jekaterina Novikova
Jessica Zosa Forde
Zdeněk Kasner
Jungo Kasai
Ken Kawamura
Liam Hazan
Marine Carpuat
Miruna Clinciu
Najoung Kim
Newton Cheng
O. Serikov
Omer Antverg
Oskar van der Wal
Rui Zhang
Ruochen Zhang
Sebastian Gehrmann
Shachar Mirkin
S. Pais
Tatiana Shavrina
Thomas Scialom
Tian Yun
Tomasz Limisiewicz
Verena Rieser
Vitaly Protasov
Vladislav Mikhailov
Yada Pruksachatkun
Yonatan Belinkov
Zachary Bamberger
Zdeněk Kasner
Xiangru Tang
A. Pestana
A. Feizpour
Ammar Khan
Amy Faranak
A. Santos
Anthony Hevia
Antigona Unldreaj
Arash Aghagol
Arezoo Abdollahi
A. Tammour
A. HajiHosseini
Bahareh Behroozi
Benjamin Ayoade Ajibade
B. Saxena
Carlos Muñoz Ferrandis
Daniel McDuff
Danish Contractor
D. Lansky
Davis David
Douwe Kiela
D. A. Nguyen
Edward Tan
Emi Baylor
Ezinwanne Ozoani
F. Mirza
Frankline Ononiwu
Habib Rezanejad
H.A. Jones
Indrani Bhattacharya
Irene Solaiman
Irina Sedenko
I. Nejadgholi
J. Passmore
Joshua Seltzer
Julio Bonis Sanz
Lívia Dutra
Mairon Samagaio
Maraim Elbadri
Margot Mieskes
Marissa Gerchick
Martha Akinlolu
Michael McKenna
Mike Qiu
M. Ghauri
Mykola Burynok
Nafis Abrar
Nazneen Rajani
Nour Elkott
N. Fahmy
Olanrewaju Samuel
Ran An
R. Kromann
Ryan Hao
S. Alizadeh
Sarmad Shubber
Silas L. Wang
Sourav Roy
S. Viguier
Thanh-Cong Le
Tobi Oyebade
T. Le
Yoyo Yang
Zach Nguyen
Abhinav Ramesh Kashyap
Alfredo Palasciano
A. Callahan
Anima Shukla
Antonio Miranda-Escalada
A. Singh
Benjamin Beilharz
Bo Wang
C. Brito
Chenxi Zhou
Chirag Jain
Chuxin Xu
Clémentine Fourrier
Daniel León Perinán
Daniel Molano
Dian Yu
Enrique Manjavacas
Fabio Barth
Florian Fuhrimann
Gabriel Altay
Giyaseddin Bayrak
Gully Burns
Helena U. Vrabec
I. Bello
Isha Dash
J. Kang
John Giorgi
Jonas Golde
J. Posada
Karthi Sivaraman
Lokesh Bulchandani
Lu Liu
Luisa Shinzato
Madeleine Hahn de Bykhovetz
Maiko Takeuchi
Marc Pàmies
M. A. Castillo
Marianna Nezhurina
Mario Sanger
Matthias Samwald
Michael Cullan
Michael Weinberg
M. Wolf
Mina Mihaljcic
Minna Liu
M. Freidank
Myungsun Kang
Natasha Seelam
N. Dahlberg
N. Broad
N. Muellner
Pascale Fung
Patrick Haller
Patricia Haller
R. Eisenberg
Robert Martin
Rodrigo Canalli
Rosaline Su
Ruisi Su
Samuel Cahyawijaya
Samuele Garda
Shlok S Deshmukh
Shubhanshu Mishra
Sid Kiblawi
Simon Ott
Sinee Sang-aroonsiri
Srishti Kumar
Stefan Schweter
S. Bharati
Tanmay Laud
Théo Gigant
Tomoya Kainuma
Wojciech Kusa
Yanis Labrak
Yashasvi Bajaj
Y. Venkatraman
Yifan Xu
Ying Xu
Yu Xu
Z. Tan
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
    VLM

Papers citing "BLOOM: A 176B-Parameter Open-Access Multilingual Language Model"

50 / 1,621 papers shown
SantaCoder: don't reach for the stars!
Loubna Ben Allal
Raymond Li
Denis Kocetkov
Chenghao Mou
Christopher Akiki
...
Sean M. Hughes
Daniel Fried
Arjun Guha
H. D. Vries
Leandro von Werra
19
189
0
09 Jan 2023
InPars-Light: Cost-Effective Unsupervised Training of Efficient Rankers
Leonid Boytsov
Preksha Patel
Vivek Sourabh
Riddhi Nisar
Sayan Kundu
R. Ramanathan
Eric Nyberg
19
19
0
08 Jan 2023
Semi-Structured Object Sequence Encoders
V. Rudramurthy
Riyaz Ahmad Bhat
Chulaka Gunasekara
Siva Sankalp Patel
H. Wan
Tejas I. Dhamecha
Danish Contractor
Marina Danilevsky
54
0
0
03 Jan 2023
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot
Elias Frantar
Dan Alistarh
VLM
20
621
0
02 Jan 2023
ChatGPT Makes Medicine Easy to Swallow: An Exploratory Case Study on Simplified Radiology Reports
Katharina Jeblick
B. Schachtner
Jakob Dexl
Andreas Mittermeier
Anna Theresa Stüber
...
Tobias Weber
Philipp Wesp
B. Sabel
J. Ricke
Michael Ingrisch
LM&MA
MedIm
103
369
0
30 Dec 2022
GPT Takes the Bar Exam
M. Bommarito
Daniel Martin Katz
ELM
11
149
0
29 Dec 2022
Countering Malicious Content Moderation Evasion in Online Social Networks: Simulation and Detection of Word Camouflage
Álvaro Huertas-García
Alejandro Martín
Javier Huertas-Tato
David Camacho
21
9
0
27 Dec 2022
Large Language Models Encode Clinical Knowledge
K. Singhal
Shekoofeh Azizi
T. Tu
S. S. Mahdavi
Jason W. Wei
...
A. Rajkomar
Joelle Barral
Christopher Semturs
Alan Karthikesalingam
Vivek Natarajan
LM&MA
ELM
AI4MH
19
2,131
0
26 Dec 2022
SERENGETI: Massively Multilingual Language Models for Africa
Ife Adebara
AbdelRahim Elmadany
Muhammad Abdul-Mageed
Alcides Alcoba Inciarte
17
29
0
21 Dec 2022
JASMINE: Arabic GPT Models for Few-Shot Learning
El Moatez Billah Nagoudi
Muhammad Abdul-Mageed
AbdelRahim Elmadany
Alcides Alcoba Inciarte
Md. Tawkat Islam Khondaker
9
7
0
21 Dec 2022
Perplexed by Quality: A Perplexity-based Method for Adult and Harmful Content Detection in Multilingual Heterogeneous Web Data
Timm Jansen
Yangling Tong
V. Zevallos
Pedro Ortiz Suarez
11
17
0
20 Dec 2022
Towards Reasoning in Large Language Models: A Survey
Jie Huang
Kevin Chen-Chuan Chang
LM&MA
ELM
LRM
19
576
0
20 Dec 2022
Dissecting Transformer Length Extrapolation via the Lens of Receptive Field Analysis
Ta-Chung Chi
Ting-Han Fan
Alexander I. Rudnicky
Peter J. Ramadge
14
40
0
20 Dec 2022
IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages
Ananya B. Sai
Vignesh Nagarajan
Tanay Dixit
Raj Dabre
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
34
20
0
20 Dec 2022
Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training
Jing-ling Huang
Zhengxuan Wu
Kyle Mahowald
Christopher Potts
17
13
0
19 Dec 2022
Training Trajectories of Language Models Across Scales
Mengzhou Xia
Mikel Artetxe
Chunting Zhou
Xi Victoria Lin
Ramakanth Pasunuru
Danqi Chen
Luke Zettlemoyer
Ves Stoyanov
AIFin
LRM
20
52
0
19 Dec 2022
The case for 4-bit precision: k-bit Inference Scaling Laws
Tim Dettmers
Luke Zettlemoyer
MQ
14
210
0
19 Dec 2022
The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges
Genta Indra Winata
Alham Fikri Aji
Zheng-Xin Yong
Thamar Solorio
37
31
0
19 Dec 2022
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Samuel Cahyawijaya
Holy Lovenia
Alham Fikri Aji
Genta Indra Winata
Bryan Wilie
...
Timothy Baldwin
Sebastian Ruder
Herry Sujaini
S. Sakti
Ayu Purwarianti
19
47
0
19 Dec 2022
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Zheng-Xin Yong
Hailey Schoelkopf
Niklas Muennighoff
Alham Fikri Aji
David Ifeoluwa Adelani
...
Genta Indra Winata
Stella Biderman
Edward Raff
Dragomir R. Radev
Vassilina Nikoulina
CLL
VLM
AI4CE
LRM
27
81
0
19 Dec 2022
Large Language Models Meet NL2Code: A Survey
Daoguang Zan
B. Chen
Fengji Zhang
Di Lu
Bingchao Wu
Bei Guan
Yongji Wang
Jian-Guang Lou
ELM
ALM
26
166
0
19 Dec 2022
Rainproof: An Umbrella To Shield Text Generators From Out-Of-Distribution Data
Maxime Darrin
Pablo Piantanida
Pierre Colombo
OODD
24
12
0
18 Dec 2022
Lessons learned from the evaluation of Spanish Language Models
Rodrigo Agerri
Eneko Agirre
ELM
28
15
0
16 Dec 2022
On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning
Omar Shaikh
Hongxin Zhang
William B. Held
Michael S. Bernstein
Diyi Yang
ReLM
LRM
27
181
0
15 Dec 2022
Manifestations of Xenophobia in AI Systems
Nenad Tomašev
J. L. Maynard
Iason Gabriel
19
9
0
15 Dec 2022
Artificial Intelligence for Health Message Generation: Theory, Method, and an Empirical Study Using Prompt Engineering
Sue Lim
Ralf Schmälzle
16
55
0
14 Dec 2022
Structured Prompting: Scaling In-Context Learning to 1,000 Examples
Y. Hao
Yutao Sun
Li Dong
Zhixiong Han
Yuxian Gu
Furu Wei
LRM
14
46
0
13 Dec 2022
Elixir: Train a Large Language Model on a Small GPU Cluster
Haichen Huang
Jiarui Fang
Hongxin Liu
Shenggui Li
Yang You
VLM
16
7
0
10 Dec 2022
DeepSpeed Data Efficiency: Improving Deep Learning Model Quality and Training Efficiency via Efficient Data Sampling and Routing
Conglong Li
Z. Yao
Xiaoxia Wu
Minjia Zhang
Connor Holmes
Cheng Li
Yuxiong He
17
23
0
07 Dec 2022
In-context Examples Selection for Machine Translation
Sweta Agrawal
Chunting Zhou
M. Lewis
Luke Zettlemoyer
Marjan Ghazvininejad
LRM
20
185
0
05 Dec 2022
Nonparametric Masked Language Modeling
Sewon Min
Weijia Shi
M. Lewis
Xilun Chen
Wen-tau Yih
Hannaneh Hajishirzi
Luke Zettlemoyer
RALM
40
48
0
02 Dec 2022
A Comprehensive Survey on Enterprise Financial Risk Analysis from Big Data Perspective
Yu Zhao
Huaming Du
Qing Li
Fuzhen Zhuang
Ji Liu
Gang Kou
30
1
0
28 Nov 2022
Understanding BLOOM: An empirical study on diverse NLP tasks
Parag Dakle
Sai Krishna Rallabandi
Preethi Raghavan
AI4CE
17
3
0
27 Nov 2022
Undesirable Biases in NLP: Addressing Challenges of Measurement
Oskar van der Wal
Dominik Bachmann
Alina Leidinger
L. Maanen
Willem H. Zuidema
K. Schulz
17
6
0
24 Nov 2022
Multitask Vision-Language Prompt Tuning
Sheng Shen
Shijia Yang
Tianjun Zhang
Bohan Zhai
Joseph E. Gonzalez
Kurt Keutzer
Trevor Darrell
VLM
VPVLM
17
49
0
21 Nov 2022
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Guangxuan Xiao
Ji Lin
Mickael Seznec
Hao Wu
Julien Demouth
Song Han
MQ
24
724
0
18 Nov 2022
GAMMT: Generative Ambiguity Modeling Using Multiple Transformers
Xingcheng Xu
14
0
0
16 Nov 2022
Large Language Models Struggle to Learn Long-Tail Knowledge
Nikhil Kandpal
H. Deng
Adam Roberts
Eric Wallace
Colin Raffel
RALM
KELM
31
373
0
15 Nov 2022
Astronomia ex machina: a history, primer, and outlook on neural networks in astronomy
Michael J. Smith
James E. Geach
16
32
0
07 Nov 2022
Knowledge Graph Embedding: A Survey from the Perspective of Representation Spaces
Jiahang Cao
Jinyuan Fang
Zaiqiao Meng
Shangsong Liang
18
60
0
07 Nov 2022
Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language Model
A. Luccioni
S. Viguier
Anne-Laure Ligozat
33
256
0
03 Nov 2022
Learning New Tasks from a Few Examples with Soft-Label Prototypes
Avyav Kumar Singh
Ekaterina Shutova
H. Yannakoudakis
VLM
19
0
0
31 Oct 2022
What Language Model to Train if You Have One Million GPU Hours?
Teven Le Scao
Thomas Wang
Daniel Hesslow
Lucile Saulnier
Stas Bekman
...
Lintang Sutawika
Jaesung Tae
Zheng-Xin Yong
Julien Launay
Iz Beltagy
MoE
AI4CE
225
103
0
27 Oct 2022
Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods
Evan Crothers
Nathalie Japkowicz
H. Viktor
DeLMO
20
107
0
13 Oct 2022
MTEB: Massive Text Embedding Benchmark
Niklas Muennighoff
Nouamane Tazi
L. Magne
Nils Reimers
16
363
0
13 Oct 2022
Bootstrapping Multilingual Semantic Parsers using Large Language Models
Abhijeet Awasthi
Nitish Gupta
Bidisha Samanta
Shachi Dave
Sunita Sarawagi
Partha P. Talukdar
16
7
0
13 Oct 2022
Benchmarking Long-tail Generalization with Likelihood Splits
Ameya Godbole
Robin Jia
ALM
15
8
0
13 Oct 2022
LLMEffiChecker: Understanding and Testing Efficiency Degradation of Large Language Models
Simin Chen
Cong Liu
Mirazul Haque
Wei Yang
34
21
0
07 Oct 2022
GLM-130B: An Open Bilingual Pre-trained Model
Aohan Zeng
Xiao Liu
Zhengxiao Du
Zihan Wang
Hanyu Lai
...
Jidong Zhai
Wenguang Chen
Peng-Zhen Zhang
Yuxiao Dong
Jie Tang
BDL
LRM
242
1,070
0
05 Oct 2022
Petals: Collaborative Inference and Fine-tuning of Large Models
Alexander Borzunov
Dmitry Baranchuk
Tim Dettmers
Max Ryabinin
Younes Belkada
Artem Chumachenko
Pavel Samygin
Colin Raffel
VLM
17
61
0
02 Sep 2022