ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.05100
  4. Cited By
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

9 November 2022
BigScience Workshop
:
Teven Le Scao
Angela Fan
Christopher Akiki
Ellie Pavlick
Suzana Ilić
Daniel Hesslow
Roman Castagné
A. Luccioni
François Yvon
Matthias Gallé
J. Tow
Alexander M. Rush
Stella Biderman
Albert Webson
Pawan Sasanka Ammanamanchi
Thomas Wang
Benoît Sagot
Niklas Muennighoff
Albert Villanova del Moral
Olatunji Ruwase
Rachel Bawden
Stas Bekman
Angelina McMillan-Major
Iz Beltagy
Huu Nguyen
Lucile Saulnier
Samson Tan
Pedro Ortiz Suarez
Victor Sanh
Hugo Laurenccon
Yacine Jernite
Julien Launay
Margaret Mitchell
Colin Raffel
Aaron Gokaslan
Adi Simhi
Aitor Soroa Etxabe
Alham Fikri Aji
Amit Alfassy
Anna Rogers
Ariel Kreisberg Nitzav
Canwen Xu
Chenghao Mou
Chris C. Emezue
Christopher Klamm
Colin Leong
Daniel Alexander van Strien
David Ifeoluwa Adelani
Dragomir R. Radev
E. G. Ponferrada
Efrat Levkovizh
Ethan Kim
Eyal Natan
F. Toni
Gérard Dupont
Germán Kruszewski
Giada Pistilli
Hady ElSahar
Hamza Benyamina
H. Tran
Ian Yu
Idris Abdulmumin
Isaac Johnson
Itziar Gonzalez-Dios
Javier de la Rosa
Jenny Chim
Jesse Dodge
Jian Zhu
Jonathan Chang
Jorg Frohberg
Josephine Tobing
J. Bhattacharjee
Khalid Almubarak
Kimbo Chen
Kyle Lo
Leandro von Werra
Leon Weber
Long Phan
Loubna Ben Allal
Ludovic Tanguy
Manan Dey
M. Muñoz
Maraim Masoud
María Grandury
Mario vSavsko
Max Huang
Maximin Coavoux
Mayank Singh
Mike Tian-Jian Jiang
Minh Chien Vu
M. A. Jauhar
Mustafa Ghaleb
Nishant Subramani
Nora Kassner
Nurulaqilla Khamis
Olivier Nguyen
Omar Espejel
Ona de Gibert
Paulo Villegas
Peter Henderson
Pierre Colombo
Priscilla Amuok
Quentin Lhoest
Rheza Harliman
Rishi Bommasani
R. López
Rui Ribeiro
Salomey Osei
S. Pyysalo
Sebastian Nagel
Shamik Bose
Shamsuddeen Hassan Muhammad
Shanya Sharma
Shayne Longpre
Somaieh Nikpoor
S. Silberberg
S. Pai
S. Zink
Tiago Timponi Torrent
Timo Schick
Tristan Thrush
V. Danchev
Vassilina Nikoulina
Veronika Laippala
Violette Lepercq
V. Prabhu
Zaid Alyafeai
Zeerak Talat
Arun Raja
Benjamin Heinzerling
Chenglei Si
Davut Emre Taşar
Elizabeth Salesky
Sabrina J. Mielke
Wilson Y. Lee
Abheesht Sharma
Andrea Santilli
Antoine Chaffin
Arnaud Stiegler
Debajyoti Datta
Eliza Szczechla
Gunjan Chhablani
Han Wang
Harshit Pandey
Hendrik Strobelt
Jason Alan Fries
Jos Rozen
Leo Gao
Lintang Sutawika
M Saiful Bari
Maged S. Al-Shaibani
Matteo Manica
Nihal V. Nayak
Ryan Teehan
Samuel Albanie
Sheng Shen
Srulik Ben-David
Stephen H. Bach
Taewoon Kim
T. Bers
Thibault Févry
Trishala Neeraj
Urmish Thakker
Vikas Raunak
Xiang Tang
Zheng-Xin Yong
Zhiqing Sun
Shaked Brody
Y. Uri
Hadar Tojarieh
Adam Roberts
Hyung Won Chung
Jaesung Tae
Jason Phang
Ofir Press
Conglong Li
Deepak Narayanan
Hatim Bourfoune
Jared Casper
Jeff Rasley
Max Ryabinin
Mayank Mishra
Minjia Zhang
M. Shoeybi
Myriam Peyrounette
N. Patry
Nouamane Tazi
Omar Sanseviero
Patrick von Platen
Pierre Cornette
Pierre Franccois Lavallée
Rémi Lacroix
Samyam Rajbhandari
Sanchit Gandhi
Shaden Smith
S. Requena
Suraj Patil
Tim Dettmers
Ahmed Baruwa
Amanpreet Singh
Anastasia Cheveleva
Anne-Laure Ligozat
Arjun Subramonian
Aurélie Névéol
Charles Lovering
Daniel H Garrette
D. Tunuguntla
Ehud Reiter
Ekaterina Taktasheva
E. Voloshina
Eli Bogdanov
Genta Indra Winata
Hailey Schoelkopf
Jan-Christoph Kalo
Jekaterina Novikova
Jessica Zosa Forde
Zdenvek Kasner
Jungo Kasai
Ken Kawamura
Liam Hazan
Marine Carpuat
Miruna Clinciu
Najoung Kim
Newton Cheng
O. Serikov
Omer Antverg
Oskar van der Wal
Rui Zhang
Ruochen Zhang
Sebastian Gehrmann
Shachar Mirkin
S. Pais
Tatiana Shavrina
Thomas Scialom
Tian Yun
Tomasz Limisiewicz
Verena Rieser
Vitaly Protasov
Vladislav Mikhailov
Yada Pruksachatkun
Yonatan Belinkov
Zachary Bamberger
Zdeněk Kasner
Xiangru Tang
A. Pestana
A. Feizpour
Ammar Khan
Amy Faranak
A. Santos
Anthony Hevia
Antigona Unldreaj
Arash Aghagol
Arezoo Abdollahi
A. Tammour
A. HajiHosseini
Bahareh Behroozi
Benjamin Ayoade Ajibade
B. Saxena
Carlos Muñoz Ferrandis
Daniel McDuff
Danish Contractor
D. Lansky
Davis David
Douwe Kiela
D. A. Nguyen
Edward Tan
Emi Baylor
Ezinwanne Ozoani
F. Mirza
Frankline Ononiwu
Habib Rezanejad
H.A. Jones
Indrani Bhattacharya
Irene Solaiman
Irina Sedenko
I. Nejadgholi
J. Passmore
Joshua Seltzer
Julio Bonis Sanz
Lívia Dutra
Mairon Samagaio
Maraim Elbadri
Margot Mieskes
Marissa Gerchick
Martha Akinlolu
Michael McKenna
Mike Qiu
M. Ghauri
Mykola Burynok
Nafis Abrar
Nazneen Rajani
Nour Elkott
N. Fahmy
Olanrewaju Samuel
Ran An
R. Kromann
Ryan Hao
S. Alizadeh
Sarmad Shubber
Silas L. Wang
Sourav Roy
S. Viguier
Thanh-Cong Le
Tobi Oyebade
T. Le
Yoyo Yang
Zach Nguyen
Abhinav Ramesh Kashyap
Alfredo Palasciano
A. Callahan
Anima Shukla
Antonio Miranda-Escalada
A. Singh
Benjamin Beilharz
Bo Wang
C. Brito
Chenxi Zhou
Chirag Jain
Chuxin Xu
Clémentine Fourrier
Daniel León Perinán
Daniel Molano
Dian Yu
Enrique Manjavacas
Fabio Barth
Florian Fuhrimann
Gabriel Altay
Giyaseddin Bayrak
Gully Burns
Helena U. Vrabec
I. Bello
Isha Dash
J. Kang
John Giorgi
Jonas Golde
J. Posada
Karthi Sivaraman
Lokesh Bulchandani
Lu Liu
Luisa Shinzato
Madeleine Hahn de Bykhovetz
Maiko Takeuchi
Marc Pàmies
M. A. Castillo
Marianna Nezhurina
Mario Sanger
Matthias Samwald
Michael Cullan
Michael Weinberg
M. Wolf
Mina Mihaljcic
Minna Liu
M. Freidank
Myungsun Kang
Natasha Seelam
N. Dahlberg
N. Broad
N. Muellner
Pascale Fung
Patrick Haller
Patricia Haller
R. Eisenberg
Robert Martin
Rodrigo Canalli
Rosaline Su
Ruisi Su
Samuel Cahyawijaya
Samuele Garda
Shlok S Deshmukh
Shubhanshu Mishra
Sid Kiblawi
Simon Ott
Sinee Sang-aroonsiri
Srishti Kumar
Stefan Schweter
S. Bharati
Tanmay Laud
Théo Gigant
Tomoya Kainuma
Wojciech Kusa
Yanis Labrak
Yashasvi Bajaj
Y. Venkatraman
Yifan Xu
Ying Xu
Yu Xu
Z. Tan
Zhongli Xie
Zifan Ye
M. Bras
Younes Belkada
Thomas Wolf
    VLM
ArXivPDFHTML

Papers citing "BLOOM: A 176B-Parameter Open-Access Multilingual Language Model"

50 / 1,621 papers shown
Title
Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language
  Models
Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language Models
Minseok Choi
Kyunghyun Min
Jaegul Choo
MU
AAML
22
1
0
18 Jun 2024
VoCo-LLaMA: Towards Vision Compression with Large Language Models
VoCo-LLaMA: Towards Vision Compression with Large Language Models
Xubing Ye
Yukang Gan
Xiaoke Huang
Yixiao Ge
Yansong Tang
MLLM
VLM
27
22
0
18 Jun 2024
AI "News" Content Farms Are Easy to Make and Hard to Detect: A Case
  Study in Italian
AI "News" Content Farms Are Easy to Make and Hard to Detect: A Case Study in Italian
Giovanni Puccetti
Anna Rogers
Chiara Alzetta
F. Dell’Orletta
Andrea Esuli
28
7
0
17 Jun 2024
LiLiuM: eBay's Large Language Models for e-commerce
LiLiuM: eBay's Large Language Models for e-commerce
Christian Herold
Michael Kozielski
Leonid Ekimov
Pavel Petrushkov
P. Vandenbussche
Shahram Khadivi
35
1
0
17 Jun 2024
Prefixing Attention Sinks can Mitigate Activation Outliers for Large
  Language Model Quantization
Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization
Seungwoo Son
Wonpyo Park
Woohyun Han
Kyuyeun Kim
Jaeho Lee
MQ
20
10
0
17 Jun 2024
Save It All: Enabling Full Parameter Tuning for Federated Large Language
  Models via Cycle Block Gradient Descent
Save It All: Enabling Full Parameter Tuning for Federated Large Language Models via Cycle Block Gradient Descent
Lin Wang
Zhichao Wang
Xiaoying Tang
29
1
0
17 Jun 2024
Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance
Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance
Somnath Banerjee
Avik Halder
Rajarshi Mandal
Sayan Layek
Ian Soboroff
Rima Hazra
Animesh Mukherjee
52
0
0
17 Jun 2024
Promoting Data and Model Privacy in Federated Learning through Quantized
  LoRA
Promoting Data and Model Privacy in Federated Learning through Quantized LoRA
Jianhao Zhu
Changze Lv
Xiaohua Wang
Muling Wu
Wenhao Liu
Tianlong Li
Zixuan Ling
Cenyuan Zhang
Xiaoqing Zheng
Xuanjing Huang
29
2
0
16 Jun 2024
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for
  Vision-Language Models
AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Xiyang Wu
Tianrui Guan
Dianqi Li
Shuaiyi Huang
Xiaoyu Liu
...
Abhinav Shrivastava
Furong Huang
Jordan L. Boyd-Graber
Tianyi Zhou
Dinesh Manocha
HILM
LRM
VLM
MLLM
25
13
0
16 Jun 2024
ShareLoRA: Parameter Efficient and Robust Large Language Model
  Fine-tuning via Shared Low-Rank Adaptation
ShareLoRA: Parameter Efficient and Robust Large Language Model Fine-tuning via Shared Low-Rank Adaptation
Yurun Song
Junchen Zhao
Ian G. Harris
S. Jyothi
27
3
0
16 Jun 2024
Breaking the Memory Wall: A Study of I/O Patterns and GPU Memory
  Utilization for Hybrid CPU-GPU Offloaded Optimizers
Breaking the Memory Wall: A Study of I/O Patterns and GPU Memory Utilization for Hybrid CPU-GPU Offloaded Optimizers
Avinash Maurya
Jie Ye
M. Rafique
Franck Cappello
Bogdan Nicolae
21
1
0
15 Jun 2024
A Survey of Large Language Models for Financial Applications: Progress,
  Prospects and Challenges
A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges
Yuqi Nie
Yaxuan Kong
Xiaowen Dong
John M. Mulvey
H. Vincent Poor
Qingsong Wen
Stefan Zohren
AIFin
40
41
0
15 Jun 2024
Datasets for Multilingual Answer Sentence Selection
Datasets for Multilingual Answer Sentence Selection
Matteo Gabburo
S. Campese
Federico Agostini
Alessandro Moschitti
26
0
0
14 Jun 2024
A Survey on Large Language Models from General Purpose to Medical
  Applications: Datasets, Methodologies, and Evaluations
A Survey on Large Language Models from General Purpose to Medical Applications: Datasets, Methodologies, and Evaluations
Jinqiang Wang
Huansheng Ning
Yi Peng
Qikai Wei
Daniel Tesfai
Wenwei Mao
Tao Zhu
Runhe Huang
LM&MA
AI4MH
ELM
36
4
0
14 Jun 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
Holy Lovenia
Rahmad Mahendra
Salsabil Maulana Akbar
Lester James Validad Miranda
Jennifer Santoso
...
Genta Indra Winata
Ruochen Zhang
Fajri Koto
Zheng-Xin Yong
Samuel Cahyawijaya
77
9
0
14 Jun 2024
ProxyLM: Predicting Language Model Performance on Multilingual Tasks via
  Proxy Models
ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models
David Anugraha
Genta Indra Winata
Chenyue Li
Patrick Amadeus Irawan
En-Shiun Annie Lee
33
7
0
13 Jun 2024
Sharing Matters: Analysing Neurons Across Languages and Tasks in LLMs
Sharing Matters: Analysing Neurons Across Languages and Tasks in LLMs
Weixuan Wang
Barry Haddow
Wei Peng
Alexandra Birch
MILM
28
9
0
13 Jun 2024
Deep Exploration of Cross-Lingual Zero-Shot Generalization in
  Instruction Tuning
Deep Exploration of Cross-Lingual Zero-Shot Generalization in Instruction Tuning
Janghoon Han
Changho Lee
Joongbo Shin
Stanley Jungkyu Choi
Honglak Lee
Kynghoon Bae
ALM
24
0
0
13 Jun 2024
Image Textualization: An Automatic Framework for Creating Accurate and
  Detailed Image Descriptions
Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions
Renjie Pi
Jianshu Zhang
Jipeng Zhang
Rui Pan
Zhekai Chen
Tong Zhang
3DV
42
19
0
11 Jun 2024
MINERS: Multilingual Language Models as Semantic Retrievers
MINERS: Multilingual Language Models as Semantic Retrievers
Genta Indra Winata
Ruochen Zhang
David Ifeoluwa Adelani
RALM
44
5
0
11 Jun 2024
BertaQA: How Much Do Language Models Know About Local Culture?
BertaQA: How Much Do Language Models Know About Local Culture?
Julen Etxaniz
Gorka Azkune
A. Soroa
Oier López de Lacalle
Mikel Artetxe
36
6
0
11 Jun 2024
Efficiently Exploring Large Language Models for Document-Level Machine
  Translation with In-context Learning
Efficiently Exploring Large Language Models for Document-Level Machine Translation with In-context Learning
Menglong Cui
Jiangcun Du
Shaolin Zhu
Deyi Xiong
24
11
0
11 Jun 2024
Effectively Compress KV Heads for LLM
Effectively Compress KV Heads for LLM
Hao Yu
Zelan Yang
Shen Li
Yong Li
Jianxin Wu
MQ
VLM
31
12
0
11 Jun 2024
Autoregressive Model Beats Diffusion: Llama for Scalable Image
  Generation
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
Peize Sun
Yi Jiang
Shoufa Chen
Shilong Zhang
Bingyue Peng
Ping Luo
Zehuan Yuan
VLM
55
220
0
10 Jun 2024
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training
  Multiplication-Less Reparameterization
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
Haoran You
Yipin Guo
Yichao Fu
Wei Zhou
Huihong Shi
Xiaofan Zhang
Souvik Kundu
Amir Yazdanbakhsh
Y. Lin
KELM
44
7
0
10 Jun 2024
Are Large Language Models Actually Good at Text Style Transfer?
Are Large Language Models Actually Good at Text Style Transfer?
Sourabrata Mukherjee
Atul Kr. Ojha
Ondrej Dusek
21
10
0
09 Jun 2024
Zero-Shot End-To-End Spoken Question Answering In Medical Domain
Zero-Shot End-To-End Spoken Question Answering In Medical Domain
Yanis Labrak
Adel Moumen
Richard Dufour
Mickael Rouvier
ELM
LM&MA
MedIm
29
0
0
09 Jun 2024
SinkLoRA: Enhanced Efficiency and Chat Capabilities for Long-Context
  Large Language Models
SinkLoRA: Enhanced Efficiency and Chat Capabilities for Long-Context Large Language Models
Hengyu Zhang
RALM
25
2
0
09 Jun 2024
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and
  Effective for LMMs
DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs
Lingchen Meng
Jianwei Yang
Rui Tian
Xiyang Dai
Zuxuan Wu
Jianfeng Gao
Yu-Gang Jiang
VLM
22
8
0
06 Jun 2024
Legal Documents Drafting with Fine-Tuned Pre-Trained Large Language
  Model
Legal Documents Drafting with Fine-Tuned Pre-Trained Large Language Model
Chun-Hsien Lin
Pu-Jen Cheng
AILaw
24
3
0
06 Jun 2024
Repurposing Language Models into Embedding Models: Finding the
  Compute-Optimal Recipe
Repurposing Language Models into Embedding Models: Finding the Compute-Optimal Recipe
Alicja Ziarko
Albert Q. Jiang
Bartosz Piotrowski
Wenda Li
M. Jamnik
Piotr Miłoś
26
0
0
06 Jun 2024
Pre-trained Transformer Uncovers Meaningful Patterns in Human Mobility
  Data
Pre-trained Transformer Uncovers Meaningful Patterns in Human Mobility Data
Alameen Najjar
29
0
0
06 Jun 2024
IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
David Ifeoluwa Adelani
Jessica Ojo
Israel Abebe Azime
Jian Yun Zhuang
Jesujoba Oluwadara Alabi
...
Salomey Osei
Sokhar Samb
Tadesse Kebede Guge
Pontus Stenetorp
Pontus Stenetorp
ELM
50
7
0
05 Jun 2024
LLM-based Rewriting of Inappropriate Argumentation using Reinforcement
  Learning from Machine Feedback
LLM-based Rewriting of Inappropriate Argumentation using Reinforcement Learning from Machine Feedback
Timon Ziegenbein
Gabriella Skitalinskaya
Alireza Bayat Makou
Henning Wachsmuth
LLMAG
KELM
29
5
0
05 Jun 2024
Which Side Are You On? A Multi-task Dataset for End-to-End Argument
  Summarisation and Evaluation
Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
Hao Li
Yuping Wu
Viktor Schlegel
R. Batista-Navarro
Tharindu Madusanka
...
Jiayan Zeng
Xiaochi Wang
Xinran He
Yizhi Li
Goran Nenadic
31
6
0
05 Jun 2024
NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning
  using Large Language Models
NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models
Ancheng Xu
Minghuan Tan
Lei Wang
Min Yang
Ruifeng Xu
LRM
44
0
0
05 Jun 2024
FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language
  Models
FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language Models
Tao Fan
Guoqiang Ma
Yan Kang
Hanlin Gu
Yuanfeng Song
Lixin Fan
Kai Chen
Qiang Yang
18
9
0
04 Jun 2024
UniOQA: A Unified Framework for Knowledge Graph Question Answering with
  Large Language Models
UniOQA: A Unified Framework for Knowledge Graph Question Answering with Large Language Models
Zhuoyang Li
Liran Deng
Hui Liu
Qiaoqiao Liu
Junzhao Du
RALM
24
4
0
04 Jun 2024
LLMs Beyond English: Scaling the Multilingual Capability of LLMs with
  Cross-Lingual Feedback
LLMs Beyond English: Scaling the Multilingual Capability of LLMs with Cross-Lingual Feedback
Wen Lai
Mohsen Mesgar
Alexander M. Fraser
LRM
ALM
48
18
0
03 Jun 2024
The Life Cycle of Large Language Models: A Review of Biases in Education
The Life Cycle of Large Language Models: A Review of Biases in Education
Jinsook Lee
Yann Hicke
Renzhe Yu
Christopher A. Brooks
René F. Kizilcec
AI4Ed
29
1
0
03 Jun 2024
Demonstration Augmentation for Zero-shot In-context Learning
Demonstration Augmentation for Zero-shot In-context Learning
Yi Su
Yunpeng Tai
Yixin Ji
Juntao Li
Bowen Yan
Min Zhang
RALM
33
6
0
03 Jun 2024
Strengthened Symbol Binding Makes Large Language Models Reliable
  Multiple-Choice Selectors
Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-Choice Selectors
Mengge Xue
Zhenyu Hu
Liqun Liu
Kuo Liao
Shuang Li
Honglin Han
Meng Zhao
Chengguo Yin
38
5
0
03 Jun 2024
Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in
  Zero and Few-shot Learning
Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning
Keqi Deng
Guangzhi Sun
Phil Woodland
VLM
28
4
0
01 Jun 2024
A Survey on Large Language Models for Code Generation
A Survey on Large Language Models for Code Generation
Juyong Jiang
Fan Wang
Jiasi Shen
Sungju Kim
Sunghun Kim
40
158
0
01 Jun 2024
Effective Interplay between Sparsity and Quantization: From Theory to Practice
Effective Interplay between Sparsity and Quantization: From Theory to Practice
Simla Burcu Harma
Ayan Chakraborty
Elizaveta Kostenok
Danila Mishin
Dongho Ha
...
Martin Jaggi
Ming Liu
Yunho Oh
Suvinay Subramanian
Amir Yazdanbakhsh
MQ
29
4
0
31 May 2024
Improving Reward Models with Synthetic Critiques
Improving Reward Models with Synthetic Critiques
Zihuiwen Ye
Fraser Greenlee-Scott
Max Bartolo
Phil Blunsom
Jon Ander Campos
Matthias Gallé
ALM
SyDa
LRM
38
17
0
31 May 2024
Using Large Language Models for Humanitarian Frontline Negotiation:
  Opportunities and Considerations
Using Large Language Models for Humanitarian Frontline Negotiation: Opportunities and Considerations
Zilin Ma
Susannah Su
Su
Nathan Zhao
Linn Bieske
...
Boxiang Wang
Jinglun Gao
Zihan Wen
Claude Bruderlein
Weiwei Pan
15
0
0
30 May 2024
InstructionCP: A fast approach to transfer Large Language Models into
  target language
InstructionCP: A fast approach to transfer Large Language Models into target language
Kuang-Ming Chen
Hung-yi Lee
CLL
36
2
0
30 May 2024
Quest: Query-centric Data Synthesis Approach for Long-context Scaling of
  Large Language Model
Quest: Query-centric Data Synthesis Approach for Long-context Scaling of Large Language Model
Chaochen Gao
Xing Wu
Qingfang Fu
Songlin Hu
SyDa
27
5
0
30 May 2024
Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural
  Language Understanding
Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language Understanding
Kuo Liao
Shuang Li
Meng Zhao
Liqun Liu
Mengge Xue
Zhenyu Hu
Honglin Han
Chengguo Yin
33
1
0
30 May 2024
Previous
123...789...313233
Next