ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.03771
  4. Cited By
HuggingFace's Transformers: State-of-the-art Natural Language Processing
v1v2v3v4v5 (latest)

HuggingFace's Transformers: State-of-the-art Natural Language Processing

9 October 2019
Thomas Wolf
Lysandre Debut
Victor Sanh
Julien Chaumond
Clement Delangue
Anthony Moi
Pierric Cistac
Tim Rault
Rémi Louf
Morgan Funtowicz
Joe Davison
Sam Shleifer
Patrick von Platen
Clara Ma
Yacine Jernite
J. Plu
Canwen Xu
Teven Le Scao
Sylvain Gugger
Mariama Drame
Quentin Lhoest
Alexander M. Rush
    AI4CE
ArXiv (abs)PDFHTMLGithub (144926★)

Papers citing "HuggingFace's Transformers: State-of-the-art Natural Language Processing"

50 / 516 papers shown
Title
Selective Annotation Makes Language Models Better Few-Shot Learners
Selective Annotation Makes Language Models Better Few-Shot Learners
Hongjin Su
Jungo Kasai
Chen Henry Wu
Weijia Shi
Tianlu Wang
...
Rui Zhang
Mari Ostendorf
Luke Zettlemoyer
Noah A. Smith
Tao Yu
118
262
0
05 Sep 2022
ChemBERTa-2: Towards Chemical Foundation Models
ChemBERTa-2: Towards Chemical Foundation Models
Walid Ahmad
Elana Simon
Seyone Chithrananda
Gabriel Grand
Bharath Ramsundar
AI4CE
68
141
0
05 Sep 2022
ArgLegalSumm: Improving Abstractive Summarization of Legal Documents
  with Argument Mining
ArgLegalSumm: Improving Abstractive Summarization of Legal Documents with Argument Mining
Mohamed S. Elaraby
Diane Litman
AILawELM
103
33
0
04 Sep 2022
DualVoice: Speech Interaction that Discriminates between Normal and
  Whispered Voice Input
DualVoice: Speech Interaction that Discriminates between Normal and Whispered Voice Input
Jun Rekimoto
58
6
0
22 Aug 2022
Reliable Decision from Multiple Subtasks through Threshold Optimization:
  Content Moderation in the Wild
Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild
Donghyun Son
Byounggyu Lew
Kwanghee Choi
Yongsu Baek
Seungwoo Choi
Beomjun Shin
S. Ha
Buru Chang
59
10
0
16 Aug 2022
Reproduction and Replication of an Adversarial Stylometry Experiment
Reproduction and Replication of an Adversarial Stylometry Experiment
Haining Wang
P. Juola
A. Riddell
57
2
0
15 Aug 2022
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
Tim Dettmers
M. Lewis
Younes Belkada
Luke Zettlemoyer
MQ
147
666
0
15 Aug 2022
An Algorithm-Hardware Co-Optimized Framework for Accelerating N:M Sparse
  Transformers
An Algorithm-Hardware Co-Optimized Framework for Accelerating N:M Sparse Transformers
Chao Fang
Aojun Zhou
Zhongfeng Wang
MoE
76
54
0
12 Aug 2022
A Multimodal Transformer: Fusing Clinical Notes with Structured EHR Data
  for Interpretable In-Hospital Mortality Prediction
A Multimodal Transformer: Fusing Clinical Notes with Structured EHR Data for Interpretable In-Hospital Mortality Prediction
Weimin Lyu
Xinyu Dong
Rachel Wong
Songzhu Zheng
Kayley Abell-Hart
Fusheng Wang
Chao Chen
113
52
0
09 Aug 2022
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq
  Model
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model
Saleh Soltan
Shankar Ananthakrishnan
Jack G. M. FitzGerald
Rahul Gupta
Wael Hamza
...
Mukund Sridhar
Fabian Triefenbach
Apurv Verma
Gokhan Tur
Premkumar Natarajan
129
83
0
02 Aug 2022
Dynamics and triggers of misinformation on vaccines
Dynamics and triggers of misinformation on vaccines
Emanuele Brugnoli
Marco Delmastro
98
4
0
25 Jul 2022
CodeT: Code Generation with Generated Tests
CodeT: Code Generation with Generated Tests
Bei Chen
Fengji Zhang
A. Nguyen
Daoguang Zan
Zeqi Lin
Jian-Guang Lou
Weizhu Chen
105
349
0
21 Jul 2022
SparseTIR: Composable Abstractions for Sparse Compilation in Deep
  Learning
SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning
Zihao Ye
Ruihang Lai
Junru Shao
Tianqi Chen
Luis Ceze
125
98
0
11 Jul 2022
Probing Classifiers are Unreliable for Concept Removal and Detection
Probing Classifiers are Unreliable for Concept Removal and Detection
Abhinav Kumar
Chenhao Tan
Amit Sharma
AAML
101
25
0
08 Jul 2022
AST-Probe: Recovering abstract syntax trees from hidden representations
  of pre-trained language models
AST-Probe: Recovering abstract syntax trees from hidden representations of pre-trained language models
José Antonio Hernández López
Martin Weyssow
Jesús Sánchez Cuadrado
H. Sahraoui
55
23
0
23 Jun 2022
Twitter conversations predict the daily confirmed COVID-19 cases
Twitter conversations predict the daily confirmed COVID-19 cases
Rabindra Lamsal
Aaron Harwood
M. Read
58
26
0
21 Jun 2022
A Universal Adversarial Policy for Text Classifiers
A Universal Adversarial Policy for Text Classifiers
Gallil Maimon
Lior Rokach
AAML
131
10
0
19 Jun 2022
CERT: Continual Pre-Training on Sketches for Library-Oriented Code
  Generation
CERT: Continual Pre-Training on Sketches for Library-Oriented Code Generation
Daoguang Zan
Bei Chen
Dejian Yang
Zeqi Lin
Minsu Kim
Bei Guan
Yongji Wang
Weizhu Chen
Jian-Guang Lou
86
129
0
14 Jun 2022
Exploring Adversarial Attacks and Defenses in Vision Transformers
  trained with DINO
Exploring Adversarial Attacks and Defenses in Vision Transformers trained with DINO
Javier Rando
Nasib Naimi
Thomas Baumann
Max Mathys
AAML
53
6
0
14 Jun 2022
Task Transfer and Domain Adaptation for Zero-Shot Question Answering
Task Transfer and Domain Adaptation for Zero-Shot Question Answering
X. Pan
Alex Sheng
David R Shimshoni
Aditya Singhal
Sara Rosenthal
Avirup Sil
53
4
0
14 Jun 2022
Memory-Based Model Editing at Scale
Memory-Based Model Editing at Scale
E. Mitchell
Charles Lin
Antoine Bosselut
Christopher D. Manning
Chelsea Finn
KELM
116
362
0
13 Jun 2022
Improving Pre-trained Language Model Fine-tuning with Noise Stability
  Regularization
Improving Pre-trained Language Model Fine-tuning with Noise Stability Regularization
Hang Hua
Xingjian Li
Dejing Dou
Chengzhong Xu
Jiebo Luo
94
15
0
12 Jun 2022
SsciBERT: A Pre-trained Language Model for Social Science Texts
SsciBERT: A Pre-trained Language Model for Social Science Texts
Si Shen
Jiang-fu Liu
Litao Lin
Ying Huang
Lin Zhang
Changting Liu
Yu Feng
Dongbo Wang
106
28
0
09 Jun 2022
Learning to Ask Like a Physician
Learning to Ask Like a Physician
Eric P. Lehman
Vladislav Lialin
K. Y. Legaspi
Anne Janelle R. Sy
Patricia Therese S. Pile
...
Anna Rumshisky
Jenifer Liang
Preethi Raghavan
Leo Anthony Celi
Peter Szolovits
OOD
80
20
0
06 Jun 2022
ZeroQuant: Efficient and Affordable Post-Training Quantization for
  Large-Scale Transformers
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
Z. Yao
Reza Yazdani Aminabadi
Minjia Zhang
Xiaoxia Wu
Conglong Li
Yuxiong He
VLMMQ
174
484
0
04 Jun 2022
Hollywood Identity Bias Dataset: A Context Oriented Bias Analysis of
  Movie Dialogues
Hollywood Identity Bias Dataset: A Context Oriented Bias Analysis of Movie Dialogues
Sandhya Singh
Prapti Roy
Nihar Ranjan Sahoo
Niteesh Mallela
Himanshu Gupta
...
Milind Savagaonkar
Nidhi
Roshni Ramnani
Anutosh Maitra
Shubhashis Sengupta
39
14
0
31 May 2022
Quark: Controllable Text Generation with Reinforced Unlearning
Quark: Controllable Text Generation with Reinforced Unlearning
Ximing Lu
Sean Welleck
Jack Hessel
Liwei Jiang
Lianhui Qin
Peter West
Prithviraj Ammanabrolu
Yejin Choi
MU
172
220
0
26 May 2022
Large Language Models are Few-Shot Clinical Information Extractors
Large Language Models are Few-Shot Clinical Information Extractors
Monica Agrawal
S. Hegselmann
Hunter Lang
Yoon Kim
David Sontag
BDLLM&MA
253
349
0
25 May 2022
Generating Information-Seeking Conversations from Unlabeled Documents
Generating Information-Seeking Conversations from Unlabeled Documents
Gangwoo Kim
Sungdong Kim
Kang Min Yoo
Jaewoo Kang
61
13
0
25 May 2022
ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate
  Speech Detection
ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection
Badr AlKhamissi
Faisal Ladhak
Srini Iyer
Ves Stoyanov
Zornitsa Kozareva
Xian Li
Pascale Fung
Lambert Mathias
Asli Celikyilmaz
Mona T. Diab
111
18
0
25 May 2022
New Intent Discovery with Pre-training and Contrastive Learning
New Intent Discovery with Pre-training and Contrastive Learning
Yuwei Zhang
Haode Zhang
Li-Ming Zhan
Albert Y.S. Lam
Albert Y. S. Lam
SSLVLM
137
43
0
25 May 2022
Disentangling Active and Passive Cosponsorship in the U.S. Congress
Disentangling Active and Passive Cosponsorship in the U.S. Congress
Giuseppe Russo
Christoph Gote
L. Brandenberger
Sophia Schlosser
F. Schweitzer
LLMSVAI4CE
56
7
0
19 May 2022
When to Use Multi-Task Learning vs Intermediate Fine-Tuning for
  Pre-Trained Encoder Transfer Learning
When to Use Multi-Task Learning vs Intermediate Fine-Tuning for Pre-Trained Encoder Transfer Learning
Orion Weller
Kevin Seppi
Matt Gardner
62
23
0
17 May 2022
User Guide for KOTE: Korean Online Comments Emotions Dataset
User Guide for KOTE: Korean Online Comments Emotions Dataset
Duyoung Jeon
Junho Lee
Cheongtag Kim
56
0
0
11 May 2022
Interactive Model Cards: A Human-Centered Approach to Model
  Documentation
Interactive Model Cards: A Human-Centered Approach to Model Documentation
Anamaria Crisan
Margaret Drouhard
Jesse Vig
Nazneen Rajani
HAI
83
92
0
05 May 2022
COSPLAY: Concept Set Guided Personalized Dialogue Generation Across Both
  Party Personas
COSPLAY: Concept Set Guided Personalized Dialogue Generation Across Both Party Personas
Chengshi Xu
Pijian Li
Wei Wang
Haoran Yang
Siyun Wang
Chuangbai Xiao
93
28
0
02 May 2022
AdapterBias: Parameter-efficient Token-dependent Representation Shift
  for Adapters in NLP Tasks
AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks
Chin-Lun Fu
Zih-Ching Chen
Yun-Ru Lee
Hung-yi Lee
85
49
0
30 Apr 2022
HiNER: A Large Hindi Named Entity Recognition Dataset
HiNER: A Large Hindi Named Entity Recognition Dataset
Rudra Murthy
Pallab Bhattacharjee
R. Sharnagat
Jyotsana Khatri
Diptesh Kanojia
P. Bhattacharyya
73
14
0
28 Apr 2022
Translation between Molecules and Natural Language
Translation between Molecules and Natural Language
Carl Edwards
T. Lai
Kevin Ros
Garrett Honke
Kyunghyun Cho
Heng Ji
136
172
0
25 Apr 2022
Making the Most of Text Semantics to Improve Biomedical Vision--Language
  Processing
Making the Most of Text Semantics to Improve Biomedical Vision--Language Processing
Benedikt Boecking
Naoto Usuyama
Shruthi Bannur
Daniel Coelho De Castro
Anton Schwaighofer
...
Tristan Naumann
A. Nori
Javier Alvarez-Valle
Hoifung Poon
Ozan Oktay
87
247
0
21 Apr 2022
Optimize_Prime@DravidianLangTech-ACL2022: Emotion Analysis in Tamil
Optimize_Prime@DravidianLangTech-ACL2022: Emotion Analysis in Tamil
Omkar Gokhale
Shantanu Patankar
Onkar Litake
Aditya Mandke
Dipali M. Kadam
48
1
0
19 Apr 2022
Last Layer Re-Training is Sufficient for Robustness to Spurious
  Correlations
Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations
Polina Kirichenko
Pavel Izmailov
A. Wilson
OOD
102
339
0
06 Apr 2022
Inferring Rewards from Language in Context
Inferring Rewards from Language in Context
Jessy Lin
Daniel Fried
Dan Klein
Anca Dragan
LM&Ro
87
55
0
05 Apr 2022
Applying Automatic Text Summarization for Fake News Detection
Applying Automatic Text Summarization for Fake News Detection
Philipp Hartl
Udo Kruschwitz
82
12
0
04 Apr 2022
ChartQA: A Benchmark for Question Answering about Charts with Visual and
  Logical Reasoning
ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning
Ahmed Masry
Do Xuan Long
J. Tan
Shafiq Joty
Enamul Hoque
AIMat
138
687
0
19 Mar 2022
Probing Factually Grounded Content Transfer with Factual Ablation
Probing Factually Grounded Content Transfer with Factual Ablation
Peter West
Chris Quirk
Michel Galley
Yejin Choi
HILM
79
9
0
18 Mar 2022
Evaluating BERT-based Pre-training Language Models for Detecting
  Misinformation
Evaluating BERT-based Pre-training Language Models for Detecting Misinformation
Rini Anggrainingsih
Ghulam Mubashar Hassan
A. Datta
51
7
0
15 Mar 2022
Learning to Reduce False Positives in Analytic Bug Detectors
Learning to Reduce False Positives in Analytic Bug Detectors
Anant Kharkar
Roshanak Zilouchian Moghaddam
Matthew Jin
Xiaoyu Liu
Xin Shi
Colin B. Clement
Neel Sundaresan
64
44
0
08 Mar 2022
USTC-NELSLIP at SemEval-2022 Task 11: Gazetteer-Adapted Integration
  Network for Multilingual Complex Named Entity Recognition
USTC-NELSLIP at SemEval-2022 Task 11: Gazetteer-Adapted Integration Network for Multilingual Complex Named Entity Recognition
Beiduo Chen
Jun-Yu Ma
Jiajun Qi
Wu Guo
Zhen-Hua Ling
Quan Liu
56
16
0
07 Mar 2022
Deep Lexical Hypothesis: Identifying personality structure in natural
  language
Deep Lexical Hypothesis: Identifying personality structure in natural language
A. Cutler
D. Condon
65
31
0
04 Mar 2022
Previous
123456...91011
Next