1607.04606
Enriching Word Vectors with Subword Information
Transactions of the Association for Computational Linguistics (TACL), 2016
15 July 2016
Piotr Bojanowski
Edouard Grave
Armand Joulin
Tomas Mikolov
NAI
SSL
VLM
Papers citing
"Enriching Word Vectors with Subword Information"
50 / 2,761 papers shown
BioADAPT-MRC: Adversarial Learning-based Domain Adaptation Improves Biomedical Machine Reading Comprehension Task
Maria Mahbub
Sudarshan Srinivasan
Edmon Begoli
Gregory D. Peterson
318
12
0
26 Feb 2022
Automated Identification of Toxic Code Reviews Using ToxiCR
ACM Transactions on Software Engineering and Methodology (TOSEM), 2022
Jaydeb Sarker
Asif Kamal Turzo
Mingyou Dong
Amiangshu Bosu
182
44
0
26 Feb 2022
NeuroView-RNN: It's About Time
Conference on Fairness, Accountability and Transparency (FAccT), 2022
C. Barberan
Sina Alemohammad
Naiming Liu
Randall Balestriero
Richard G. Baraniuk
AI4TS
HAI
196
2
0
23 Feb 2022
Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching
Geewook Kim
Wonseok Hwang
Minjoon Seo
Seunghyun Park
138
0
0
23 Feb 2022
Manage risks in complex engagements by leveraging organization-wide knowledge using Machine Learning
H. Prasad
A. Goyal
Shivram Ramasubramanian
44
0
0
21 Feb 2022
Seeing the advantage: visually grounding word embeddings to better capture human semantic knowledge
Workshop on Cognitive Modeling and Computational Linguistics (CMCL), 2022
Danny Merkx
S. Frank
M. Ernestus
122
4
0
21 Feb 2022
Deep Learning for Hate Speech Detection: A Comparative Study
International Journal of Data Science and Analysis (JDSA), 2022
Jitendra Malik
Hezhe Qiao
Guansong Pang
Anton Van Den Hengel
308
66
0
19 Feb 2022
Data-Driven Mitigation of Adversarial Text Perturbation
Rasika Bhalerao
Mohammad Al-Rubaie
Anand Bhaskar
Igor L. Markov
127
8
0
19 Feb 2022
Synthetic Disinformation Attacks on Automated Fact Verification Systems
AAAI Conference on Artificial Intelligence (AAAI), 2022
Y. Du
Antoine Bosselut
Christopher D. Manning
AAML
OffRL
279
53
0
18 Feb 2022
Evaluating the Construct Validity of Text Embeddings with Application to Survey Questions
EPJ Data Science (EPJ Data Sci.), 2022
Qixiang Fang
D. Nguyen
Daniel L. Oberski
201
15
0
18 Feb 2022
Processing the structure of documents: Logical Layout Analysis of historical newspapers in French
Journal of Data Mining and Digital Humanities (JDMDH), 2022
Nicolas Gutehrlé
Iana Atanassova
143
10
0
16 Feb 2022
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
Malte Ostendorff
Nils Rethmeier
Isabelle Augenstein
Bela Gipp
Georg Rehm
406
102
0
14 Feb 2022
Double-Barreled Question Detection at Momentive
Peng Jiang
K. S. Muppalla
Qingyue Wei
C. N. Gopal
Chun Wang
75
0
0
12 Feb 2022
Using a Language Model in a Kiosk Recommender System at Fast-Food Restaurants
Eduard Zubchuk
Dmitry Menshikov
N. Mikhaylovskiy
140
2
0
08 Feb 2022
How Effective is Incongruity? Implications for Code-mix Sarcasm Detection
ICON, 2022
Aditya Shah
Chandresh Kumar Maurya
122
6
0
06 Feb 2022
A Survey on Automated Sarcasm Detection on Twitter
Bleau Moores
Vijay K. Mago
205
20
0
05 Feb 2022
A Dataset for Interactive Vision-Language Navigation with Unknown Command Feasibility
European Conference on Computer Vision (ECCV), 2022
Andrea Burns
Deniz Arsan
Sanjna Agrawal
Ranjitha Kumar
Kate Saenko
Bryan A. Plummer
419
82
0
04 Feb 2022
Identifying Self-Admitted Technical Debt in Issue Tracking Systems using Machine Learning
Empirical Software Engineering (EMSE), 2022
Yikun Li
Mohamed Soliman
P. Avgeriou
120
37
0
04 Feb 2022
A Benchmark Corpus for the Detection of Automatically Generated Text in Academic Publications
International Conference on Language Resources and Evaluation (LREC), 2022
Vijini Liyanage
Davide Buscaldi
A. Nazarenko
DeLMO
266
32
0
04 Feb 2022
Fairness for Text Classification Tasks with Identity Information Data Augmentation Methods
Mohit Wadhwa
Mohan Bhambhani
Ashvini Jindal
Uma Sawant
Ramanujam Madhavan
148
4
0
04 Feb 2022
L3Cube-MahaCorpus and MahaBERT: Marathi Monolingual Corpus, Marathi BERT Language Models, and Resources
Raviraj Joshi
222
66
0
02 Feb 2022
Correcting diacritics and typos with a ByT5 transformer model
Applied Sciences (Appl. Sci.), 2022
Lukas Stankevicius
M. Lukoševičius
J. Kapočiūtė-Dzikienė
Monika Briediene
Tomas Krilavičius
195
24
0
31 Jan 2022
Nyström Kernel Mean Embeddings
International Conference on Machine Learning (ICML), 2022
Antoine Chatalic
Nicolas Schreuder
Alessandro Rudi
Lorenzo Rosasco
255
24
0
31 Jan 2022
Grammatical cues to subjecthood are redundant in a majority of simple clauses across languages
Kyle Mahowald
Evgeniia Diachek
E. Gibson
Evelina Fedorenko
Richard Futrell
327
11
0
30 Jan 2022
A Unified Approach to Entity-Centric Context Tracking in Social Conversations
International Conference on Language Resources and Evaluation (LREC), 2022
Ulrich Ruckert
Srinivas Sunkara
Abhinav Rastogi
Sushant Prakash
Pranav Khaitan
141
2
0
28 Jan 2022
Towards a Broad Coverage Named Entity Resource: A Data-Efficient Approach for Many Diverse Languages
International Conference on Language Resources and Evaluation (LREC), 2022
Silvia Severini
Ayyoob Imani
Philipp Dufter
Hinrich Schütze
193
9
0
28 Jan 2022
Zero-Shot Sketch Based Image Retrieval using Graph Transformer
International Conference on Pattern Recognition (ICPR), 2022
Sumrit Gupta
Ushasi Chaudhuri
Biplab Banerjee
375
11
0
25 Jan 2022
Weight Expansion: A New Perspective on Dropout and Generalization
Gao Jin
Xinping Yi
Pengfei Yang
Lijun Zhang
S. Schewe
Xiaowei Huang
283
6
0
23 Jan 2022
Taxonomy Enrichment with Text and Graph Vector Representations
Irina Nikishina
M. Tikhomirov
V. Logacheva
Yuriy Nazarov
Sergey Petrakov
Natalia Loukachevitch
201
11
0
21 Jan 2022
APIRO: A Framework for Automated Security Tools API Recommendation
ACM Transactions on Software Engineering and Methodology (TOSEM), 2022
Zarrin Tasnim Sworna
Chadni Islam
M. Babar
129
33
0
20 Jan 2022
Unveiling Project-Specific Bias in Neural Code Models
International Conference on Language Resources and Evaluation (LREC), 2022
Zhiming Li
Yanzhou Li
Tianlin Li
Mengnan Du
Bozhi Wu
Yushi Cao
Yi Li
Yang Liu
289
6
0
19 Jan 2022
TaxoCom: Topic Taxonomy Completion with Hierarchical Discovery of Novel Topic Clusters
The Web Conference (WWW), 2022
Dongha Lee
Jiaming Shen
SeongKu Kang
Susik Yoon
Jiawei Han
Hwanjo Yu
237
44
0
18 Jan 2022
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
International Conference on Language Resources and Evaluation (LREC), 2022
Julien Abadji
Pedro Ortiz Suarez
Laurent Romary
Benoît Sagot
CLL
206
193
0
17 Jan 2022
BDA-SketRet: Bi-Level Domain Adaptation for Zero-Shot SBIR
Neurocomputing, 2022
Ushasi Chaudhuri
Ruchika Chavan
Biplab Banerjee
Anjan Dutta
Zeynep Akata
170
23
0
17 Jan 2022
Addressing the Challenges of Cross-Lingual Hate Speech Detection
Irina Bigoulaeva
Viktor Hangya
Iryna Gurevych
Kangyang Luo
198
4
0
15 Jan 2022
Cost-Effective Training in Low-Resource Neural Machine Translation
Sai Koneru
Danni Liu
Jan Niehues
138
1
0
14 Jan 2022
Model Stability with Continuous Data Updates
Huiting Liu
Avinesh P.V.S
Siddharth Patwardhan
Peter Grasch
Sachin Agarwal
133
17
0
14 Jan 2022
A Warm Start and a Clean Crawled Corpus -- A Recipe for Good Language Models
International Conference on Language Resources and Evaluation (LREC), 2022
Vésteinn Snæbjarnarson
Haukur Barri Símonarson
Pétur Orri Ragnarsson
Svanhvít Lilja Ingólfsdóttir
H. Jónsson
Vilhjálmur Þorsteinsson
H. Einarsson
274
32
0
14 Jan 2022
Sentiment Analysis with Deep Learning Models: A Comparative Study on a Decade of Sinhala Language Facebook Data
International Conference on Artificial Intelligence in Electronics Engineering (AIEE), 2022
Gihan Weeraprameshwara
Vihanga Jayawickrama
Nisansa de Silva
Yudhanjaya Wijeratne
159
6
0
11 Jan 2022
Deriving discriminative classifiers from generative models
E. Azeraf
E. Monfrini
W. Pieczynski
124
2
0
03 Jan 2022
Which Student is Best? A Comprehensive Knowledge Distillation Exam for Task-Specific BERT Models
Made Nindyatama Nityasya
Haryo Akbarianto Wibowo
Rendi Chevi
Radityo Eko Prasojo
Alham Fikri Aji
178
7
0
03 Jan 2022
Semantic Search for Large Scale Clinical Ontologies
American Medical Informatics Association Annual Symposium (AMIA), 2022
Duy-Hoa Ngo
Madonna Kemp
Donna L. Truran
Bevan Koopman
Alejandro Metke-Jimenez
VLM
66
3
0
01 Jan 2022
Clustering Vietnamese Conversations From Facebook Page To Build Training Dataset For Chatbot
Jordanian Journal of Computers and Information Technology (JJCIT), 2021
Tri Nguyen
Thi-Kim-Ngoan Pham
T. Bui
Thanh-Quynh-Chau Nguyen
129
1
0
31 Dec 2021
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study
Haiyang Yu
Jingye Chen
Bin Li
Jianqi Ma
Mengnan Guan
Xixi Xu
Xiaocong Wang
Shaobo Qu
Xiangyang Xue
249
69
0
30 Dec 2021
"A Passage to India": Pre-trained Word Embeddings for Indian Languages
Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU), 2020
Saurav Kumar
Saunack Kumar
Helen Treharne
P. Bhattacharyya
225
36
0
27 Dec 2021
Secondary Use of Clinical Problem List Entries for Neural Network-Based Disease Code Assignment
Medical Informatics Europe (MIE), 2021
Markus Kreuzthaler
Bastian Pfeifer
Diether Kramer
S. Schulz
151
0
0
27 Dec 2021
PerCQA: Persian Community Question Answering Dataset
International Conference on Language Resources and Evaluation (LREC), 2021
Naghme Jamali
Yadollah Yaghoobzadeh
H. Faili
86
11
0
25 Dec 2021
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
AAAI Conference on Artificial Intelligence (AAAI), 2021
Y. He
Chen Chen
Jing Zhang
Juhua Liu
Fengxiang He
Chaoyue Wang
Bo Du
230
59
0
24 Dec 2021
Sentence Embeddings and High-speed Similarity Search for Fast Computer Assisted Annotation of Legal Documents
International Conference on Legal Knowledge and Information Systems (JURIX), 2020
Hannes Westermann
Jaromír Šavelka
Vern R. Walker
Kevin D. Ashley
Karim Benyekhlef
AILaw
92
27
0
21 Dec 2021
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
312
198
0
20 Dec 2021