1607.04606
Enriching Word Vectors with Subword Information
Transactions of the Association for Computational Linguistics (TACL), 2016
15 July 2016
Piotr Bojanowski
Edouard Grave
Armand Joulin
Tomas Mikolov
NAI
SSL
VLM
Papers citing
"Enriching Word Vectors with Subword Information"
50 / 2,761 papers shown
Latent Functional Maps: a spectral framework for representation alignment
Marco Fumero
Marco Pegoraro
Valentino Maiorca
Francesco Locatello
Emanuele Rodolà
522
1
0
20 Jun 2024
Lexically Grounded Subword Segmentation
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Jindřich Libovický
Jindřich Helcl
263
9
0
19 Jun 2024
Integrating Representational Gestures into Automatically Generated Embodied Explanations and its Effects on Understanding and Interaction Quality
A. Robrecht
Hendric Voss
Lisa Gottschalk
Stefan Kopp
289
1
0
18 Jun 2024
Building Knowledge-Guided Lexica to Model Cultural Variation
Shreya Havaldar
Salvatore Giorgi
Sunny Rai
Thomas Talhelm
Sharath Chandra Guntuku
Lyle Ungar
268
11
0
17 Jun 2024
Revisiting Cosine Similarity via Normalized ICA-transformed Embeddings
Hiroaki Yamagiwa
Momose Oyama
Hidetoshi Shimodaira
LLMSV
276
6
0
16 Jun 2024
A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges
Yuqi Nie
Yaxuan Kong
Xiaowen Dong
John M. Mulvey
H. Vincent Poor
Qingsong Wen
Stefan Zohren
AIFin
305
118
0
15 Jun 2024
HelpSteer2: Open-source dataset for training top-performing reward models
Zhilin Wang
Yi Dong
Olivier Delalleau
Jiaqi Zeng
Gerald Shen
Daniel Egert
Jimmy J. Zhang
Makesh Narsimhan Sreedhar
Oleksii Kuchaiev
AI4TS
313
163
0
12 Jun 2024
Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification
Martin Juan José Bucher
Marco Martini
ALM
AI4MH
335
75
0
12 Jun 2024
MaskLID: Code-Switching Language Identification through Iterative Masking
Amir Hossein Kargaran
François Yvon
Hinrich Schütze
151
7
0
10 Jun 2024
Every Answer Matters: Evaluating Commonsense with Probabilistic Measures
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Qi Cheng
Michael Boratko
Pranay Kumar Yelugam
T. O’Gorman
Nalini Singh
Andrew McCallum
X. Li
ELM
LRM
266
6
0
06 Jun 2024
Explaining the Contributing Factors for Vulnerability Detection in Machine Learning
Esma Mouine
Yan Liu
Lu Xiao
Rick Kazman
Xiao Wang
129
0
0
05 Jun 2024
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding
Kenneth Enevoldsen
Márton Kardos
Niklas Muennighoff
Kristoffer Nielbo
253
20
0
04 Jun 2024
Predicting drug-gene relations via analogy tasks with word embeddings
Hiroaki Yamagiwa
Ryoma Hashimoto
Kiwamu Arakane
Ken Murakami
Shou Soeda
Momose Oyama
Yihua Zhu
Mariko Okada
Hidetoshi Shimodaira
421
2
0
03 Jun 2024
Multimodal Metadata Assignment for Cultural Heritage Artifacts
Luis Rei
Dunja Mladenić
M. Dorozynski
Franz Rottensteiner
Thomas Schleider
Raphael Troncy
J. Lozano
Mar Gaitán Salvatella
281
13
0
01 Jun 2024
Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark
Hongliu Cao
AI4TS
325
31
0
27 May 2024
E2Vec: Feature Embedding with Temporal Information for Analyzing Student Actions in E-Book Systems
Yuma Miyazaki
Valdemar Švábenský
Yuta Taniguchi
Fumiya Okubo
T. Minematsu
Atsushi Shimada
158
2
0
24 May 2024
Spatio-temporal Value Semantics-based Abstraction for Dense Deep Reinforcement Learning
Jihui Nie
Dehui Du
Jiangnan Zhao
AI4CE
162
0
0
24 May 2024
360Zhinao Technical Report
360Zhinao Team
218
0
0
22 May 2024
"You should probably read this": Hedge Detection in Text
Denys Katerenchuk
Rivka Levitan
241
1
0
22 May 2024
GotFunding: A grant recommendation system based on scientific articles
Tong Zeng
Daniel Ernesto Acuna
AI4TS
85
4
0
21 May 2024
Reducing Biases towards Minoritized Populations in Medical Curricular Content via Artificial Intelligence for Fairer Health Outcomes
Chiman Salavati
Shannon Song
Willmar Sosa Diaz
Scott A. Hale
Roberto E. Montenegro
Fabricio Murai
Shiri Dori-Hacohen
84
5
0
21 May 2024
A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus
Eduard Poesina
Cornelia Caragea
Radu Tudor Ionescu
216
8
0
20 May 2024
Large Language Models Lack Understanding of Character Composition of Words
Andrew Shin
Kunitake Kaneko
421
19
0
18 May 2024
Multilingual Substitution-based Word Sense Induction
International Conference on Language Resources and Evaluation (LREC), 2024
Denis Kokosinskii
Nikolay Arefyev
184
2
0
17 May 2024
PL-MTEB: Polish Massive Text Embedding Benchmark
Rafal Poświata
Slawomir Dadas
Michal Perelkiewicz
169
16
0
16 May 2024
RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Liam Dugan
Alyssa Hwang
Filip Trhlik
Josh Magnus Ludan
Andrew Zhu
Hainiu Xu
Daphne Ippolito
Christopher Callison-Burch
DeLMO
AAML
296
102
0
13 May 2024
A Comprehensive Analysis of Static Word Embeddings for Turkish
Expert systems with applications (ESWA), 2024
Karahan Sarıtaş
Cahid Arda Öz
Tunga Güngör
141
10
0
13 May 2024
LLAniMAtion: LLAMA Driven Gesture Animation
John T. Windle
Iain Matthews
Sarah Taylor
249
1
0
13 May 2024
Word-specific tonal realizations in Mandarin
Yu-Ying Chuang
Melanie J. Bell
Yu-Hsiang Tseng
R. Baayen
463
7
0
11 May 2024
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias
Neural Information Processing Systems (NeurIPS), 2024
Shan Chen
Jack Gallifant
Mingye Gao
Pedro Moreira
Nikolaj Munch
...
Hugo J. W. L. Aerts
Brian Anthony
Leo Anthony Celi
William G. La Cava
Danielle S. Bitterman
194
17
0
09 May 2024
Honeyfile Camouflage: Hiding Fake Files in Plain Sight
Roelien C. Timmer
David Liebowitz
Surya Nepal
S. Kanhere
66
0
0
08 May 2024
Revisiting character-level adversarial attacks
Elias Abad Rocamora
Yongtao Wu
Fanghui Liu
Grigorios G. Chrysos
Volkan Cevher
AAML
244
6
0
07 May 2024
Few Shot Class Incremental Learning using Vision-Language models
Anurag Kumar
Chinmay Bharti
Saikat Dutta
Srikrishna Karanam
Biplab Banerjee
VLM
CLL
243
1
0
02 May 2024
Unsupervised Binary Code Translation with Application to Code Similarity Detection and Vulnerability Discovery
Iftakhar Ahmad
Lannan Luo
220
1
0
29 Apr 2024
Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs
Bahar Radmehr
Adish Singla
Tanja Käser
LLMAG
AI4CE
227
8
0
29 Apr 2024
GPT-4 passes most of the 297 written Polish Board Certification Examinations
Jakub Pokrywka
Jeremi Kaczmarek
Edward Gorzelańczyk
LM&MA
ELM
195
7
0
29 Apr 2024
OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search
Prabhat Agarwal
SK Minhazul Islam
Nikil Pancha
Kurchi Subhra Hazra
Jiajing Xu
Chuck Rosenberg
206
14
0
25 Apr 2024
Enhancing Embedding Performance through Large Language Model-based Text Enrichment and Rewriting
Nicholas Harris
Anand Butani
Syed Hashmy
159
10
0
18 Apr 2024
Context-Aware Siamese Networks for Efficient Emotion Recognition in Conversation
Barbara Gendron
Gaël Guibon
236
1
0
17 Apr 2024
AI Competitions and Benchmarks: Dataset Development
Romain Egele
Julio C. S. Jacques Junior
Jan N. van Rijn
Isabelle M Guyon
Xavier Baró
Albert Clapés
Dali Wang
Sergio Escalera
T. Moeslund
Jun Wan
173
0
0
15 Apr 2024
Relational Prompt-based Pre-trained Language Models for Social Event Detection
Pu Li
Xiaoyan Yu
Hao Peng
Yantuan Xian
Linqin Wang
Li Sun
Jingyun Zhang
Philip S. Yu
243
16
0
12 Apr 2024
Measuring Cross-lingual Transfer in Bytes
Leandro Rodrigues de Souza
Thales Sales Almeida
R.A. Lotufo
Rodrigo Nogueira
CLL
133
4
0
12 Apr 2024
Identifying Shopping Intent in Product QA for Proactive Recommendations
B. Fetahu
Nachshon Cohen
Elad Haramaty
L. Lewin-Eytan
Oleg Rokhlenko
S. Malmasi
176
0
0
09 Apr 2024
Towards Realistic Few-Shot Relation Extraction: A New Meta Dataset and Evaluation
F. Alam
M. Islam
Robert Vacareanu
Mihai Surdeanu
206
5
0
05 Apr 2024
How Lexical is Bilingual Lexicon Induction?
Harsh Kohli
Helian Feng
Nicholas Dronen
Calvin McCarter
Sina Moeini
Ali Kebarighotbi
263
2
0
05 Apr 2024
Knowledge Graph Representation for Political Information Sources
Tinatin Osmonova
Alexey Tikhonov
Ivan P. Yamshchikov
126
0
0
04 Apr 2024
Multi-modal Learning for WebAssembly Reverse Engineering
International Symposium on Software Testing and Analysis (ISSTA), 2024
Hanxian Huang
Jishen Zhao
231
5
0
04 Apr 2024
Toward Informal Language Processing: Knowledge of Slang in Large Language Models
North American Chapter of the Association for Computational Linguistics (NAACL), 2024
Zhewei Sun
Qian Hu
Rahul Gupta
Richard Zemel
Yang Xu
236
11
0
02 Apr 2024
Constructing and Expanding Low-Resource and Underrepresented Parallel Datasets for Indonesian Local Languages
Joanito Agili Lopo
Radius Tanone
245
6
0
01 Apr 2024
A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias
Yuemei Xu
Ling Hu
Jiayi Zhao
Zihan Qiu
Yuqi Ye
Hanwen Gu
LRM
447
94
0
01 Apr 2024