v1v2 (latest)

Enriching Word Vectors with Subword Information

Transactions of the Association for Computational Linguistics (TACL), 2016

15 July 2016

Papers citing "Enriching Word Vectors with Subword Information"

50 / 2,761 papers shown

Latent Functional Maps: a spectral framework for representation alignment

522

20 Jun 2024

Lexically Grounded Subword SegmentationConference on Empirical Methods in Natural Language Processing (EMNLP), 2024

Jindřich Libovický

Jindřich Helcl

263

19 Jun 2024

Integrating Representational Gestures into Automatically Generated Embodied Explanations and its Effects on Understanding and Interaction Quality

289

18 Jun 2024

Building Knowledge-Guided Lexica to Model Cultural Variation

Sharath Chandra Guntuku

Lyle Ungar

268

17 Jun 2024

Revisiting Cosine Similarity via Normalized ICA-transformed Embeddings

276

16 Jun 2024

A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges

H. Vincent Poor

Qingsong Wen

Stefan Zohren

AIFin

305

118

15 Jun 2024

HelpSteer2: Open-source dataset for training top-performing reward models

Zhilin Wang

Yi Dong

Jimmy J. Zhang

Makesh Narsimhan Sreedhar

Oleksii Kuchaiev

AI4TS

313

163

12 Jun 2024

Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification

Martin Juan José Bucher

Marco Martini

ALM AI4MH

335

12 Jun 2024

MaskLID: Code-Switching Language Identification through Iterative Masking

Amir Hossein Kargaran

François Yvon

Hinrich Schütze

151

10 Jun 2024

Every Answer Matters: Evaluating Commonsense with Probabilistic MeasuresAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

266

06 Jun 2024

Explaining the Contributing Factors for Vulnerability Detection in Machine Learning

Yan Liu

129

05 Jun 2024

The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding

253

04 Jun 2024

Predicting drug-gene relations via analogy tasks with word embeddings

421

03 Jun 2024

Multimodal Metadata Assignment for Cultural Heritage Artifacts

Mar Gaitán Salvatella

281

01 Jun 2024

Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark

Hongliu Cao

AI4TS

325

27 May 2024

E2Vec: Feature Embedding with Temporal Information for Analyzing Student Actions in E-Book Systems

158

24 May 2024

Spatio-temporal Value Semantics-based Abstraction for Dense Deep Reinforcement Learning

162

24 May 2024

360Zhinao Technical Report

360Zhinao Team

218

22 May 2024

''You should probably read this'': Hedge Detection in Text

Denys Katerenchuk

Rivka Levitan

241

22 May 2024

GotFunding: A grant recommendation system based on scientific articles

Tong Zeng

Daniel Ernesto Acuna

AI4TS

21 May 2024

Reducing Biases towards Minoritized Populations in Medical Curricular Content via Artificial Intelligence for Fairer Health Outcomes

Roberto E. Montenegro

Fabricio Murai

Shiri Dori-Hacohen

21 May 2024

A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus

Eduard Poesina

Cornelia Caragea

Radu Tudor Ionescu

216

20 May 2024

Large Language Models Lack Understanding of Character Composition of Words

Andrew Shin

Kunitake Kaneko

421

18 May 2024

Multilingual Substitution-based Word Sense InductionInternational Conference on Language Resources and Evaluation (LREC), 2024

Denis Kokosinskii

Nikolay Arefyev

184

17 May 2024

PL-MTEB: Polish Massive Text Embedding Benchmark

Rafal Po'swiata

Slawomir Dadas

Michal Perelkiewicz

169

16 May 2024

RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text DetectorsAnnual Meeting of the Association for Computational Linguistics (ACL), 2024

Christopher Callison-Burch

DeLMO AAML

296

102

13 May 2024

A Comprehensive Analysis of Static Word Embeddings for TurkishExpert systems with applications (ESWA), 2024

Karahan Sarıtaş

Cahid Arda Öz

Tunga Güngör

141

13 May 2024

LLAniMAtion: LLAMA Driven Gesture Animation

John T. Windle

Iain Matthews

Sarah Taylor

249

13 May 2024

Word-specific tonal realizations in Mandarin

463

11 May 2024

Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model BiasNeural Information Processing Systems (NeurIPS), 2024

Shan Chen

...

Danielle S. Bitterman

194

09 May 2024

Honeyfile Camouflage: Hiding Fake Files in Plain Sight

08 May 2024

Revisiting character-level adversarial attacks

244

07 May 2024

Few Shot Class Incremental Learning using Vision-Language models

243

02 May 2024

Unsupervised Binary Code Translation with Application to Code Similarity Detection and Vulnerability Discovery

Iftakhar Ahmad

Lannan Luo

220

29 Apr 2024

Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs

227

29 Apr 2024

GPT-4 passes most of the 297 written Polish Board Certification Examinations

195

29 Apr 2024

OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search

206

25 Apr 2024

Enhancing Embedding Performance through Large Language Model-based Text Enrichment and Rewriting

Nicholas Harris

Anand Butani

Syed Hashmy

159

18 Apr 2024

Context-Aware Siamese Networks for Efficient Emotion Recognition in Conversation

Barbara Gendron

Gaël Guibon

236

17 Apr 2024

AI Competitions and Benchmarks: Dataset Development

Romain Egele

Julio C. S. Jacques Junior

Jan N. van Rijn

173

15 Apr 2024

Relational Prompt-based Pre-trained Language Models for Social Event Detection

Pu Li

Xiaoyan Yu

Hao Peng

Philip S. Yu

243

12 Apr 2024

Measuring Cross-lingual Transfer in Bytes

Leandro Rodrigues de Souza

133

12 Apr 2024

Identifying Shopping Intent in Product QA for Proactive Recommendations

176

09 Apr 2024

Towards Realistic Few-Shot Relation Extraction: A New Meta Dataset and Evaluation

206

05 Apr 2024

How Lexical is Bilingual Lexicon Induction?

263

05 Apr 2024

Knowledge Graph Representation for Political Information Sources

Tinatin Osmonova

Alexey Tikhonov

Ivan P. Yamshchikov

126

04 Apr 2024

Multi-modal Learning for WebAssembly Reverse EngineeringInternational Symposium on Software Testing and Analysis (ISSTA), 2024

Hanxian Huang

Jishen Zhao

231

04 Apr 2024

Toward Informal Language Processing: Knowledge of Slang in Large Language ModelsNorth American Chapter of the Association for Computational Linguistics (NAACL), 2024

236

02 Apr 2024

Constructing and Expanding Low-Resource and Underrepresented Parallel Datasets for Indonesian Local Languages

Joanito Agili Lopo

Radius Tanone

245

01 Apr 2024

A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias

447

01 Apr 2024