1612.03651
FastText.zip: Compressing text classification models
12 December 2016
Armand Joulin
Edouard Grave
Piotr Bojanowski
Matthijs Douze
Hervé Jégou
Tomáš Mikolov
MQ
Papers citing
"FastText.zip: Compressing text classification models"
50 / 99 papers shown
Title
PREMISE: Matching-based Prediction for Accurate Review Recommendation
Wei Han
Hui Chen
Soujanya Poria
29
0
0
02 May 2025
Improving Informally Romanized Language Identification
Adrian Benton
Alexander Gutkin
Christo Kirov
Brian Roark
43
0
0
30 Apr 2025
PythonPal: Enhancing Online Programming Education through Chatbot-Driven Personalized Feedback
Sirinda Palahan
36
0
0
09 Mar 2025
Sorting the Babble in Babel: Assessing the Performance of Language Detection Algorithms on the OpenAlex Database
Maxime Holmberg Sainte-Marie
Diego Kozlowski
Lucía Céspedes
Vincent Larivière
80
0
0
05 Feb 2025
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study
Menglong Cui
Pengzhi Gao
Wei Liu
Jian Luan
Bin Wang
LRM
43
0
0
04 Feb 2025
Enhancing Web Service Anomaly Detection via Fine-grained Multi-modal Association and Frequency Domain Analysis
Xixuan Yang
Xin Huang
Chiming Duan
Tong Jia
Shandong Dong
Ying Li
Gang Huang
59
0
0
28 Jan 2025
Evolutionary Optimization of Model Merging Recipes
Takuya Akiba
Makoto Shing
Yujin Tang
Qi Sun
David Ha
MoMe
98
99
0
28 Jan 2025
MIND: Math Informed syNthetic Dialogues for Pretraining LLMs
Syeda Nahida Akter
Shrimai Prabhumoye
John Kamalu
S. Satheesh
Eric Nyberg
M. Patwary
M. Shoeybi
Bryan Catanzaro
LRM
SyDa
ReLM
98
1
0
15 Oct 2024
Beyond Film Subtitles: Is YouTube the Best Approximation of Spoken Vocabulary?
Adam Nohejl
Frederikus Hudi
Eunike Andriani Kardinata
Shintaro Ozaki
Maria Angelica Riera Machin
Hongyu Sun
Justin Vasselli
Taro Watanabe
23
2
0
04 Oct 2024
Multi-Target Cross-Lingual Summarization: a novel task and a language-neutral approach
Diogo Pernes
Gonçalo M. Correia
Afonso Mendes
21
1
0
01 Oct 2024
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models
Shaoxiong Ji
Zihao Li
Indraneil Paul
Jaakko Paavola
Peiqin Lin
...
Dayyán O'Brien
Hengyu Luo
Hinrich Schütze
Jörg Tiedemann
Barry Haddow
CLL
35
3
0
26 Sep 2024
GraphEx: A Graph-based Extraction Method for Advertiser Keyphrase Recommendation
Ashirbad Mishra
Soumik Dey
Marshall Wu
Jinyu Zhao
He Yu
Kaichen Ni
Binbin Li
Kamesh Madduri
49
1
0
05 Sep 2024
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding
K. Enevoldsen
Márton Kardos
Niklas Muennighoff
Kristoffer Laigaard Nielbo
29
9
0
04 Jun 2024
Methods for Generating Drift in Text Streams
C. M. Garcia
A. L. Koerich
A. Britto
J. P. Barddal
28
1
0
18 Mar 2024
A Rational Analysis of the Speech-to-Song Illusion
Raja Marjieh
Pol van Rijn
Ilia Sucholutsky
Harin Lee
Thomas L. Griffiths
Nori Jacoby
19
1
0
10 Feb 2024
Detecting Structured Language Alternations in Historical Documents by Combining Language Identification with Fourier Analysis
Hale Sirin
Sabrina Li
Thomas Lippincott
11
0
0
25 Jan 2024
Semi-supervised learning via DQN for log anomaly detection
Yingying He
Xiaobing Pei
Lihong Shen
28
1
0
06 Jan 2024
Large Language Models as Topological Structure Enhancers for Text-Attributed Graphs
Shengyin Sun
Yuxiang Ren
Chen Ma
Xuecang Zhang
103
20
0
24 Nov 2023
Experience and Prediction: A Metric of Hardness for a Novel Litmus Test
Nicos Isaak
Loizos Michael
24
3
0
05 Sep 2023
AMOE: a Tool to Automatically Extract and Assess Organizational Evidence for Continuous Cloud Audit
Franz Deimling
Michela Fazzolari
15
1
0
31 Jul 2023
Data Augmentation for Machine Translation via Dependency Subtree Swapping
Attila Nagy
Dorina Lakatos
Botond Barta
Patrick Nanys
Judit Ács
28
1
0
13 Jul 2023
Frameless Graph Knowledge Distillation
Dai Shi
Zhiqi Shao
Yi Guo
Junbin Gao
28
4
0
13 Jul 2023
GPT-SW3: An Autoregressive Language Model for the Nordic Languages
Ariel Ekgren
Amaru Cuba Gyllensten
Felix Stollenwerk
Joey Öhman
T. Isbister
Evangelia Gogoulou
F. Carlsson
Alice Heiman
Judit Casademont
Magnus Sahlgren
21
13
0
22 May 2023
Korean Named Entity Recognition Based on Language-Specific Features
Yige Chen
Kyungtae Lim
Jungyeul Park
11
3
0
10 May 2023
MAUPQA: Massive Automatically-created Polish Question Answering Dataset
Piotr Rybak
26
12
0
09 May 2023
Web Content Filtering through knowledge distillation of Large Language Models
Tamás Vörös
Sean P. Bergeron
Konstantin Berlin
27
7
0
08 May 2023
Hallucinations in Large Multilingual Translation Models
Nuno M. Guerreiro
Duarte M. Alves
Jonas Waldendorf
Barry Haddow
Alexandra Birch
Pierre Colombo
André F.T. Martins
VLM
HILM
LRM
18
140
0
28 Mar 2023
Transformadores: Fundamentos teoricos y Aplicaciones
J. D. L. Torre
63
0
0
18 Feb 2023
Detecting Reddit Users with Depression Using a Hybrid Neural Network SBERT-CNN
Ziyi Chen
Ren Yang
S. Fu
Nansu Zong
Hongfang Liu
Ming Huang
AI4MH
18
14
0
03 Feb 2023
Color Me Intrigued: Quantifying Usage of Colors in Fiction
Siyan Li
10
0
0
09 Jan 2023
The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges
Genta Indra Winata
Alham Fikri Aji
Zheng-Xin Yong
Thamar Solorio
37
33
0
19 Dec 2022
Embedding Compression for Text Classification Using Dictionary Screening
Jing Zhou
Xinru Jing
Mu Liu
Hansheng Wang
19
0
0
23 Nov 2022
SMAuC -- The Scientific Multi-Authorship Corpus
Janek Bevendorff
Philipp Sauer
Lukas Gienapp
Wolfgang Kircheis
Erik Korner
Benno Stein
Martin Potthast
19
0
0
04 Nov 2022
ProVe: A Pipeline for Automated Provenance Verification of Knowledge Graphs against Textual Sources
Gabriel Amaral
Odinaldo Rodrigues
Elena Simperl
14
3
0
26 Oct 2022
SA-MLP: Distilling Graph Knowledge from GNNs into Structure-Aware MLP
Jie Chen
Shouzhen Chen
Mingyuan Bai
Junbin Gao
Junping Zhang
Jian Pu
32
10
0
18 Oct 2022
SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis
Jiaxin Pei
Vítor Silva
Maarten W. Bos
Yozon Liu
Leonardo Neves
David Jurgens
Francesco Barbieri
53
28
0
03 Oct 2022
WikiLink: an encyclopedia-based semantic network for design innovation
H. Zuo
Qianzhi Jing
Tianqi Song
Huiting Liu
Lingyun Sun
P. Childs
Liuqing Chen
HAI
33
12
0
30 Aug 2022
Is this Change the Answer to that Problem? Correlating Descriptions of Bug and Code Changes for Evaluating Patch Correctness
Haoye Tian
Xunzhu Tang
Andrew Habib
Shangwen Wang
Kui Liu
Xin Xia
Jacques Klein
Tegawende F. Bissyande
30
25
0
08 Aug 2022
CSSAM:Code Search via Attention Matching of Code Semantics and Structures
Y. Hu
Bowen Cai
Yaoxiang Yu
15
3
0
08 Aug 2022
Data-Centric Epidemic Forecasting: A Survey
Alexander Rodríguez
Harshavardhan Kamarthi
Pulak Agarwal
Javen Ho
Mira Patel
Suchet Sapre
B. Prakash
OOD
24
18
0
19 Jul 2022
Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection
Peilin Zhou
Dading Chong
Helin Wang
Qingcheng Zeng
16
5
0
23 May 2022
What Do Compressed Multilingual Machine Translation Models Forget?
Alireza Mohammadshahi
Vassilina Nikoulina
Alexandre Berard
Caroline Brun
James Henderson
Laurent Besacier
AI4CE
40
9
0
22 May 2022
Towards Debiasing Translation Artifacts
Koel Dutta Chowdhury
Rricha Jalota
C. España-Bonet
Josef van Genabith
23
6
0
16 May 2022
Logical Inference for Counting on Semi-structured Tables
Tomoya Kurosawa
Hitomi Yanaka
LMTD
15
2
0
16 Apr 2022
CoNTACT: A Dutch COVID-19 Adapted BERT for Vaccine Hesitancy and Argumentation Detection
Jens Lemmens
Jens Van Nooten
Tim Kreutz
Walter Daelemans
14
6
0
14 Mar 2022
Deep Learning for Hate Speech Detection: A Comparative Study
Jitendra Malik
Hezhe Qiao
Guansong Pang
A. Hengel
37
43
0
19 Feb 2022
Log-based Anomaly Detection with Deep Learning: How Far Are We?
Van-Hoang Le
Hongyu Zhang
14
157
0
09 Feb 2022
Calibrated Learning to Defer with One-vs-All Classifiers
Rajeev Verma
Eric Nalisnick
17
42
0
08 Feb 2022
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Julien Abadji
Pedro Ortiz Suarez
Laurent Romary
Benoît Sagot
CLL
34
153
0
17 Jan 2022
NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21
Sandeep Subramanian
Oleksii Hrinchuk
Virginia Adams
Oleksii Kuchaiev
VLM
16
16
0
16 Nov 2021