FastText.zip: Compressing text classification models

12 December 2016

Papers citing "FastText.zip: Compressing text classification models"

PREMISE: Matching-based Prediction for Accurate Review Recommendation Wei Han Hui Chen Soujanya Poria 29 0 0 02 May 2025
Improving Informally Romanized Language Identification Adrian Benton Alexander Gutkin Christo Kirov Brian Roark 43 0 0 30 Apr 2025
PythonPal: Enhancing Online Programming Education through Chatbot-Driven Personalized Feedback Sirinda Palahan 36 0 0 09 Mar 2025
Sorting the Babble in Babel: Assessing the Performance of Language Detection Algorithms on the OpenAlex Database Maxime Holmberg Sainte-Marie Diego Kozlowski Lucía Céspedes Vincent Larivière 80 0 0 05 Feb 2025
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study Menglong Cui Pengzhi Gao Wei Liu Jian Luan Bin Wang LRM 43 0 0 04 Feb 2025
Enhancing Web Service Anomaly Detection via Fine-grained Multi-modal Association and Frequency Domain Analysis Xixuan Yang Xin Huang Chiming Duan Tong Jia Shandong Dong Ying Li Gang Huang 59 0 0 28 Jan 2025
Evolutionary Optimization of Model Merging Recipes Takuya Akiba Makoto Shing Yujin Tang Qi Sun David Ha MoMe 98 99 0 28 Jan 2025
MIND: Math Informed syNthetic Dialogues for Pretraining LLMs Syeda Nahida Akter Shrimai Prabhumoye John Kamalu S. Satheesh Eric Nyberg M. Patwary M. Shoeybi Bryan Catanzaro LRM SyDa ReLM 98 1 0 15 Oct 2024
Beyond Film Subtitles: Is YouTube the Best Approximation of Spoken Vocabulary? Adam Nohejl Frederikus Hudi Eunike Andriani Kardinata Shintaro Ozaki Maria Angelica Riera Machin Hongyu Sun Justin Vasselli Taro Watanabe 23 2 0 04 Oct 2024
Multi-Target Cross-Lingual Summarization: a novel task and a language-neutral approach Diogo Pernes Gonçalo M. Correia Afonso Mendes 21 1 0 01 Oct 2024
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models Shaoxiong Ji Zihao Li Indraneil Paul Jaakko Paavola Peiqin Lin ... Dayyán O'Brien Hengyu Luo Hinrich Schütze Jörg Tiedemann Barry Haddow CLL 35 3 0 26 Sep 2024
GraphEx: A Graph-based Extraction Method for Advertiser Keyphrase Recommendation Ashirbad Mishra Soumik Dey Marshall Wu Jinyu Zhao He Yu Kaichen Ni Binbin Li Kamesh Madduri 49 1 0 05 Sep 2024
The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding K. Enevoldsen Márton Kardos Niklas Muennighoff Kristoffer Laigaard Nielbo 29 9 0 04 Jun 2024
Methods for Generating Drift in Text Streams C. M. Garcia A. L. Koerich A. Britto J. P. Barddal 28 1 0 18 Mar 2024
A Rational Analysis of the Speech-to-Song Illusion Raja Marjieh Pol van Rijn Ilia Sucholutsky Harin Lee Thomas L. Griffiths Nori Jacoby 19 1 0 10 Feb 2024
Detecting Structured Language Alternations in Historical Documents by Combining Language Identification with Fourier Analysis Hale Sirin Sabrina Li Thomas Lippincott 11 0 0 25 Jan 2024
Semi-supervised learning via DQN for log anomaly detection Yingying He Xiaobing Pei Lihong Shen 28 1 0 06 Jan 2024
Large Language Models as Topological Structure Enhancers for Text-Attributed Graphs Shengyin Sun Yuxiang Ren Chen Ma Xuecang Zhang 103 20 0 24 Nov 2023
Experience and Prediction: A Metric of Hardness for a Novel Litmus Test Nicos Isaak Loizos Michael 24 3 0 05 Sep 2023
AMOE: a Tool to Automatically Extract and Assess Organizational Evidence for Continuous Cloud Audit Franz Deimling Michela Fazzolari 15 1 0 31 Jul 2023
Data Augmentation for Machine Translation via Dependency Subtree Swapping Attila Nagy Dorina Lakatos Botond Barta Patrick Nanys Judit Ács 28 1 0 13 Jul 2023
Frameless Graph Knowledge Distillation Dai Shi Zhiqi Shao Yi Guo Junbin Gao 28 4 0 13 Jul 2023
GPT-SW3: An Autoregressive Language Model for the Nordic Languages Ariel Ekgren Amaru Cuba Gyllensten Felix Stollenwerk Joey Öhman T. Isbister Evangelia Gogoulou F. Carlsson Alice Heiman Judit Casademont Magnus Sahlgren 21 13 0 22 May 2023
Korean Named Entity Recognition Based on Language-Specific Features Yige Chen Kyungtae Lim Jungyeul Park 11 3 0 10 May 2023
MAUPQA: Massive Automatically-created Polish Question Answering Dataset Piotr Rybak 26 12 0 09 May 2023
Web Content Filtering through knowledge distillation of Large Language Models Tamás Vörös Sean P. Bergeron Konstantin Berlin 27 7 0 08 May 2023
Hallucinations in Large Multilingual Translation Models Nuno M. Guerreiro Duarte M. Alves Jonas Waldendorf Barry Haddow Alexandra Birch Pierre Colombo André F.T. Martins VLM HILM LRM 18 140 0 28 Mar 2023
Transformadores: Fundamentos teoricos y Aplicaciones J. D. L. Torre 63 0 0 18 Feb 2023
Detecting Reddit Users with Depression Using a Hybrid Neural Network SBERT-CNN Ziyi Chen Ren Yang S. Fu Nansu Zong Hongfang Liu Ming Huang AI4MH 18 14 0 03 Feb 2023
Color Me Intrigued: Quantifying Usage of Colors in Fiction Siyan Li 10 0 0 09 Jan 2023
The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges Genta Indra Winata Alham Fikri Aji Zheng-Xin Yong Thamar Solorio 37 33 0 19 Dec 2022
Embedding Compression for Text Classification Using Dictionary Screening Jing Zhou Xinru Jing Mu Liu Hansheng Wang 19 0 0 23 Nov 2022
SMAuC -- The Scientific Multi-Authorship Corpus Janek Bevendorff Philipp Sauer Lukas Gienapp Wolfgang Kircheis Erik Korner Benno Stein Martin Potthast 19 0 0 04 Nov 2022
ProVe: A Pipeline for Automated Provenance Verification of Knowledge Graphs against Textual Sources Gabriel Amaral Odinaldo Rodrigues Elena Simperl 14 3 0 26 Oct 2022
SA-MLP: Distilling Graph Knowledge from GNNs into Structure-Aware MLP Jie Chen Shouzhen Chen Mingyuan Bai Junbin Gao Junping Zhang Jian Pu 32 10 0 18 Oct 2022
SemEval 2023 Task 9: Multilingual Tweet Intimacy Analysis Jiaxin Pei Vítor Silva Maarten W. Bos Yozon Liu Leonardo Neves David Jurgens Francesco Barbieri 53 28 0 03 Oct 2022
WikiLink: an encyclopedia-based semantic network for design innovation H. Zuo Qianzhi Jing Tianqi Song Huiting Liu Lingyun Sun P. Childs Liuqing Chen HAI 33 12 0 30 Aug 2022
Is this Change the Answer to that Problem? Correlating Descriptions of Bug and Code Changes for Evaluating Patch Correctness Haoye Tian Xunzhu Tang Andrew Habib Shangwen Wang Kui Liu Xin Xia Jacques Klein Tegawende F. Bissyande 30 25 0 08 Aug 2022
CSSAM: Code Search via Attention Matching of Code Semantics and Structures Y. Hu Bowen Cai Yaoxiang Yu 15 3 0 08 Aug 2022
Data-Centric Epidemic Forecasting: A Survey Alexander Rodríguez Harshavardhan Kamarthi Pulak Agarwal Javen Ho Mira Patel Suchet Sapre B. Prakash OOD 24 18 0 19 Jul 2022
Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection Peilin Zhou Dading Chong Helin Wang Qingcheng Zeng 16 5 0 23 May 2022
What Do Compressed Multilingual Machine Translation Models Forget? Alireza Mohammadshahi Vassilina Nikoulina Alexandre Berard Caroline Brun James Henderson Laurent Besacier AI4CE 40 9 0 22 May 2022
Towards Debiasing Translation Artifacts Koel Dutta Chowdhury Rricha Jalota C. España-Bonet Josef van Genabith 23 6 0 16 May 2022
Logical Inference for Counting on Semi-structured Tables Tomoya Kurosawa Hitomi Yanaka LMTD 15 2 0 16 Apr 2022
CoNTACT: A Dutch COVID-19 Adapted BERT for Vaccine Hesitancy and Argumentation Detection Jens Lemmens Jens Van Nooten Tim Kreutz Walter Daelemans 14 6 0 14 Mar 2022
Deep Learning for Hate Speech Detection: A Comparative Study Jitendra Malik Hezhe Qiao Guansong Pang A. Hengel 37 43 0 19 Feb 2022
Log-based Anomaly Detection with Deep Learning: How Far Are We? Van-Hoang Le Hongyu Zhang 14 157 0 09 Feb 2022
Calibrated Learning to Defer with One-vs-All Classifiers Rajeev Verma Eric Nalisnick 17 42 0 08 Feb 2022
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus Julien Abadji Pedro Ortiz Suarez Laurent Romary Benoît Sagot CLL 34 153 0 17 Jan 2022
NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21 Sandeep Subramanian Oleksii Hrinchuk Virginia Adams Oleksii Kuchaiev VLM 16 16 0 16 Nov 2021