ResearchTrend.AI
Simple, Scalable Adaptation for Neural Machine Translation

18 September 2019
Ankur Bapna
N. Arivazhagan
Orhan Firat
    AI4CE

Papers citing "Simple, Scalable Adaptation for Neural Machine Translation"

50 / 256 papers shown
Not All LoRA Parameters Are Essential: Insights on Inference Necessity
Guanhua Chen
Yutong Yao
Ci-Jun Gao
Lidia S. Chao
Feng Wan
Derek F. Wong
39
0
0
30 Mar 2025
From Priest to Doctor: Domain Adaptation for Low-Resource Neural Machine Translation
Ali Marashian
Enora Rice
Luke Gessler
Alexis Palmer
K. Wense
79
1
0
24 Feb 2025
Music for All: Representational Bias and Cross-Cultural Adaptability of Music Generation Models
Atharva Mehta
Shivam Chauhan
Amirbek Djanibekov
Atharva Kulkarni
Gus Xia
Monojit Choudhury
69
0
0
11 Feb 2025
Learning to Adapt to Low-Resource Paraphrase Generation
Zhigen Li
Yanmeng Wang
Rizhao Fan
Ye Wang
Jianfeng Li
Shaojun Wang
113
3
0
22 Dec 2024
Transducer Tuning: Efficient Model Adaptation for Software Tasks Using Code Property Graphs
Imam Nur Bani Yusuf
Lingxiao Jiang
82
0
0
18 Dec 2024
Adapter-based Approaches to Knowledge-enhanced Language Models -- A Survey
Alexander Fichtl
Juraj Vladika
Georg Groh
KELM
78
0
0
25 Nov 2024
Expanding Sparse Tuning for Low Memory Usage
Shufan Shen
Junshu Sun
Xiangyang Ji
Qingming Huang
Shuhui Wang
40
0
0
04 Nov 2024
Towards Optimal Adapter Placement for Efficient Transfer Learning
Aleksandra I. Nowak
Otniel-Bogdan Mercea
Anurag Arnab
Jonas Pfeiffer
Yann N. Dauphin
Utku Evci
25
0
0
21 Oct 2024
Scalable Multi-Domain Adaptation of Language Models using Modular Experts
Peter Schafhalter
Shun Liao
Yanqi Zhou
Chih-Kuan Yeh
Arun Kandoor
James Laudon
MoE
26
1
0
14 Oct 2024
OD-Stega: LLM-Based Near-Imperceptible Steganography via Optimized Distributions
Yu-Shin Huang
Peter Just
Krishna Narayanan
Chao Tian
34
1
0
06 Oct 2024
Evaluating and explaining training strategies for zero-shot cross-lingual news sentiment analysis
Luka Andrenšek
Boshko Koloski
Andraz Pelicon
Nada Lavrac
Senja Pollak
Matthew Purver
21
1
0
30 Sep 2024
Scalable Fine-tuning from Multiple Data Sources: A First-Order Approximation Approach
Dongyue Li
Ziniu Zhang
Lu Wang
Hongyang R. Zhang
38
0
0
28 Sep 2024
Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification
Ming Li
J. Zhong
Chenxin Li
Liuzhuozheng Li
Nie Lin
Masashi Sugiyama
CLIP
VLM
28
4
0
25 Sep 2024
Parameter-Efficient Transfer Learning under Federated Learning for Automatic Speech Recognition
Xuan Kan
Yonghui Xiao
Tien-Ju Yang
Nanxin Chen
Rajiv Mathews
FedML
21
0
0
19 Aug 2024
LoRA-Pro: Are Low-Rank Adapters Properly Optimized?
Zhengbo Wang
Jian Liang
Ran He
Zilei Wang
Tieniu Tan
50
15
0
25 Jul 2024
Fixed and Adaptive Simultaneous Machine Translation Strategies Using Adapters
Abderrahmane Issam
Yusuf Can Semerci
Jan Scholtes
Gerasimos Spanakis
42
0
0
18 Jul 2024
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning
Haiwen Diao
Bo Wan
Xu Jia
Yunzhi Zhuge
Ying Zhang
Huchuan Lu
Long Chen
VLM
47
4
0
10 Jul 2024
Investigating the potential of Sparse Mixtures-of-Experts for multi-domain neural machine translation
Nadezhda Chirkova
Vassilina Nikoulina
Jean-Luc Meunier
Alexandre Berard
MoE
34
0
0
01 Jul 2024
Crosslingual Capabilities and Knowledge Barriers in Multilingual Large Language Models
Lynn Chua
Badih Ghazi
Yangsibo Huang
Pritish Kamath
Ravi Kumar
Pasin Manurangsi
Amer Sinha
Chulin Xie
Chiyuan Zhang
63
1
0
23 Jun 2024
GOAL: A Generalist Combinatorial Optimization Agent Learner
Darko Drakulic
Sofia Michel
J. Andreoli
39
6
0
21 Jun 2024
Mixture of In-Context Prompters for Tabular PFNs
Derek Xu
Olcay Cirit
Reza Asadi
Yizhou Sun
Wei Wang
31
9
0
25 May 2024
RE-Adapt: Reverse Engineered Adaptation of Large Language Models
William Fleshman
Benjamin Van Durme
VLM
29
3
0
23 May 2024
DP-DyLoRA: Fine-Tuning Transformer-Based Models On-Device under Differentially Private Federated Learning using Dynamic Low-Rank Adaptation
Jie Xu
Karthikeyan P. Saravanan
Rogier van Dalen
Haaris Mehmood
David Tuckey
Mete Ozay
56
5
0
10 May 2024
Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval
Jiacheng Cheng
Hijung Valentina Shin
Nuno Vasconcelos
Bryan C. Russell
Fabian Caba Heilbron
VLM
29
1
0
06 May 2024
The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification
Minh Duc Bui
K. Wense
31
0
0
03 May 2024
No Train but Gain: Language Arithmetic for training-free Language Adapters enhancement
Mateusz Klimaszewski
Piotr Andruszkiewicz
Alexandra Birch
MoMe
47
4
0
24 Apr 2024
TartuNLP @ SIGTYP 2024 Shared Task: Adapting XLM-RoBERTa for Ancient and Historical Languages
Aleksei Dorkin
Kairit Sirts
19
1
0
19 Apr 2024
AdaIR: Exploiting Underlying Similarities of Image Restoration Tasks with Adapters
Hao-Wei Chen
Yu-Syuan Xu
Kelvin C. K. Chan
Hsien-Kai Kuo
Chun-Yi Lee
Ming-Hsuan Yang
29
1
0
17 Apr 2024
Neuron Specialization: Leveraging intrinsic task modularity for multilingual machine translation
Shaomu Tan
Di Wu
Christof Monz
MoMe
34
7
0
17 Apr 2024
Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers
Nithin Gopalakrishnan Nair
Jeya Maria Jose Valanarasu
Vishal M. Patel
40
1
0
15 Apr 2024
AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees
William Fleshman
Aleem Khan
Marc Marone
Benjamin Van Durme
CLL
KELM
55
3
0
12 Apr 2024
F-MALLOC: Feed-forward Memory Allocation for Continual Learning in Neural Machine Translation
Junhong Wu
Yuchen Liu
Chengqing Zong
CLL
36
1
0
07 Apr 2024
Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation
Tong Su
Xin Peng
Sarubi Thillainathan
David Guzmán
Surangika Ranathunga
En-Shiun Annie Lee
35
2
0
05 Apr 2024
Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation
Mateusz Klimaszewski
Piotr Andruszkiewicz
Alexandra Birch
22
0
0
27 Mar 2024
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
Biao Zhang
Zhongtao Liu
Colin Cherry
Orhan Firat
LRM
52
124
0
27 Feb 2024
Does Combining Parameter-efficient Modules Improve Few-shot Transfer Accuracy?
Nader Asadi
Mahdi Beitollahi
Yasser H. Khalil
Yinchuan Li
Guojun Zhang
Xi Chen
MoMe
35
8
0
23 Feb 2024
ColBERT-XM: A Modular Multi-Vector Representation Model for Zero-Shot Multilingual Information Retrieval
Antoine Louis
V. Saxena
Gijs van Dijck
Gerasimos Spanakis
40
5
0
23 Feb 2024
Key ingredients for effective zero-shot cross-lingual knowledge transfer in generative tasks
Nadezhda Chirkova
Vassilina Nikoulina
26
7
0
19 Feb 2024
Team QUST at SemEval-2024 Task 8: A Comprehensive Study of Monolingual and Multilingual Approaches for Detecting AI-generated Text
Xiaoman Xu
Xiangrun Li
Taihang Wang
Jianxiang Tian
Ye Jiang
DeLMO
29
3
0
19 Feb 2024
Prompt-Based Bias Calibration for Better Zero/Few-Shot Learning of Language Models
Kang He
Yinghan Long
Kaushik Roy
28
2
0
15 Feb 2024
Efficient Language Adaptive Pre-training: Extending State-of-the-Art Large Language Models for Polish
Szymon Ruciński
36
5
0
15 Feb 2024
MAFIA: Multi-Adapter Fused Inclusive LanguAge Models
Prachi Jain
Ashutosh Sathe
Varun Gumma
Kabir Ahuja
Sunayana Sitaram
28
1
0
12 Feb 2024
A Morphologically-Aware Dictionary-based Data Augmentation Technique for Machine Translation of Under-Represented Languages
Md Mahfuz Ibn Alam
Sina Ahmadi
Antonios Anastasopoulos
54
0
0
02 Feb 2024
What the Weight?! A Unified Framework for Zero-Shot Knowledge Composition
Carolin Holtermann
Markus Frohmann
Navid Rekabsaz
Anne Lauscher
MoMe
24
5
0
23 Jan 2024
Leveraging Large Language Models for NLG Evaluation: Advances and Challenges
Zhen Li
Xiaohan Xu
Tao Shen
Can Xu
Jia-Chen Gu
Yuxuan Lai
Chongyang Tao
Shuai Ma
LM&MA
ELM
34
9
0
13 Jan 2024
Chain of History: Learning and Forecasting with LLMs for Temporal Knowledge Graph Completion
Ruilin Luo
Tianle Gu
Haoling Li
Junzhe Li
Zicheng Lin
Jiayi Li
Yujiu Yang
AI4CE
31
7
0
11 Jan 2024
Chain of LoRA: Efficient Fine-tuning of Language Models via Residual Learning
Wenhan Xia
Chengwei Qin
Elad Hazan
54
52
0
08 Jan 2024
Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs
Juraj Vladika
Alexander Fichtl
Florian Matthes
KELM
22
1
0
21 Dec 2023
Parameter-Efficient Fine-Tuning Methods for Pretrained Language Models: A Critical Review and Assessment
Lingling Xu
Haoran Xie
S. J. Qin
Xiaohui Tao
F. Wang
46
132
0
19 Dec 2023
Gradient-based Parameter Selection for Efficient Fine-Tuning
Zhi Zhang
Qizhe Zhang
Zijun Gao
Renrui Zhang
Ekaterina Shutova
Shiji Zhou
Shanghang Zhang
28
15
0
15 Dec 2023