Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1409.3215
Cited By
Sequence to Sequence Learning with Neural Networks
10 September 2014
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sequence to Sequence Learning with Neural Networks"
50 / 4,779 papers shown
Title
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles
Rui Zhao
Weijia Mao
Mike Zheng Shou
66
0
0
05 Mar 2025
Deep Causal Behavioral Policy Learning: Applications to Healthcare
Jonas Knecht
Anna Zink
Jonathan Kolstad
Maya Petersen
CML
88
0
0
05 Mar 2025
BioD2C: A Dual-level Semantic Consistency Constraint Framework for Biomedical VQA
Zhengyang Ji
Shang Gao
Li Liu
Yifan Jia
Yutao Yue
46
0
0
04 Mar 2025
Learning to Substitute Components for Compositional Generalization
ZeLin Li
Gangwei Jiang
Chenwang Wu
Ying Wei
Defu Lian
Enhong Chen
64
0
0
28 Feb 2025
Fast 3D point clouds retrieval for Large-scale 3D Place Recognition
Chahine-Nicolas Zede
Laurent Carrafa
Valérie Gouet-Brunet
3DPC
43
0
0
28 Feb 2025
Chitranuvad: Adapting Multi-Lingual LLMs for Multimodal Translation
Shaharukh Khan
Ayush Tarun
Ali Faraz
Palash Kamble
Vivek Dahiya
Praveen Kumar Pokala
Ashish Kulkarni
Chandra Khatri
Abhinav Ravi
Shubham Agarwal
184
0
0
27 Feb 2025
Introduction to Sequence Modeling with Transformers
Joni-Kristian Kämäräinen
62
1
0
26 Feb 2025
Chemical knowledge-informed framework for privacy-aware retrosynthesis learning
Guikun Chen
Xu Zhang
Yuqing Yang
Wenguan Wang
47
0
0
26 Feb 2025
SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models
Yuxuan Zhang
CLL
ALM
73
1
0
25 Feb 2025
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences
Yangshijie Zhang
AAML
47
0
0
24 Feb 2025
More for Keys, Less for Values: Adaptive KV Cache Quantization
Mohsen Hariri
Lam Nguyen
Sixu Chen
Shaochen Zhong
Qifan Wang
Xia Hu
Xiaotian Han
V. Chaudhary
MQ
48
0
0
24 Feb 2025
Forecasting Local Ionospheric Parameters Using Transformers
D. J. Alford-Lago
C. Curtis
Alexander T. Ihler
Katherine A. Zawdie
Douglas P. Drob
69
0
0
24 Feb 2025
CSTRL: Context-Driven Sequential Transfer Learning for Abstractive Radiology Report Summarization
Mst. Fahmida Sultana Naznin
Adnan Ibney Faruq
Mostafa Rifat Tazwar
Md Jobayer
Md. Mehedi Hasan Shawon
Md Rakibul Hasan
MedIm
38
0
0
21 Feb 2025
Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning
Caihua Liu
Xu Li
Wenjing Xue
Wei Tang
Xia Feng
56
0
0
20 Feb 2025
Quantum Recurrent Neural Networks with Encoder-Decoder for Time-Dependent Partial Differential Equations
Yuan Chen
Abdul Khaliq
Khaled M. Furati
AI4CE
61
0
0
20 Feb 2025
Language Models Can Predict Their Own Behavior
Dhananjay Ashok
Jonathan May
ReLM
AI4TS
LRM
63
0
0
18 Feb 2025
Investigating Inference-time Scaling for Chain of Multi-modal Thought: A Preliminary Study
Yujie Lin
Ante Wang
Moye Chen
Jingyao Liu
Hao Liu
Jinsong Su
Xinyan Xiao
LRM
50
2
0
17 Feb 2025
Spatiotemporal Graph Neural Networks in short term load forecasting: Does adding Graph Structure in Consumption Data Improve Predictions?
Quoc Viet Nguyen
Joaquín Delgado Fernández
Sergio Potenciano Menci
AI4TS
59
0
0
14 Feb 2025
A Robust Attack: Displacement Backdoor Attack
Yong Li
Han Gao
AAML
36
0
0
14 Feb 2025
Theoretical Benefit and Limitation of Diffusion Language Model
Guhao Feng
Yihan Geng
Jian Guan
Wei Wu
Liwei Wang
Di He
DiffM
55
0
0
13 Feb 2025
Enhancing LLM Character-Level Manipulation via Divide and Conquer
Zhen Xiong
Yujun Cai
Bryan Hooi
Nanyun Peng
Kai-Wei Chang
Zhecheng Li
70
0
0
12 Feb 2025
What makes a good feedforward computational graph?
Alex Vitvitskyi
J. G. Araújo
Marc Lackenby
Petar Velickovic
85
1
0
10 Feb 2025
Do we really have to filter out random noise in pre-training data for language models?
Jinghan Ru
Yuxin Xie
Xianwei Zhuang
Yuguo Yin
Zhihui Guo
Zhiming Liu
Qianli Ren
Yuexian Zou
83
4
0
10 Feb 2025
Comprehensive Framework for Evaluating Conversational AI Chatbots
Shailja Gupta
Rajesh Ranjan
Surya Narayan Singh
46
0
0
10 Feb 2025
A comparison of translation performance between DeepL and Supertext
Alex Flückiger
Chantal Amrhein
Tim Graf
Frédéric Odermatt
Martin Pömsl
Philippe Schläpfer
Florian Schottmann
Samuel Laubli
ELM
45
0
0
04 Feb 2025
Emotion Recognition and Generation: A Comprehensive Review of Face, Speech, and Text Modalities
Rebecca Mobbs
Dimitrios Makris
Vasileios Argyriou
43
0
0
02 Feb 2025
A Hardware-Efficient Photonic Tensor Core: Accelerating Deep Neural Networks with Structured Compression
Shupeng Ning
Hanqing Zhu
Chenghao Feng
Jiaqi Gu
David Z. Pan
Ray T. Chen
42
0
0
01 Feb 2025
HadamRNN: Binary and Sparse Ternary Orthogonal RNNs
Armand Foucault
Franck Mamalet
François Malgouyres
MQ
85
0
0
28 Jan 2025
State-space models are accurate and efficient neural operators for dynamical systems
Zheyuan Hu
Nazanin Ahmadi Daryakenari
Qianli Shen
Kenji Kawaguchi
George Karniadakis
Mamba
AI4CE
75
13
0
28 Jan 2025
Can summarization approximate simplification? A gold standard comparison
Giacomo Magnifico
Eduard Barbu
35
0
0
28 Jan 2025
Data re-uploading in Quantum Machine Learning for time series: application to traffic forecasting
Nikolaos Schetakis
Paolo Bonfini
Negin Alisoltani
Konstantinos Blazakis
Symeon I. Tsintzos
Alexis Askitopoulos
Davit Aghamalyan
Panagiotis Fafoutellis
Eleni I. Vlahogianni
51
0
0
22 Jan 2025
Reliable Text-to-SQL with Adaptive Abstention
Kaiwen Chen
Yueting Chen
Xiaohui Yu
Nick Koudas
RALM
41
0
0
18 Jan 2025
The Theater Stage as Laboratory: Review of Real-Time Comedy LLM Systems for Live Performance
Piotr Wojciech Mirowski
Boyd Branch
Kory W. Mathewson
41
0
0
14 Jan 2025
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering
Anupam Pandey
Deepjyoti Bodo
Arpan Phukan
Asif Ekbal
46
0
0
13 Jan 2025
TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer
Vladimir Bataev
Subhankar Ghosh
Vitaly Lavrukhin
Jason Chun Lok Li
AI4TS
46
0
0
10 Jan 2025
On Creating A Brain-To-Text Decoder
Zenon Lamprou
Yashar Moshfeghi
41
0
0
10 Jan 2025
Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation
Alireza Salemi
Cheng-rong Li
Mingyang Zhang
Qiaozhu Mei
Weize Kong
Tao Chen
Zhuowan Li
Michael Bendersky
Hamed Zamani
LRM
RALM
ReLM
52
6
0
07 Jan 2025
Understanding How Nonlinear Layers Create Linearly Separable Features for Low-Dimensional Data
Alec S. Xu
Can Yaras
Peng Wang
Q. Qu
32
0
0
04 Jan 2025
Exploring the Implicit Semantic Ability of Multimodal Large Language Models: A Pilot Study on Entity Set Expansion
Hebin Wang
Yangning Li
Hai-Tao Zheng
Hai-Tao Zheng
Wenhao Jiang
Hong-Gee Kim
44
0
0
03 Jan 2025
PsychAdapter: Adapting LLM Transformers to Reflect Traits, Personality and Mental Health
Huy-Hien Vu
Huy Anh Nguyen
Adithya V Ganesan
Swanie Juhng
O. Kjell
...
Margaret L. Kern
Ryan L. Boyd
L. Ungar
H. Andrew Schwartz
J. Eichstaedt
81
0
0
03 Jan 2025
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language
Jeong Hun Yeo
Chae Won Kim
Hyunjun Kim
Hyeongseop Rha
Seunghee Han
Wen-Huang Cheng
Y. Ro
59
3
0
03 Jan 2025
A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine
Hanguang Xiao
Feizhong Zhou
Xianglong Liu
Tianqi Liu
Zhipeng Li
Xin Liu
Xiaoxuan Huang
AILaw
LM&MA
LRM
61
18
0
31 Dec 2024
STAHGNet: Modeling Hybrid-grained Heterogenous Dependency Efficiently for Traffic Prediction
Yufei Guo
Zehua Peng
Yijia Zhang
Dengbo He
Lei Chen
39
0
0
23 Dec 2024
Speech-Based Depression Prediction Using Encoder-Weight-Only Transfer Learning and a Large Corpus
Amir Harati
Elizabeth Shriberg
Tomasz Rutowski
Piotr Chlebek
Yang Lu
Ricardo Oliveira
97
21
0
22 Dec 2024
PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time
Alireza Pourali
Arian Boukani
Hamzeh Khazaei
72
0
0
20 Dec 2024
LAMA-UT: Language Agnostic Multilingual ASR through Orthography Unification and Language-Specific Transliteration
Sangmin Lee
Woo-Jin Chung Hong-Goo Kang
Hong-Goo Kang
85
0
0
19 Dec 2024
Knowledge Distillation in RNN-Attention Models for Early Prediction of Student Performance
Sukrit Leelaluk
Cheng Tang
Valdemar Švábenský
Atsushi Shimada
80
1
0
19 Dec 2024
Hansel: Output Length Controlling Framework for Large Language Models
Seoha Song
Junhyun Lee
Hyeonmok Ko
75
0
0
18 Dec 2024
CRM: Retrieval Model with Controllable Condition
Chi Liu
Jiangxia Cao
Rui Huang
Kuo Cai
Weifeng Ding
Qiang Luo
Kun Gai
Guorui Zhou
72
1
0
18 Dec 2024
Private Yet Social: How LLM Chatbots Support and Challenge Eating Disorder Recovery
Ryuhaerang Choi
Taehan Kim
Subin Park
Jennifer G Kim
Sung-Ju Lee
AI4MH
81
0
0
16 Dec 2024
Previous
1
2
3
4
5
...
94
95
96
Next