ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.17740
  4. Cited By
Embedding Empirical Distributions for Computing Optimal Transport Maps

Embedding Empirical Distributions for Computing Optimal Transport Maps

International Symposium on Information Theory (ISIT), 2025
24 April 2025
Mingchen Jiang
Peng Xu
Xichen Ye
Xiaohui Chen
Yun Yang
Yifan Chen
    OT
ArXiv (abs)PDFHTML

Papers citing "Embedding Empirical Distributions for Computing Optimal Transport Maps"

36 / 36 papers shown
Title
The calculus of variations of the Transformer on the hyperspherical tangent bundle
The calculus of variations of the Transformer on the hyperspherical tangent bundle
Andrew Gracyk
157
0
0
21 Jul 2025
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual RestorationKnowledge Discovery and Data Mining (KDD), 2025
Mengting Ai
Tianxin Wei
Yifan Chen
Zhichen Zeng
Ritchie Zhao
G. Varatkar
B. Rouhani
Xianfeng Tang
Hanghang Tong
Jingrui He
MoE
259
9
0
10 Mar 2025
OT-Transformer: A Continuous-time Transformer Architecture with Optimal Transport Regularization
OT-Transformer: A Continuous-time Transformer Architecture with Optimal Transport Regularization
Kelvin Kan
Xingjian Li
Stanley Osher
395
2
0
30 Jan 2025
Wasserstein Wormhole: Scalable Optimal Transport Distance with
  Transformers
Wasserstein Wormhole: Scalable Optimal Transport Distance with Transformers
Doron Haviv
Russell Z. Kunes
Thomas Dougherty
Cassandra Burdziak
T. Nawy
Anna Gilbert
Dana Peér
OT
405
11
0
15 Apr 2024
Convergence of flow-based generative models via proximal gradient
  descent in Wasserstein space
Convergence of flow-based generative models via proximal gradient descent in Wasserstein spaceIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2023
Xiuyuan Cheng
Jianfeng Lu
Yixin Tan
Yao Xie
546
29
0
26 Oct 2023
Transformer Fusion with Optimal Transport
Transformer Fusion with Optimal TransportInternational Conference on Learning Representations (ICLR), 2023
Moritz Imfeld
Jacopo Graldi
Marco Giordano
Thomas Hofmann
Sotiris Anagnostidis
Sidak Pal Singh
ViTMoMe
430
28
0
09 Oct 2023
FlashAttention-2: Faster Attention with Better Parallelism and Work
  Partitioning
FlashAttention-2: Faster Attention with Better Parallelism and Work PartitioningInternational Conference on Learning Representations (ICLR), 2023
Tri Dao
LRM
401
2,032
0
17 Jul 2023
A Brief Review of Hypernetworks in Deep Learning
A Brief Review of Hypernetworks in Deep LearningArtificial Intelligence Review (AIR), 2023
Vinod Kumar Chauhan
Jiandong Zhou
Ping Lu
Soheila Molaei
David Clifton
441
146
0
12 Jun 2023
Flow Matching for Generative Modeling
Flow Matching for Generative ModelingInternational Conference on Learning Representations (ICLR), 2022
Y. Lipman
Ricky T. Q. Chen
Heli Ben-Hamu
Maximilian Nickel
Matt Le
OOD
999
2,757
0
06 Oct 2022
GeONet: a neural operator for learning the Wasserstein geodesic
GeONet: a neural operator for learning the Wasserstein geodesicConference on Uncertainty in Artificial Intelligence (UAI), 2022
Andrew Gracyk
Xiaohui Chen
OT
368
2
0
28 Sep 2022
Flow Straight and Fast: Learning to Generate and Transfer Data with
  Rectified Flow
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified FlowInternational Conference on Learning Representations (ICLR), 2022
Xingchao Liu
Chengyue Gong
Qiang Liu
OOD
934
1,916
0
07 Sep 2022
Supervised Training of Conditional Monge Maps
Supervised Training of Conditional Monge MapsNeural Information Processing Systems (NeurIPS), 2022
Charlotte Bunne
Andreas Krause
Marco Cuturi
OT
265
75
0
28 Jun 2022
Meta Optimal Transport
Meta Optimal TransportInternational Conference on Machine Learning (ICML), 2022
Brandon Amos
Samuel N. Cohen
Giulia Luise
I. Redko
OT
287
27
0
10 Jun 2022
FlashAttention: Fast and Memory-Efficient Exact Attention with
  IO-Awareness
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-AwarenessNeural Information Processing Systems (NeurIPS), 2022
Tri Dao
Daniel Y. Fu
Stefano Ermon
Atri Rudra
Christopher Ré
VLM
807
3,256
0
27 May 2022
Skyformer: Remodel Self-Attention with Gaussian Kernel and Nyström
  Method
Skyformer: Remodel Self-Attention with Gaussian Kernel and Nyström MethodNeural Information Processing Systems (NeurIPS), 2021
Yifan Chen
Qi Zeng
Heng Ji
Yun Yang
192
62
0
29 Oct 2021
Using Optimal Transport as Alignment Objective for fine-tuning
  Multilingual Contextualized Embeddings
Using Optimal Transport as Alignment Objective for fine-tuning Multilingual Contextualized Embeddings
Sawsan Alqahtani
Garima Lalwani
Yi Zhang
Salvatore Romeo
Saab Mansour
OT
121
28
0
06 Oct 2021
Do Neural Optimal Transport Solvers Work? A Continuous Wasserstein-2
  Benchmark
Do Neural Optimal Transport Solvers Work? A Continuous Wasserstein-2 BenchmarkNeural Information Processing Systems (NeurIPS), 2021
Alexander Korotin
Lingxiao Li
Aude Genevay
Justin Solomon
Alexander N. Filippov
Evgeny Burnaev
OT
347
99
0
03 Jun 2021
Prefix-Tuning: Optimizing Continuous Prompts for Generation
Prefix-Tuning: Optimizing Continuous Prompts for GenerationAnnual Meeting of the Association for Computational Linguistics (ACL), 2021
Xiang Lisa Li
Abigail Z. Jacobs
650
5,139
0
01 Jan 2021
Distances between probability distributions of different dimensions
Distances between probability distributions of different dimensionsIEEE Transactions on Information Theory (IEEE Trans. Inf. Theory), 2020
Yuhan Cai
Lek-Heng Lim
UDOOD
263
62
0
01 Nov 2020
Rethinking Attention with Performers
Rethinking Attention with Performers
K. Choromanski
Valerii Likhosherstov
David Dohan
Xingyou Song
Andreea Gane
...
Afroz Mohiuddin
Lukasz Kaiser
David Belanger
Lucy J. Colwell
Adrian Weller
712
1,928
0
30 Sep 2020
Linear Optimal Transport Embedding: Provable Wasserstein classification
  for certain rigid transformations and perturbations
Linear Optimal Transport Embedding: Provable Wasserstein classification for certain rigid transformations and perturbations
Caroline Moosmüller
A. Cloninger
OT
405
50
0
20 Aug 2020
Wasserstein Embedding for Graph Learning
Wasserstein Embedding for Graph Learning
Soheil Kolouri
Navid Naderializadeh
Gustavo K. Rohde
Heiko Hoffmann
GNN
205
96
0
16 Jun 2020
Decision-Making with Auto-Encoding Variational Bayes
Decision-Making with Auto-Encoding Variational BayesNeural Information Processing Systems (NeurIPS), 2020
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
1.5K
19,430
0
17 Feb 2020
Unsupervised Multilingual Alignment using Wasserstein Barycenter
Unsupervised Multilingual Alignment using Wasserstein BarycenterInternational Joint Conference on Artificial Intelligence (IJCAI), 2020
Jiale Han
Kshitij Jain
B. Cheng
Pascal Poupart
Xu Wang
206
28
0
28 Jan 2020
Are Transformers universal approximators of sequence-to-sequence
  functions?
Are Transformers universal approximators of sequence-to-sequence functions?International Conference on Learning Representations (ICLR), 2019
Chulhee Yun
Srinadh Bhojanapalli
A. S. Rawat
Sashank J. Reddi
Sanjiv Kumar
302
427
0
20 Dec 2019
Model Fusion via Optimal Transport
Model Fusion via Optimal TransportNeural Information Processing Systems (NeurIPS), 2019
Sidak Pal Singh
Martin Jaggi
MoMeFedML
598
286
0
12 Oct 2019
Wasserstein-2 Generative Networks
Wasserstein-2 Generative NetworksInternational Conference on Learning Representations (ICLR), 2019
Alexander Korotin
Vage Egiazarian
Arip Asadulaev
Alexander Safin
Evgeny Burnaev
GAN
466
124
0
28 Sep 2019
Optimal transport mapping via input convex neural networks
Optimal transport mapping via input convex neural networksInternational Conference on Machine Learning (ICML), 2019
Ashok Vardhan Makkuva
Amirhossein Taghvaei
Sewoong Oh
Jason D. Lee
OT
302
229
0
28 Aug 2019
Style Transfer by Relaxed Optimal Transport and Self-Similarity
Style Transfer by Relaxed Optimal Transport and Self-SimilarityComputer Vision and Pattern Recognition (CVPR), 2019
Nicholas I. Kolkin
Jason Salavon
Gregory Shakhnarovich
280
306
0
29 Apr 2019
(q,p)-Wasserstein GANs: Comparing Ground Metrics for Wasserstein GANs
(q,p)-Wasserstein GANs: Comparing Ground Metrics for Wasserstein GANs
Anton Mallasto
J. Frellsen
Wouter Boomsma
Aasa Feragen
169
15
0
10 Feb 2019
Parameter-Efficient Transfer Learning for NLP
Parameter-Efficient Transfer Learning for NLPInternational Conference on Machine Learning (ICML), 2019
N. Houlsby
A. Giurgiu
Stanislaw Jastrzebski
Bruna Morrone
Quentin de Laroussilhe
Andrea Gesmundo
Mona Attariyan
Sylvain Gelly
605
5,569
0
02 Feb 2019
Improving GANs Using Optimal Transport
Improving GANs Using Optimal TransportInternational Conference on Learning Representations (ICLR), 2018
Tim Salimans
Han Zhang
Alec Radford
Dimitris N. Metaxas
OTGAN
267
334
0
15 Mar 2018
Attention Is All You Need
Attention Is All You NeedNeural Information Processing Systems (NeurIPS), 2017
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
2.8K
159,241
0
12 Jun 2017
Joint Distribution Optimal Transportation for Domain Adaptation
Joint Distribution Optimal Transportation for Domain Adaptation
Nicolas Courty
Rémi Flamary
Amaury Habrard
A. Rakotomamonjy
OTOOD
354
613
0
24 May 2017
HyperNetworks
HyperNetworks
David R Ha
Andrew M. Dai
Quoc V. Le
660
1,782
0
27 Sep 2016
Input Convex Neural Networks
Input Convex Neural Networks
Brandon Amos
Lei Xu
J. Zico Kolter
818
733
0
22 Sep 2016
1