Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.15830
Cited By
A-JEPA: Joint-Embedding Predictive Architecture Can Listen
27 November 2023
Zhengcong Fei
Mingyuan Fan
Junshi Huang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A-JEPA: Joint-Embedding Predictive Architecture Can Listen"
17 / 17 papers shown
Title
A Survey on Cross-Modal Interaction Between Music and Multimodal Data
Sifei Li
Mining Tan
Feier Shen
Minyan Luo
Zijiao Yin
Fan Tang
W. Dong
Changsheng Xu
57
0
0
17 Apr 2025
SkyReels-A2: Compose Anything in Video Diffusion Transformers
Zhengcong Fei
D. Li
Di Qiu
J. Wang
Yikun Dou
...
J. Xu
Mingyuan Fan
Guibin Chen
Yang Li
Yahui Zhou
DiffM
VGen
63
2
0
03 Apr 2025
Chirp Localization via Fine-Tuned Transformer Model: A Proof-of-Concept Study
N. Bahador
M. Lankarany
39
0
0
24 Mar 2025
Leveraging Joint Predictive Embedding and Bayesian Inference in Graph Self Supervised Learning
Srinitish Srinivasan
Omkumar CU
SSL
BDL
44
0
0
02 Feb 2025
Video Diffusion Transformers are In-Context Learners
Zhengcong Fei
Di Qiu
Changqian Yu
Debang Li
Mingyuan Fan
VGen
DiffM
113
2
0
14 Dec 2024
Sparsh: Self-supervised touch representations for vision-based tactile sensing
Carolina Higuera
Akash Sharma
Chaithanya Krishna Bodduluri
Taosha Fan
Patrick E. Lancaster
...
Michael Kaess
Byron Boots
Mike Lambeta
Tingfan Wu
Mustafa Mukadam
29
11
0
31 Oct 2024
Learning Latent Wireless Dynamics from Channel State Information
Charbel Bou Chaaya
Abanoub M. Girgis
Mehdi Bennis
16
1
0
16 Sep 2024
FLUX that Plays Music
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Junshi Huang
76
7
0
01 Sep 2024
Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI
Y. Liu
Weixing Chen
Yongjie Bai
Xiaodan Liang
Guanbin Li
Wen Gao
Liang Lin
LM&Ro
SyDa
AI4CE
48
27
0
09 Jul 2024
LaT-PFN: A Joint Embedding Predictive Architecture for In-context Time-series Forecasting
Stijn Verdenius
Andrea Zerio
Roy L.M. Wang
BDL
AI4TS
AI4CE
24
2
0
16 May 2024
Music Consistency Models
Zhengcong Fei
Mingyuan Fan
Junshi Huang
DiffM
33
5
0
20 Apr 2024
World Models for Autonomous Driving: An Initial Survey
Yanchen Guan
Haicheng Liao
Zhenning Li
Jia Hu
Runze Yuan
Yunjian Li
Guohui Zhang
Chengzhong Xu
24
30
0
05 Mar 2024
Prospective Role of Foundation Models in Advancing Autonomous Vehicles
Jianhua Wu
B. Gao
Jincheng Gao
Jianhao Yu
Hongqing Chu
...
Xun Gong
Yi Chang
H. E. Tseng
Hong Chen
Jie Chen
31
3
0
08 Dec 2023
Graph-level Representation Learning with Joint-Embedding Predictive Architectures
Geri Skenderi
Hang Li
Jiliang Tang
Marco Cristani
AI4TS
GNN
47
3
0
27 Sep 2023
HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection
Ke Chen
Xingjian Du
Bilei Zhu
Zejun Ma
Taylor Berg-Kirkpatrick
Shlomo Dubnov
ViT
114
262
0
02 Feb 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Yuan Gong
Yu-An Chung
James R. Glass
VLM
99
144
0
02 Feb 2021
1