Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
All Papers
Title
Home
Papers
All Papers
50 / 536,476 papers shown
Title
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
Y. Wang
Zhijie Lin
Yao Teng
Yuanzhi Zhu
Shuhuai Ren
Jiashi Feng
Xihui Liu
0
0
0
20 Mar 2025
UAS Visual Navigation in Large and Unseen Environments via a Meta Agent
Yuci Han
Charles Toth
Alper Yilmaz
0
0
0
20 Mar 2025
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Liming Jiang
Qing Yan
Yumin Jia
Zichuan Liu
Hao Kang
Xin Lu
0
0
0
20 Mar 2025
UniSync: A Unified Framework for Audio-Visual Synchronization
Tao Feng
Yifan Xie
Xun Guan
Jiyuan Song
Z. Liu
Fei Ma
Fei Richard Yu
0
0
0
20 Mar 2025
Enhancing variational quantum algorithms by balancing training on classical and quantum hardware
Rahul Bhowmick
Harsh Wadhwa
Avinash Singh
Tania Sidana
Quoc Hoan Tran
Krishna Kumar Sabapathy
0
0
0
20 Mar 2025
NeuralFoil: An Airfoil Aerodynamics Analysis Tool Using Physics-Informed Machine Learning
Peter Sharpe
R. John Hansman
0
0
0
20 Mar 2025
SceneMI: Motion In-betweening for Modeling Human-Scene Interactions
Inwoo Hwang
Bing Zhou
Y. Kim
Jian Wang
Chuan Guo
0
0
0
20 Mar 2025
PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification
Sharon Peled
Y. Maruvka
Moti Freiman
0
0
0
20 Mar 2025
VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis
Chia-Yi Hsu
Jia-You Chen
Yu-Lin Tsai
Chih-Hsun Lin
Pin-Yu Chen
Chia-Mu Yu
Chun-ying Huang
0
0
0
20 Mar 2025
Hyperspectral Imaging for Identifying Foreign Objects on Pork Belly
Gabriela Ghimpeteanu
Hayat Rajani
Josep Quintana
Rafael García
0
0
0
20 Mar 2025
M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation
Markus Karmann
Peng-Tao Jiang
Bo Li
O. Urfalioglu
0
0
0
20 Mar 2025
OpenMIBOOD: Open Medical Imaging Benchmarks for Out-Of-Distribution Detection
Max Gutbrod
D. Rauber
Danilo Weber Nunes
Christoph Palm
0
0
0
20 Mar 2025
On the Cone Effect in the Learning Dynamics
Zhanpeng Zhou
Yongyi Yang
Jie Ren
Mahito Sugiyama
Junchi Yan
0
0
0
20 Mar 2025
Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction
Ziyao Guo
K. Zhang
Michael Qizhe Shieh
0
0
0
20 Mar 2025
Machine learning identifies nullclines in oscillatory dynamical systems
Bartosz Prokop
Jimmy Billen
Nikita Frolov
Lendert Gelens
0
0
0
20 Mar 2025
FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
Tianyi Wei
Yifan Zhou
Dongdong Chen
Xingang Pan
0
0
0
20 Mar 2025
FedSAF: A Federated Learning Framework for Enhanced Gastric Cancer Detection and Privacy Preservation
Yuxin Miao
Xinyuan Yang
Hongda Fan
Yichun Li
Yishu Hong
Xiechen Guo
Ali Braytee
Weidong Huang
Ali Anaissi
FedML
2
0
0
20 Mar 2025
MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures
Lucas Morin
Valéry Weber
A. Nassar
Gerhard Ingmar Meijer
Luc Van Gool
Yawei Li
Peter W. J. Staar
0
0
0
20 Mar 2025
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
Hui Zhang
Tingwei Gao
Jie Shao
Zuxuan Wu
0
0
0
20 Mar 2025
Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures
Tim Seizinger
Florin-Alexandru Vasluianu
Marcos V. Conde
Radu Timofte
0
0
0
20 Mar 2025
Disentangled and Interpretable Multimodal Attention Fusion for Cancer Survival Prediction
Aniek Eijpe
Soufyan Lakbir
Melis Erdal Cesur
Sara P. Oliveira
Sanne Abeln
Wilson Silva
0
0
0
20 Mar 2025
Landmarks Are Alike Yet Distinct: Harnessing Similarity and Individuality for One-Shot Medical Landmark Detection
Xu He
Zhen Huang
Qingsong Yao
Xiaoqian Zhou
Shuoling Zhou
0
0
0
20 Mar 2025
Closer to Ground Truth: Realistic Shape and Appearance Labeled Data Generation for Unsupervised Underwater Image Segmentation
Andrei Jelea
A. Belbachir
Marius Leordeanu
0
0
0
20 Mar 2025
V-NAW: Video-based Noise-aware Adaptive Weighting for Facial Expression Recognition
JunGyu Lee
Kunyoung Lee
Haesol Park
Ig-Jae Kim
G. Nam
0
0
0
20 Mar 2025
A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli
Pengyu Liu
Guohua Dong
D. Guo
Kun Li
Fengling Li
Xun Yang
Meng Wang
Xiaomin Ying
0
0
0
20 Mar 2025
Computing Lindahl Equilibrium for Public Goods with and without Funding Caps
Christian Kroer
Dominik Peters
0
0
0
20 Mar 2025
SenseExpo: Efficient Autonomous Exploration with Prediction Information from Lightweight Neural Networks
Haojia Gao
Haohua Que
Hoiian Au
Weihao Shan
Mingkai Liu
...
Lei Mu
Rong Zhao
Xinghua Yang
Qi Wei
Fei Qiao
0
0
0
20 Mar 2025
DnLUT: Ultra-Efficient Color Image Denoising via Channel-Aware Lookup Tables
Sidi Yang
Binxiao Huang
Yulun Zhang
Dahai Yu
Yujiu Yang
Ngai Wong
0
0
0
20 Mar 2025
Text-Driven Diffusion Model for Sign Language Production
J. He
Xu Wang
Ruobei Zhang
Shengeng Tang
Y. Wang
Lechao Cheng
DiffM
5
0
0
20 Mar 2025
MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations
Kyungho Bae
Jinhyung Kim
Sihaeng Lee
Soonyoung Lee
G. Lee
Jinwoo Choi
0
0
0
20 Mar 2025
From Head to Tail: Efficient Black-box Model Inversion Attack via Long-tailed Learning
Ziang Li
Hongguang Zhang
Juan Wang
Meihui Chen
Hongxin Hu
Wenzhe Yi
Xiaoyang Xu
Mengda Yang
Chenjun Ma
0
0
0
20 Mar 2025
Securing Satellite Communications: Real-Time Video Encryption Scheme on Satellite Payloads
Hanshuo Qiu
Jing Lian
Xiaoyuan Wang
Jizhao Liu
0
0
0
20 Mar 2025
Are We There Yet? A Study of Decentralized Identity Applications
Daria Schumm
Katharina O. E. Müller
Burkhard Stiller
0
0
0
20 Mar 2025
DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding
Keyan Chen
Chenyang Liu
Bowen Chen
Wenyuan Li
Zhengxia Zou
Zhenwei Shi
2
0
0
20 Mar 2025
Fast Homomorphic Linear Algebra with BLAS
Youngjin Bae
Jung Hee Cheon
G. Hanrot
J. Park
D. Stehlé
0
0
0
20 Mar 2025
Digital Asset Data Lakehouse. The concept based on a blockchain research center
Raul Cristian Bag
0
0
0
20 Mar 2025
Ultra-Resolution Adaptation with Ease
Ruonan Yu
Songhua Liu
Zhenxiong Tan
Xinchao Wang
0
0
0
20 Mar 2025
ALLMod: Exploring
A
‾
\underline{\mathbf{A}}
A
rea-Efficiency of
L
‾
\underline{\mathbf{L}}
L
UT-based
L
‾
\underline{\mathbf{L}}
L
arge Number
M
o
d
‾
\underline{\mathbf{Mod}}
Mod
ular Reduction via Hybrid Workloads
Fangxin Liu
Haomin Li
Zongwu Wang
Bo-Wen Zhang
M. Zhang
Shoumeng Yan
Li Jiang
Haibing Guan
2
0
0
20 Mar 2025
From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction
Ayberk Acar
Mariana Smith
Lidia Al-Zogbi
Tanner Watts
Fangjie Li
...
Robert J. Webster III
I. Oguz
Alan Kuntz
A. Krieger
Jie Wu
0
0
0
20 Mar 2025
Mixture of Lookup Experts
Shibo Jie
Yehui Tang
Kai Han
Y. Li
Duyu Tang
Zhi-Hong Deng
Yunhe Wang
0
0
0
20 Mar 2025
MapGlue: Multimodal Remote Sensing Image Matching
Peihao Wu
Yongxiang Yao
Wenfei Zhang
Dong Wei
Y. Wan
Yansheng Li
Yongjun Zhang
0
0
0
20 Mar 2025
LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates
Ying Shen
Lifu Huang
0
0
0
20 Mar 2025
Agentic Keyframe Search for Video Question Answering
Sunqi Fan
Meng-Hao Guo
Shuojin Yang
0
0
0
20 Mar 2025
Automatically Generating Chinese Homophone Words to Probe Machine Translation Estimation Systems
Shenbin Qian
Constantin Orasan
Diptesh Kanojia
Félix do Carmo
0
0
0
20 Mar 2025
Distributed Learning over Arbitrary Topology: Linear Speed-Up with Polynomial Transient Time
Runze You
Shi Pu
0
0
0
20 Mar 2025
The Morphology-Control Trade-Off: Insights into Soft Robotic Efficiency
Yue Xie
Kai-feng Chu
Xing Wang
Fumiya Iida
0
0
0
20 Mar 2025
Wearable Haptics for a Marionette-inspired Teleoperation of Highly Redundant Robotic Systems
Davide Torielli
Leonardo Franco
Maria Pozzi
L. Muratore
Monica Malvezzi
Nikos Tsagarakis
D. Prattichizzo
0
0
0
20 Mar 2025
General reproducing properties in RKHS with application to derivative and integral operators
Fatima-Zahrae El-Boukkouri
Josselin Garnier
Olivier Roustant
0
0
0
20 Mar 2025
Scale-wise Distillation of Diffusion Models
Nikita Starodubcev
Denis Kuznedelev
Artem Babenko
Dmitry Baranchuk
DiffM
2
0
0
20 Mar 2025
Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models
Keda Tao
Haoxuan You
Yang Sui
Can Qin
H. Wang
VLM
2
0
0
20 Mar 2025
Previous
1
2
3
4
5
...
10728
10729
10730
Next