2505.16416
Cited By
v2 (latest)
Circle-RoPE: Cone-like Decoupled Rotary Positional Embedding for Large Vision-Language Models
22 May 2025
Chengcheng Wang
Jianyuan Guo
Hongguang Li
Yuchuan Tian
Ying Nie
Chang Xu
Kai Han
Github (11★)
Papers citing
"Circle-RoPE: Cone-like Decoupled Rotary Positional Embedding for Large Vision-Language Models"
6 / 6 papers shown
Revisiting Multimodal Positional Encoding in Vision-Language Models
Jie Huang
Xuejing Liu
Sibo Song
Ruibing Hou
Hong Chang
Junyang Lin
S. Bai
152
2
0
27 Oct 2025
Improving GUI Grounding with Explicit Position-to-Coordinate Mapping
Suyuchen Wang
Tianyu Zhang
Ahmed Masry
Christopher Pal
Spandana Gella
Bang Liu
Perouz Taslakian
106
1
0
03 Oct 2025
AttAnchor: Guiding Cross-Modal Token Alignment in VLMs with Attention Anchors
Junyang Zhang
Tianyi Zhu
Thierry Tambe
68
0
0
27 Sep 2025
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer
Weixian Lei
Jiacong Wang
Haochen Wang
Xuelong Li
Jun Hao Liew
Jiashi Feng
Zilong Huang
223
20
0
14 Apr 2025
Qwen2.5-VL Technical Report
S. Bai
Keqin Chen
Xuejing Liu
Jialin Wang
Wenbin Ge
...
Zesen Cheng
Hang Zhang
Zhibo Yang
Haiyang Xu
Junyang Lin
VLM
703
2,801
0
20 Feb 2025
Baichuan-Omni-1.5 Technical Report
Yadong Li
Qingbin Liu
Tao Zhang
Tao Zhang
Tian Jin
...
Jianhua Xu
Haoze Sun
Mingan Lin
Guosheng Dong
Xin Wu
AuLLM
328
63
0
28 Jan 2025