PositionOCR: Augmenting Positional Awareness in Multi-Modal Models via Hybrid Specialist Integration

Chen Duan
Zhentao Guo
Pei Fu
Zining Wang
Kai Zhou
Pengfei Yan
    MLLM
