Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, EditingNeural Information Processing Systems (NeurIPS), 2024 |
AnyEdit: Mastering Unified High-Quality Image Editing for Any IdeaComputer Vision and Pattern Recognition (CVPR), 2024 |
Towards Semantic Equivalence of Tokenization in Multimodal LLMInternational Conference on Learning Representations (ICLR), 2024 |