ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.10125
  4. Cited By
UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized
  Multimodal Framework

UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework

16 November 2023
Chris Kelly
Luhui Hu
Cindy Yang
Yu Tian
Deshun Yang
Bang Yang
Zaoshan Huang
Zihao Li
Yuexian Zou
    AI4TS
ArXivPDFHTML

Papers citing "UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework"

3 / 3 papers shown
Title
VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision
  Understanding
VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding
Chris Kelly
Luhui Hu
Jiayin Hu
Yu Tian
Deshun Yang
Bang Yang
Cindy Yang
Zihao Li
Zaoshan Huang
Yuexian Zou
26
2
0
14 Mar 2024
VisionGPT: Vision-Language Understanding Agent Using Generalized
  Multimodal Framework
VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework
Chris Kelly
Luhui Hu
Bang Yang
Yu Tian
Deshun Yang
Cindy Yang
Zaoshan Huang
Zihao Li
Jiayin Hu
Yuexian Zou
37
9
0
14 Mar 2024
WorldGPT: A Sora-Inspired Video AI Agent as Rich World Models from Text
  and Image Inputs
WorldGPT: A Sora-Inspired Video AI Agent as Rich World Models from Text and Image Inputs
Deshun Yang
Luhui Hu
Yu Tian
Zihao Li
Chris Kelly
Bang Yang
Cindy Yang
Yuexian Zou
VGen
20
12
0
10 Mar 2024
1