DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes

29 May 2025

Papers citing "DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes"

13 / 13 papers shown

Reinforcement Learning for Large Model: A Survey

Mike Zheng Shou

316

2

0

24 Dec 2025

Learning What to Attend First: Modality-Importance-Guided Reasoning for Reliable Multimodal Emotion Understanding

92

0

0

02 Dec 2025

Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier

261

1

0

27 Oct 2025

HieroAction: Hierarchically Guided VLM for Fine-Grained Action Analysis

104

1

0

23 Aug 2025

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

126

12

0

31 Jul 2025

Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement

385

46

0

01 Jul 2025

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

...

549

790

1

14 Apr 2025

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning

Prithvijit Chattopadhyay

...

AI4CE LM&Ro LRM

625

69

0

18 Mar 2025

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

385

200

0

17 Mar 2025

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

MU OffRL LRM MLLM ReLM VLM

565

353

0

09 Mar 2025

MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2025

Hongwei Bran Li

Daniel Rueckert

462

107

0

26 Feb 2025

Qwen2.5-VL Technical Report

...

719

2,841

0

20 Feb 2025

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

...

OffRL AI4TS LRM ReLM VLM

1.2K

5,342

0

22 Jan 2025