arXiv:1511.02799
Neural Module Networks

9 November 2015
Jacob Andreas
Marcus Rohrbach
Trevor Darrell
Dan Klein
    CoGe

Papers citing "Neural Module Networks"

50 / 653 papers shown
Compression then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding
Da Li
Yuxiao Luo
Keping Bi
Jiafeng Guo
Wei Yuan
B. Yang
Yan Wang
Fan Yang
Tingting Gao
Guorui Zhou
VLM
253
0
0
11 Nov 2025
CausalGuard: A Smart System for Detecting and Preventing False Information in Large Language Models
Piyushkumar Patel
HILM LRM
97
0
0
30 Oct 2025
Beyond Prompt Engineering: Neuro-Symbolic-Causal Architecture for Robust Multi-Objective AI Agents
Gokturk Aytug Akarlar
AI4CE
88
0
0
27 Oct 2025
Memorizing Long-tail Data Can Help Generalization Through Composition
Mo Zhou
Haoyang Ma
Rong Ge
TDI
379
0
0
18 Oct 2025
The Artificial Intelligence Cognitive Examination: A Survey on the Evolution of Multimodal Evaluation from Recognition to Reasoning
Mayank Ravishankara
Varindra V. Persad Maharaj
ELM
201
1
0
05 Oct 2025
Unraveling Syntax: How Language Models Learn Context-Free Grammars
Laura Ying Schulz
Daniel Mitropolsky
Tomaso Poggio
ReLM LRM AI4CE
104
0
0
02 Oct 2025
How Do Language Models Compose Functions?
Apoorv Khandelwal
Ellie Pavlick
KELM CoGe LRM
204
1
0
02 Oct 2025
Exploring System 1 and 2 communication for latent reasoning in LLMs
Julian Coda-Forno
Zhuokai Zhao
Qiang Zhang
Dipesh Tamboli
W. Li
Xiangjun Fan
Lizhu Zhang
Eric Schulz
Hsiao-Ping Tseng
LRM
118
1
1
01 Oct 2025
From $f(x)$ and $g(x)$ to $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
L. Yuan
Weize Chen
Yuchen Zhang
Ganqu Cui
Hanbin Wang
Ziming You
Ning Ding
Zhiyuan Liu
Maosong Sun
Hao Peng
OffRL CLL
236
1
0
29 Sep 2025
Can Constructions "SCAN" Compositionality ?
Ganesh Katrapati
Manish Shrivastava
104
0
0
24 Sep 2025
A Framework for Generating Artificial Datasets to Validate Absolute and Relative Position Concepts
George Correa de Araujo
H. Maia
Hélio Pedrini
144
0
0
17 Sep 2025
LaV-CoT: Language-Aware Visual CoT with Multi-Aspect Reward Optimization for Real-World Multilingual VQA
Jing Huang
Zhiya Tan
Shutao Gong
Fanwei Zeng
Jianshu Li
Jianshu Li
Huazhe Tan
Weibin Yao
J. Li
MLLM LRM
218
1
0
12 Sep 2025
Explain Before You Answer: A Survey on Compositional Visual Reasoning
Fucai Ke
Joy Hsu
Zhixi Cai
Zixian Ma
Xin Zheng
...
P. D. Haghighi
Gholamreza Haffari
Ranjay Krishna
Jiajun Wu
H. Rezatofighi
ReLM CoGe LRM
357
8
0
24 Aug 2025
Reasoning in Computer Vision: Taxonomy, Models, Tasks, and Methodologies
Ayushman Sarkar
Mohd Yamani Idna Idris
Zhenyu Yu
LRM
163
12
0
14 Aug 2025
Decoupled Functional Evaluation of Autonomous Driving Models via Feature Map Quality Scoring
Ludan Zhang
Sihan Wang
Yuqi Dai
Shuofei Qiao
Qinyue Luo
Lei He
164
0
0
11 Aug 2025
IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A
Chen Li
Chinthani Sugandhika
Yeo Keat Ee
Eric Peh
Hao Zhang
Hong Yang
Deepu Rajan
Basura Fernando
LRM
154
1
0
04 Aug 2025
ROVER: Recursive Reasoning Over Videos with Vision-Language Models for Embodied Tasks
Philip Schroeder
Ondrej Biza
Thomas Weng
Hongyin Luo
James Glass
LM&Ro LRM
164
0
0
03 Aug 2025
AuroraLong: Bringing RNNs Back to Efficient Open-Ended Video Understanding
Weili Xu
Enxin Song
Wenhao Chai
Xuexiang Wen
Tian-Chun Ye
Gaoang Wang
336
5
0
03 Jul 2025
Vision Generalist Model: A Survey
International Journal of Computer Vision (IJCV), 2025
Ziyi Wang
Yongming Rao
Shuofeng Sun
Xinrun Liu
Yi Wei
...
Zuyan Liu
Yanbo Wang
Hongmin Liu
Jie Zhou
Jiwen Lu
293
0
0
11 Jun 2025
A Neurosymbolic Agent System for Compositional Visual Reasoning
Yichang Xu
Gaowen Liu
Ramana Rao Kompella
Sihao Hu
Tiansheng Huang
Fatih Ilhan
Selim Furkan Tekin
Zachary Yahn
LRM VLM
230
0
0
09 Jun 2025
CIVET: Systematic Evaluation of Understanding in VLMs
Massimo Rizzoli
Simone Alghisi
Olha Khomyn
Gabriel Roccabruna
Seyed Mahed Mousavi
Giuseppe Riccardi
387
1
0
05 Jun 2025
Policy Search, Retrieval, and Composition via Task Similarity in Collaborative Agentic Systems
Saptarshi Nath
Christos Peridis
Eseoghene Benjamin
Hengrong Du
Soheil Kolouri
Peter Kinnell
Zexin Li
Cong Liu
Shirin Dora
Andrea Soltoggio
307
0
0
05 Jun 2025
Argus Inspection: Do Multimodal Large Language Models Possess the Eye of Panoptes?
Yang Yao
Lingyu Li
Jiaxin Song
Chiyu Chen
Zhenqi He
...
Xin Wang
Tianle Gu
Jie Li
Yan Teng
Yingchun Wang
LRM
295
0
0
03 Jun 2025
SemIRNet: A Semantic Irony Recognition Network for Multimodal Sarcasm Detection
Jingxuan Zhou
Yuehao Wu
Yibo Zhang
Yeyubei Zhang
Yunchong Liu
Bolin Huang
Chunhong Yuan
246
7
0
28 May 2025
Characterizing Pattern Matching and Its Limits on Compositional Task Structures
Hoyeon Chang
Jinho Park
Hanseul Cho
Sohee Yang
Miyoung Ko
Hyeonbin Hwang
Seungpil Won
Dohaeng Lee
Youbin Ahn
Minjoon Seo
279
1
0
26 May 2025
Understanding Complexity in VideoQA via Visual Program Generation
Cristobal Eyzaguirre
Igor Vasiljevic
Achal Dave
Jiajun Wu
Rares Andrei Ambrus
Thomas Kollar
Juan Carlos Niebles
P. Tokmakov
272
0
0
19 May 2025
Neuro-Symbolic Concepts
Jiayuan Mao
Joshua B. Tenenbaum
Jiajun Wu
NAI
348
2
0
09 May 2025
A Theoretical Analysis of Compositional Generalization in Neural Networks: A Necessary and Sufficient Condition
Yuanpeng Li
CoGe
911
0
0
05 May 2025
Deep Learning with Pretrained 'Internal World' Layers: A Gemma 3-Based Modular Architecture for Wildfire Prediction
Ayoub Jadouli
Chaker El Amrani
KELM AI4TS
315
1
0
20 Apr 2025
A Study on Neuro-Symbolic Artificial Intelligence: Healthcare Perspectives
Delower Hossain
Jake Y Chen
NAI
515
9
0
23 Mar 2025
Hybrid Learners Do Not Forget: A Brain-Inspired Neuro-Symbolic Approach to Continual Learning
Amin Banayeeanzade
Mohammad Rostami
CLL
252
1
0
16 Mar 2025
Make Haste Slowly: A Theory of Emergent Structured Mixed Selectivity in Feature Learning ReLU Networks
International Conference on Learning Representations (ICLR), 2025
Devon Jarvis
Richard Klein
Benjamin Rosman
Andrew M. Saxe
MLT
368
3
0
08 Mar 2025
A Theory of Initialisation's Impact on Specialisation
International Conference on Learning Representations (ICLR), 2025
Devon Jarvis
Sebastian Lee
Clémentine Dominé
Andrew M. Saxe
Stefano Sarao Mannelli
CLL
296
2
0
04 Mar 2025
Predicate Hierarchies Improve Few-Shot State Classification
International Conference on Learning Representations (ICLR), 2025
Emily Jin
Joy Hsu
Jiajun Wu
OffRL
436
1
0
18 Feb 2025
Skill Expansion and Composition in Parameter Space
International Conference on Learning Representations (ICLR), 2025
Tenglong Liu
Junjie Li
Yinan Zheng
Haoyi Niu
Yixing Lan
Xin Xu
Xianyuan Zhan
378
11
0
09 Feb 2025
PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures
AAAI Conference on Artificial Intelligence (AAAI), 2025
Shivalika Singh
Nakul Sharma
Manish Gupta
Anand Mishra
378
4
0
28 Jan 2025
PuzzleGPT: Emulating Human Puzzle-Solving Ability for Time and Location Prediction
North American Chapter of the Association for Computational Linguistics (NAACL), 2025
Hammad A. Ayyubi
Xuande Feng
Junzhang Liu
Xudong Lin
Zhecan Wang
Shih-Fu Chang
168
1
0
24 Jan 2025
Compositional Instruction Following with Language Models and Reinforcement Learning
Vanya Cohen
Geraud Nangue Tasse
N. Gopalan
Steven D. James
Matthew C. Gombolay
Ray Mooney
Benjamin Rosman
250
0
0
21 Jan 2025
Physics of Skill Learning
Ziming Liu
Yizhou Liu
Eric J. Michaud
Jeff Gore
Max Tegmark
367
2
0
21 Jan 2025
Flexible task abstractions emerge in linear networks with fast and bounded units
Neural Information Processing Systems (NeurIPS), 2024
Kai Sandbrink
Jan P. Bauer
A. Proca
Andrew M. Saxe
Christopher Summerfield
Ali Hummos
350
3
0
17 Jan 2025
The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering
Anupam Pandey
Deepjyoti Bodo
Arpan Phukan
Asif Ekbal
428
2
0
13 Jan 2025
Forward Once for All: Structural Parameterized Adaptation for Efficient Cloud-coordinated On-device Recommendation
Knowledge Discovery and Data Mining (KDD), 2025
Kairui Fu
Zheqi Lv
Shengyu Zhang
Fan Wu
Kun Kuang
240
5
0
07 Jan 2025
Time Series Language Model for Descriptive Caption Generation
Engineering Applications of Artificial Intelligence (EAAI), 2025
M. Trabelsi
Aidan Boyd
Jin Cao
H. Uzunalioglu
AI4TS
175
8
0
03 Jan 2025
Decoupling Knowledge and Reasoning in Transformers: A Modular Architecture with Generalized Cross-Attention
Zhenyu Guo
Wenguang Chen
254
0
0
01 Jan 2025
Towards Visual Grounding: A Survey
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Linhui Xiao
Xiaoshan Yang
X. Lan
Yaowei Wang
Changsheng Xu
ObjD
969
31
0
28 Dec 2024
Language Model as Visual Explainer
Neural Information Processing Systems (NeurIPS), 2024
Xingyi Yang
Xinchao Wang
VLM
208
1
0
08 Dec 2024
TANGO: Training-free Embodied AI Agents for Open-world Tasks
Computer Vision and Pattern Recognition (CVPR), 2024
Filippo Ziliotto
Tommaso Campari
Luciano Serafini
Lamberto Ballan
LLMAG LM&Ro MLLM LRM
331
12
0
05 Dec 2024
A Comprehensive Survey on Visual Question Answering Datasets and Algorithms
Raihan Kabir
Naznin Haque
Md. Saiful Islam
Marium-E. Jannat
CoGe
287
8
0
17 Nov 2024
DNN Modularization via Activation-Driven Training
Tuan Ngo
Abid Hassan
Saad Shafiq
Nenad Medvidovic
MoMe
327
0
0
01 Nov 2024
SimpsonsVQA: Enhancing Inquiry-Based Learning with a Tailored Dataset
Ngoc Dung Huynh
Mohamed Reda Bouadjenek
Sunil Aryal
Imran Razzak
Hakim Hacid
233
0
0
30 Oct 2024