Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2404.13565
Cited By
Exploring Diverse Methods in Visual Question Answering
21 April 2024
Panfeng Li
Qikai Yang
Xieming Geng
Wenjing Zhou
Zhicheng Ding
Yi Nian
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Exploring Diverse Methods in Visual Question Answering"
41 / 41 papers shown
Title
Research on Cloud Platform Network Traffic Monitoring and Anomaly Detection System based on Large Language Models
Ze Yang
Yihong Jin
Juntian Liu
Xinhe Xu
Yihan Zhang
Shuyang Ji
22
0
0
22 Apr 2025
Data Augmentation Through Random Style Replacement
Qikai Yang
Cheng Ji
Huaiying Luo
Panfeng Li
Zhicheng Ding
31
0
0
14 Apr 2025
Industrial Internet Robot Collaboration System and Edge Computing Optimization
Qian Zuo
Dajun Tao
Tian Qi
Jieyi Xie
Zijie Zhou
Zhen Tian
Yu Mingyu
41
2
0
03 Apr 2025
JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model
Yi Nian
Shenzhe Zhu
Yuehan Qin
Li Li
Z. Wang
Chaowei Xiao
Yue Zhao
21
0
0
03 Apr 2025
Research and Design on Intelligent Recognition of Unordered Targets for Robots Based on Reinforcement Learning
Yiting Mao
Dajun Tao
Shengyuan Zhang
Tian Qi
Keqin Li
34
5
0
10 Mar 2025
PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation
Ziyao Zeng
Jingcheng Ni
Daniel Wang
Patrick Rim
Younjoon Chung
Fengyu Yang
Byung-Woo Hong
A. Wong
DiffM
MDE
93
2
0
24 Nov 2024
Enhancing Exchange Rate Forecasting with Explainable Deep Learning Models
Shuchen Meng
Andi Chen
Chihang Wang
Mengyao Zheng
Fangyu Wu
Xupeng Chen
Haowei Ni
Panfeng Li
46
2
0
25 Oct 2024
RSA: Resolving Scale Ambiguities in Monocular Depth Estimators through Language Descriptions
Ziyao Zeng
Yangchao Wu
Hyoungseob Park
Daniel Wang
Fengyu Yang
Stefano Soatto
Dong Lao
Byung-Woo Hong
Alex Wong
MDE
16
7
0
03 Oct 2024
Pre-trained Graphformer-based Ranking at Web-scale Search (Extended Abstract)
Yuchen Li
Haoyi Xiong
Linghe Kong
Zeyi Sun
Hongyang Chen
Shuaiqiang Wang
Dawei Yin
38
0
0
25 Sep 2024
A Lightweight GAN-Based Image Fusion Algorithm for Visible and Infrared Images
Zhizhong Wu
Hao Gong
Jiajing Chen
Zhou Yuru
LiangHao Tan
Ge Shi
23
16
0
07 Sep 2024
Enhancing Deep Learning with Optimized Gradient Descent: Bridging Numerical Methods and Neural Network Training
Yuhan Ma
Dan Sun
Erdi Gao
Ningjing Sang
Iris Li
Guanming Huang
20
7
0
07 Sep 2024
Style Transfer: From Stitching to Neural Networks
Xinhe Xu
Zhuoer Wang
Yihan Zhang
Yizhou Liu
Zhaoyue Wang
Zhihao Xu
Muhan Zhao
Huaiying Luo
17
3
0
01 Sep 2024
Harnessing Earnings Reports for Stock Predictions: A QLoRA-Enhanced LLM Approach
Haowei Ni
Shuchen Meng
Xupeng Chen
Ziqing Zhao
Andi Chen
Panfeng Li
Shiyao Zhang
Qifu Yin
Yuanqing Wang
Yuxi Chan
AIFin
31
30
0
13 Aug 2024
Evaluating Modern Approaches in 3D Scene Reconstruction: NeRF vs Gaussian-Based Methods
Yiming Zhou
Zixuan Zeng
Andi Chen
Xiaofan Zhou
Haowei Ni
Shiyao Zhang
Panfeng Li
Liangxi Liu
Mengyao Zheng
Xupeng Chen
3DGS
29
17
0
08 Aug 2024
Attention Mechanism and Context Modeling System for Text Mining Machine Translation
Shi Bo
Yuwei Zhang
Junming Huang
Sitong Liu
Zexi Chen
43
26
0
08 Aug 2024
Advanced User Credit Risk Prediction Model using LightGBM, XGBoost and Tabnet with SMOTEENN
Chang Yu
Yixin Jin
Qianwen Xing
Y. Zhang
Shaobo Guo
Shuchen Meng
26
16
0
07 Aug 2024
NeuroBind: Towards Unified Multimodal Representations for Neural Signals
Fengyu Yang
Chao Feng
Daniel Wang
Tianye Wang
Ziyao Zeng
...
Hyoungseob Park
Pengliang Ji
Han Zhao
Yuanning Li
Alex Wong
31
9
0
19 Jul 2024
Transforming Movie Recommendations with Advanced Machine Learning: A Study of NMF, SVD,and K-Means Clustering
Yubing Yan
Camille Moreau
Zhuoyue Wang
Wenhan Fan
Chengqian Fu
46
4
0
12 Jul 2024
Image anomaly detection and prediction scheme based on SSA optimized ResNet50-BiGRU model
Qianhui Wan
Zecheng Zhang
Liheng Jiang
Zhaoqi Wang
Yan Zhou
MedIm
3DH
24
18
0
20 Jun 2024
Exploiting Diffusion Prior for Out-of-Distribution Detection
Armando Zhu
Jiabei Liu
Keqin Li
Shuying Dai
Bo Hong
Peng Zhao
Changsong Wei
35
7
0
16 Jun 2024
Advanced Multimodal Deep Learning Architecture for Image-Text Matching
Jinyin Wang
Haijing Zhang
Yihao Zhong
Yingbin Liang
Rongwei Ji
Yiru Cang
31
22
0
13 Jun 2024
Research on Driver Facial Fatigue Detection Based on Yolov8 Model
Chang Zhou
Yang Zhao
Shaobo Liu
Yi Zhao
Xingchen Li
Chiyu Cheng
3DH
35
14
0
04 Jun 2024
Advancing Financial Risk Prediction Through Optimized LSTM Model Performance and Comparative Analysis
Ke Xu
Yu Cheng
Shiqing Long
Junjie Guo
Jue Xiao
Mengfang Sun
AI4TS
43
9
0
31 May 2024
TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction
Yinda Chen
Haoyuan Shi
Xiaoyu Liu
Te Shi
Ruobing Zhang
Dong Liu
Zhiwei Xiong
Feng Wu
36
9
0
27 May 2024
Advancements in Feature Extraction Recognition of Medical Imaging Systems Through Deep Learning Technique
Qishi Zhan
Dan Sun
Erdi Gao
Yuhan Ma
Yaxin Liang
Haowei Yang
40
8
0
23 May 2024
Exploration of Multi-Scale Image Fusion Systems in Intelligent Medical Image Analysis
Yuxiang Hu
Haowei Yang
Ting Xu
Shuyao He
Jiajie Yuan
Haozhang Deng
39
12
0
23 May 2024
Investigation of Customized Medical Decision Algorithms Utilizing Graph Neural Networks
Yafeng Yan
Shuyao He
Zhou Yu
Jiajie Yuan
Ziang Liu
Yan Chen
30
11
0
23 May 2024
Exploration of Attention Mechanism-Enhanced Deep Learning Models in the Mining of Medical Textual Data
Lingxi Xiao
Muqing Li
Yinqiu Feng
Meiqi Wang
Ziyi Zhu
Zexi Chen
29
16
0
23 May 2024
Enhancing Medical Imaging with GANs Synthesizing Realistic Images from Limited Data
Yinqiu Feng
Bo Zhang
Lingxi Xiao
Yutian Yang
Gegen Tana
Zexi Chen
GAN
MedIm
31
10
0
22 May 2024
Application of Multimodal Fusion Deep Learning Model in Disease Recognition
Xiaoyi Liu
Hongjie Qiu
Muqing Li
Zhou Yu
Yutian Yang
Yafeng Yan
35
17
0
22 May 2024
Efficiency optimization of large-scale language models based on deep learning in natural language processing tasks
Taiyuan Mei
Yun Zi
X. Cheng
Zijun Gao
Qi Wang
Haowei Yang
36
20
0
20 May 2024
Automatic News Generation and Fact-Checking System Based on Language Processing
Xirui Peng
Qiming Xu
Zheng Feng
Haopeng Zhao
Lianghao Tan
Yan Zhou
Zecheng Zhang
Chenwei Gong
Yingqiao Zheng
23
28
0
17 May 2024
Mix of Experts Language Model for Named Entity Recognition
Xinwei Chen
Kun Li
Tianyou Song
Jiangjian Guo
MoE
27
10
0
30 Apr 2024
Cross-Task Multi-Branch Vision Transformer for Facial Expression and Mask Wearing Classification
Armando Zhu
Keqin Li
Tong Wu
Peng Zhao
Bo Hong
26
25
0
22 Apr 2024
A Comparative Study on Enhancing Prediction in Social Network Advertisement through Data Augmentation
Qikai Yang
Panfeng Li
Xinhe Xu
Zhicheng Ding
Wenjing Zhou
Yi Nian
45
31
0
22 Apr 2024
Advanced Feature Manipulation for Enhanced Change Detection Leveraging Natural Language Models
Zhenglin Li
Yangchen Huang
Mengran Zhu
Jingyu Zhang
Jinghao Chang
Houze Liu
26
4
0
23 Mar 2024
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection
Hanning Chen
Wenjun Huang
Yang Ni
Sanggeon Yun
Fei Wen
Hugo Latapie
Mohsen Imani
ObjD
MLLM
VLM
35
16
0
12 Mar 2024
Accelerating Parallel Sampling of Diffusion Models
Zhiwei Tang
Jiasheng Tang
Hao Luo
Fan Wang
Tsung-Hui Chang
30
11
0
15 Feb 2024
Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
Can Jin
Tong Che
Hongwu Peng
Yiyuan Li
Dimitris N. Metaxas
Marco Pavone
42
42
0
05 Feb 2024
Language-Guided Face Animation by Recurrent StyleGAN-based Generator
Tiankai Hang
Huan Yang
Bei Liu
Jianlong Fu
Xin Geng
B. Guo
VGen
13
13
0
11 Aug 2022
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
144
1,458
0
06 Jun 2016
1