2407.21229
Cited By
Advancing Vietnamese Visual Question Answering with Transformer and Convolutional Integration
30 July 2024
Ngoc Son Nguyen
Van Son Nguyen
Tung Le
ViT
Papers citing "Advancing Vietnamese Visual Question Answering with Transformer and Convolutional Integration"
5 / 5 papers shown
Title
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
259
4,223
0
30 Jan 2023
A Unified View of Masked Image Modeling
Zhiliang Peng
Li Dong
Hangbo Bao
QiXiang Ye
Furu Wei
VLM
52
35
0
19 Oct 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
388
4,110
0
28 Jan 2022
PhoBERT: Pre-trained language models for Vietnamese
Dat Quoc Nguyen
A. Nguyen
162
341
0
02 Mar 2020
Efficient Estimation of Word Representations in Vector Space
Tomáš Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
228
31,150
0
16 Jan 2013