Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2311.15206
Cited By
Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding
26 November 2023
Hoang-Quan Nguyen
Thanh-Dat Truong
Xuan-Bac Nguyen
Ashley Dowling
Xin Li
Khoa Luu
VLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Insect-Foundation: A Foundation Model and Large-scale 1M Dataset for Visual Insect Understanding"
13 / 13 papers shown
Title
BIOSCAN-5M: A Multimodal Dataset for Insect Biodiversity
Zahra Gharaee
Scott C. Lowe
ZeMing Gong
Pablo Millán Arias
Nicholas Pellegrino
...
Lila Kari
Dirk Steinke
Graham W. Taylor
Paul Fieguth
Angel X. Chang
38
7
0
28 Jan 2025
A Novel Dataset for Video-Based Autism Classification Leveraging Extra-Stimulatory Behavior
Manuel Serna-Aguilera
Xuan-Bac Nguyen
Han-Seok Seo
Khoa Luu
28
1
0
06 Sep 2024
FungiTastic: A multi-modal dataset and benchmark for image categorization
Lukás Picek
Klara Janouskova
Milan Šulc
Jirí Matas
72
1
0
24 Aug 2024
Equivariant Similarity for Vision-Language Foundation Models
Tan Wang
Kevin Qinghong Lin
Linjie Li
Chung-Ching Lin
Zhengyuan Yang
Hanwang Zhang
Zicheng Liu
Lijuan Wang
CoGe
33
44
0
25 Mar 2023
OTAdapt: Optimal Transport-based Approach For Unsupervised Domain Adaptation
Thanh-Dat Truong
N. V. R. Chappa
Xuan-Bac Nguyen
Ngan Le
Ashley Dowling
Khoa Luu
OOD
OT
19
11
0
22 May 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
385
4,010
0
28 Jan 2022
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
258
7,337
0
11 Nov 2021
Fine-Grained Zero-Shot Learning with DNA as Side Information
Sarkhan Badirli
Zeynep Akata
G. Mohler
Christel Picard
M. M. Dundar
SyDa
BDL
38
34
0
29 Sep 2021
BiMaL: Bijective Maximum Likelihood Approach to Domain Adaptation in Semantic Scene Segmentation
Thanh-Dat Truong
C. Duong
Ngan Le
S. L. Phung
Chase Rainwater
Khoa Luu
59
39
0
06 Aug 2021
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
283
5,723
0
29 Apr 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
293
3,683
0
11 Feb 2021
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
238
3,359
0
09 Mar 2020
Feature Pyramid Networks for Object Detection
Tsung-Yi Lin
Piotr Dollár
Ross B. Girshick
Kaiming He
Bharath Hariharan
Serge J. Belongie
ObjD
166
21,643
0
09 Dec 2016
1