ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2203.11926
  4. Cited By
Focal Modulation Networks

Focal Modulation Networks

22 March 2022
Jianwei Yang
Chunyuan Li
Xiyang Dai
Lu Yuan
Jianfeng Gao
    3DPC
ArXivPDFHTML

Papers citing "Focal Modulation Networks"

50 / 141 papers shown
Title
Context Aware Grounded Teacher for Source Free Object Detection
Context Aware Grounded Teacher for Source Free Object Detection
Tajamul Ashraf
Rajes Manna
Partha Sarathi Purkayastha
Tavaheed Tariq
Janibul Bashir
22
0
0
21 Apr 2025
Hadamard product in deep learning: Introduction, Advances and Challenges
Hadamard product in deep learning: Introduction, Advances and Challenges
Grigorios G. Chrysos
Yongtao Wu
Razvan Pascanu
Philip Torr
V. Cevher
AAML
96
0
0
17 Apr 2025
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
HGFormer: Topology-Aware Vision Transformer with HyperGraph Learning
Hao Wang
Shuo Zhang
Biao Leng
ViT
59
0
0
03 Apr 2025
Spectral-Adaptive Modulation Networks for Visual Perception
Spectral-Adaptive Modulation Networks for Visual Perception
Guhnoo Yun
J. Yoo
Kijung Kim
Jeongho Lee
Paul Hongsuck Seo
Dong Hwan Kim
32
0
0
31 Mar 2025
LSNet: See Large, Focus Small
LSNet: See Large, Focus Small
Ao Wang
Hui Chen
Zijia Lin
J. Han
Guiguang Ding
37
0
0
29 Mar 2025
GmNet: Revisiting Gating Mechanisms From A Frequency View
GmNet: Revisiting Gating Mechanisms From A Frequency View
Yifan Wang
Xu Ma
Yitian Zhang
Zhongruo Wang
Sung-Cheol Kim
Vahid Mirjalili
Vidya Renganathan
Y. Fu
36
0
0
28 Mar 2025
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object Detection
Zhichao Sun
Huazhang Hu
Yidong Ma
Gang Liu
Nemo Chen
Xu Tang
Yao Hu
Yongchao Xu
ObjD
44
0
0
24 Mar 2025
Depth-Aware Range Image-Based Model for Point Cloud Segmentation
Depth-Aware Range Image-Based Model for Point Cloud Segmentation
Bike Chen
Antti Tikänmaki
Juha Roning
3DPC
3DV
39
0
0
19 Mar 2025
Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition
Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition
Shristi Das Biswas
Efstathia Soufleri
Arani Roy
Kaushik Roy
54
0
0
17 Mar 2025
AthletePose3D: A Benchmark Dataset for 3D Human Pose Estimation and Kinematic Validation in Athletic Movements
Calvin Yeung
Tomohiro Suzuki
Ryota Tanaka
Zhuoer Yin
Keisuke Fujii
3DH
68
1
0
10 Mar 2025
Partial Convolution Meets Visual Attention
Haiduo Huang
Fuwei Yang
D. Li
Ji Liu
Lu Tian
Jinzhang Peng
Pengju Ren
E. Barsoum
3DH
94
0
0
05 Mar 2025
Boltzmann Attention Sampling for Image Analysis with Small Objects
Boltzmann Attention Sampling for Image Analysis with Small Objects
Theodore Zhao
Sid Kiblawi
Naoto Usuyama
Ho Hin Lee
Sam Preston
Hoifung Poon
Mu-Hsin Wei
MedIm
64
0
0
04 Mar 2025
Unifying Light Field Perception with Field of Parallax
Fei Teng
Buyin Deng
Boyuan Zheng
Kai Luo
Kunyu Peng
Jiaming Zhang
Kailun Yang
34
0
0
02 Mar 2025
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
Meng Lou
Yizhou Yu
110
1
0
27 Feb 2025
Infrared Image Super-Resolution: Systematic Review, and Future Trends
Infrared Image Super-Resolution: Systematic Review, and Future Trends
Y. Huang
Tomo Miyazaki
Xiao-Fang Liu
S. Omachi
SupR
77
10
0
21 Feb 2025
iFormer: Integrating ConvNet and Transformer for Mobile Application
iFormer: Integrating ConvNet and Transformer for Mobile Application
Chuanyang Zheng
ViT
67
0
0
26 Jan 2025
Parallel Sequence Modeling via Generalized Spatial Propagation Network
Parallel Sequence Modeling via Generalized Spatial Propagation Network
Hongjun Wang
Wonmin Byeon
Jiarui Xu
Jinwei Gu
Ka Chun Cheung
Xiaolong Wang
Kai Han
Jan Kautz
Sifei Liu
58
0
0
21 Jan 2025
Towards Context-aware Convolutional Network for Image Restoration
Towards Context-aware Convolutional Network for Image Restoration
Fangwei Hao
Ji Du
Weiyun Liang
Jing Xu
Xiaoxuan Xu
SupR
90
0
0
15 Dec 2024
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition
Yongkun Du
Z. Chen
Hongtao Xie
Caiyan Jia
Yu Jiang
80
1
0
24 Nov 2024
Breaking the Low-Rank Dilemma of Linear Attention
Breaking the Low-Rank Dilemma of Linear Attention
Qihang Fan
Huaibo Huang
Ran He
28
0
0
12 Nov 2024
PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices
PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices
Ming Kang
F. F. Ting
Raphaël C.-W. Phan
C. Ting
ViT
MedIm
52
1
0
29 Oct 2024
Historical Test-time Prompt Tuning for Vision Foundation Models
Historical Test-time Prompt Tuning for Vision Foundation Models
Jingyi Zhang
Jiaxing Huang
Xiaoqin Zhang
Ling Shao
Shijian Lu
VLM
25
4
0
27 Oct 2024
Improving 3D Medical Image Segmentation at Boundary Regions using Local
  Self-attention and Global Volume Mixing
Improving 3D Medical Image Segmentation at Boundary Regions using Local Self-attention and Global Volume Mixing
Daniya Najiha Abdul Kareem
M. Fiaz
Noa Novershtern
Jacob Hanna
Hisham Cholakkal
24
3
0
20 Oct 2024
MoH: Multi-Head Attention as Mixture-of-Head Attention
MoH: Multi-Head Attention as Mixture-of-Head Attention
Peng Jin
Bo Zhu
Li Yuan
Shuicheng Yan
MoE
29
13
0
15 Oct 2024
Overcoming Domain Limitations in Open-vocabulary Segmentation
Overcoming Domain Limitations in Open-vocabulary Segmentation
Dongjun Hwang
Seong Joon Oh
Junsuk Choe
SSeg
OOD
42
0
0
15 Oct 2024
LGFN: Lightweight Light Field Image Super-Resolution using Local
  Convolution Modulation and Global Attention Feature Extraction
LGFN: Lightweight Light Field Image Super-Resolution using Local Convolution Modulation and Global Attention Feature Extraction
Zhongxin Yu
Liang Chen
Zhiyun Zeng
Kunping Yang
Shaofei Luo
Shaorui Chen
Cheng Zhong
SupR
20
0
0
26 Sep 2024
@Bench: Benchmarking Vision-Language Models for Human-centered Assistive
  Technology
@Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology
Xin Jiang
Junwei Zheng
Ruiping Liu
Jiahang Li
Jiaming Zhang
Sven Matthiesen
Rainer Stiefelhagen
VLM
18
0
0
21 Sep 2024
A Comparative Study of Open Source Computer Vision Models for
  Application on Small Data: The Case of CFRP Tape Laying
A Comparative Study of Open Source Computer Vision Models for Application on Small Data: The Case of CFRP Tape Laying
Thomas Fraunholz
Dennis Rall
Tim Kohler
Alfons Schuster
M. Mayer
Lars Larsen
27
0
0
16 Sep 2024
Text-Guided Mixup Towards Long-Tailed Image Categorization
Text-Guided Mixup Towards Long-Tailed Image Categorization
Richard Franklin
Jiawei Yao
Deyang Zhong
Qi Qian
Juhua Hu
VLM
24
0
0
05 Sep 2024
Towards Flexible Visual Relationship Segmentation
Towards Flexible Visual Relationship Segmentation
Fangrui Zhu
Jianwei Yang
Huaizu Jiang
VOS
29
1
0
15 Aug 2024
DFE-IANet: A Method for Polyp Image Classification Based on Dual-domain
  Feature Extraction and Interaction Attention
DFE-IANet: A Method for Polyp Image Classification Based on Dual-domain Feature Extraction and Interaction Attention
Wei Wang
Jixing He
Xin Wang
29
0
0
30 Jul 2024
Practical Video Object Detection via Feature Selection and Aggregation
Practical Video Object Detection via Feature Selection and Aggregation
Yuheng Shi
Tong Zhang
Xiaojie Guo
ObjD
26
2
0
29 Jul 2024
VSSD: Vision Mamba with Non-Causal State Space Duality
VSSD: Vision Mamba with Non-Causal State Space Duality
Yuheng Shi
Minjing Dong
Mingjia Li
Chang Xu
Mamba
28
3
0
26 Jul 2024
GroupMamba: Efficient Group-Based Visual State Space Model
GroupMamba: Efficient Group-Based Visual State Space Model
Abdelrahman M. Shaker
Syed Talal Wasim
Salman Khan
Juergen Gall
Fahad Shahbaz Khan
Mamba
51
0
0
18 Jul 2024
SFPNet: Sparse Focal Point Network for Semantic Segmentation on General
  LiDAR Point Clouds
SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds
Yanbo Wang
Wentao Zhao
Chuan Cao
Tianchen Deng
Jingchuan Wang
Weidong Chen
3DPC
40
5
0
16 Jul 2024
D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation
  in Breast Cancer Detection from Mammograms
D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation in Breast Cancer Detection from Mammograms
Tajamul Ashraf
K. Rangarajan
Mohit Gambhir
Richa Gabha
Chetan Arora
MedIm
26
1
0
09 Jul 2024
LMBF-Net: A Lightweight Multipath Bidirectional Focal Attention Network
  for Multifeatures Segmentation
LMBF-Net: A Lightweight Multipath Bidirectional Focal Attention Network for Multifeatures Segmentation
Tariq M Khan
Shahzaib Iqbal
Syed S. Naqvi
Imran Razzak
Erik H. W. Meijering
27
5
0
03 Jul 2024
Kolmogorov-Arnold Convolutions: Design Principles and Empirical Studies
Kolmogorov-Arnold Convolutions: Design Principles and Empirical Studies
Ivan Drokin
35
19
0
01 Jul 2024
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object
  Detection: Methods and Results
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results
Jiaqi Wang
Yuhang Zang
Pan Zhang
Tao Chu
Yuhang Cao
...
Kehong Yuan
Yanyan Zu
Jiayao Ha
Qiong Gao
Licheng Jiao
ObjD
31
1
0
17 Jun 2024
Technique Report of CVPR 2024 PBDL Challenges
Technique Report of CVPR 2024 PBDL Challenges
Ying Fu
Yu Li
Shaodi You
Boxin Shi
Linwei Chen
...
Songyin Dai
Sen Jia
Junpei Zhang
Puhua Chen
Qihang Li
33
0
0
15 Jun 2024
AdaFisher: Adaptive Second Order Optimization via Fisher Information
AdaFisher: Adaptive Second Order Optimization via Fisher Information
Damien Martins Gomes
Yanlei Zhang
Eugene Belilovsky
Guy Wolf
Mahdi S. Hosseini
ODL
74
2
0
26 May 2024
Infinite-Dimensional Feature Interaction
Infinite-Dimensional Feature Interaction
Chenhui Xu
Fuxun Yu
Maoliang Li
Zihao Zheng
Zirui Xu
Jinjun Xiong
Xiang Chen
25
1
0
22 May 2024
BiomedParse: a biomedical foundation model for image parsing of
  everything everywhere all at once
BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once
Theodore Zhao
Yu Gu
Jianwei Yang
Naoto Usuyama
Ho Hin Lee
...
B. Piening
Carlo Bifulco
Mu-Hsin Wei
Hoifung Poon
Sheng Wang
MedIm
18
22
0
21 May 2024
MambaOut: Do We Really Need Mamba for Vision?
MambaOut: Do We Really Need Mamba for Vision?
Weihao Yu
Xinchao Wang
Mamba
37
46
0
13 May 2024
Linearly-evolved Transformer for Pan-sharpening
Linearly-evolved Transformer for Pan-sharpening
Junming Hou
Zihan Cao
Naishan Zheng
Xuan Li
Xiaoyu Chen
Xinyang Liu
Xiaofeng Cong
Man Zhou
Danfeng Hong
ViT
16
7
0
19 Apr 2024
SpiralMLP: A Lightweight Vision MLP Architecture
SpiralMLP: A Lightweight Vision MLP Architecture
Haojie Mu
Burhan Ul Tayyab
Nicholas Chua
32
0
0
31 Mar 2024
Rewrite the Stars
Rewrite the Stars
Xu Ma
Xiyang Dai
Yue Bai
Yizhou Wang
Yun Fu
25
92
0
29 Mar 2024
Efficient Modulation for Vision Networks
Efficient Modulation for Vision Networks
Xu Ma
Xiyang Dai
Jianwei Yang
Bin Xiao
Yinpeng Chen
Yun Fu
Lu Yuan
24
17
0
29 Mar 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques
  and Insights
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
A. Kazerouni
I. Hacihaliloglu
Dorit Merhof
36
7
0
28 Mar 2024
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs
Donghyun Kim
Byeongho Heo
Dongyoon Han
25
12
0
28 Mar 2024
123
Next