ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2012.15840
  4. Cited By
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective
  with Transformers
v1v2v3 (latest)

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

Computer Vision and Pattern Recognition (CVPR), 2020
31 December 2020
Sixiao Zheng
Jiachen Lu
Hengshuang Zhao
Xiatian Zhu
Zekun Luo
Yabiao Wang
Yanwei Fu
Jianfeng Feng
Tao Xiang
Juil Sock
Li Zhang
    ViT
ArXiv (abs)PDFHTML

Papers citing "Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers"

50 / 1,242 papers shown
Boundary-Aware Test-Time Adaptation for Zero-Shot Medical Image Segmentation
Boundary-Aware Test-Time Adaptation for Zero-Shot Medical Image Segmentation
Chenlin Xu
Lei Zhang
Lituan Wang
Xinyu Pu
Pengfei Ma
Guangwu Qian
Z. Wang
Yan Wang
VLM
195
0
0
04 Dec 2025
ReactionMamba: Generating Short & Long Human Reaction Sequences
ReactionMamba: Generating Short & Long Human Reaction Sequences
Hajra Anwar Beg
Baptiste Chopin
Hao Tang
Mohamed Daoudi
Mamba
218
0
0
28 Nov 2025
SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation
SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation
Tianrun Chen
Runlong Cao
Xinda Yu
Lanyun Zhu
Chaotao Ding
...
Cheng Chen
Qi Zhu
C. Xu
Papa Mao
Ying Zang
MedImVLM
418
0
0
24 Nov 2025
Seg-VAR: Image Segmentation with Visual Autoregressive Modeling
Seg-VAR: Image Segmentation with Visual Autoregressive Modeling
Rongkun Zheng
Lu Qi
Xi Chen
Yi Wang
K. Wang
Hengshuang Zhao
187
0
0
16 Nov 2025
Navigating the Wild: Pareto-Optimal Visual Decision-Making in Image Space
Navigating the Wild: Pareto-Optimal Visual Decision-Making in Image Space
Durgakant Pushp
Weizhe (Wesley) Chen
Zheng Chen
Chaomin Luo
Jason M. Gregory
Lantao Liu
117
1
0
11 Nov 2025
Learning with less: label-efficient land cover classification at very high spatial resolution using self-supervised deep learning
Learning with less: label-efficient land cover classification at very high spatial resolution using self-supervised deep learning
Dakota Hester
Vitor S. Martins
Lucas B. Ferreira
Thainara M. A. Lima
SSL
406
0
0
04 Nov 2025
MicroAUNet: Boundary-Enhanced Multi-scale Fusion with Knowledge Distillation for Colonoscopy Polyp Image Segmentation
MicroAUNet: Boundary-Enhanced Multi-scale Fusion with Knowledge Distillation for Colonoscopy Polyp Image Segmentation
Ziyi Wang
Yuanmei Zhang
Dorna Esrafilzadeh
Ali R. Jalili
Suncheng Xiang
199
0
0
03 Nov 2025
Terrain-Enhanced Resolution-aware Refinement Attention for Off-Road Segmentation
Terrain-Enhanced Resolution-aware Refinement Attention for Off-Road Segmentation
Seongkyu Choi
Jhonghyun An
129
0
0
03 Nov 2025
SA$^{2}$Net: Scale-Adaptive Structure-Affinity Transformation for Spine Segmentation from Ultrasound Volume Projection Imaging
SA2^{2}2Net: Scale-Adaptive Structure-Affinity Transformation for Spine Segmentation from Ultrasound Volume Projection Imaging
Hao Xie
Zixun Huang
Yushen Zuo
Yakun Ju
F. Leung
N. F. Law
Kin-Man Lam
Y. Zheng
Sai Ho Ling
130
0
0
30 Oct 2025
Classifier Enhancement Using Extended Context and Domain Experts for Semantic Segmentation
Classifier Enhancement Using Extended Context and Domain Experts for Semantic Segmentation
Huadong Tang
Youpeng Zhao
Min Xu
Jun Wang
Qiang Wu
132
0
0
29 Oct 2025
Unveiling the Spatial-temporal Effective Receptive Fields of Spiking Neural Networks
Unveiling the Spatial-temporal Effective Receptive Fields of Spiking Neural Networks
Jieyuan Zhang
Xiaolong Zhou
Shuai Wang
Wenjie Wei
Hanwen Liu
Qian Sun
Malu Zhang
Yang Yang
Haizhou Li
217
1
0
24 Oct 2025
WaveSeg: Enhancing Segmentation Precision via High-Frequency Prior and Mamba-Driven Spectrum Decomposition
WaveSeg: Enhancing Segmentation Precision via High-Frequency Prior and Mamba-Driven Spectrum Decomposition
Guoan Xu
Yang Xiao
Wenjing Jia
Guangwei Gao
Guo-Jun Qi
Chia-Wen Lin
Mamba
249
0
0
24 Oct 2025
SOLE: Hardware-Software Co-design of Softmax and LayerNorm for Efficient Transformer Inference
SOLE: Hardware-Software Co-design of Softmax and LayerNorm for Efficient Transformer Inference
Wenxun Wang
Shuchang Zhou
Wenyu Sun
Peiqin Sun
Y. Liu
171
43
0
20 Oct 2025
SaFiRe: Saccade-Fixation Reiteration with Mamba for Referring Image Segmentation
SaFiRe: Saccade-Fixation Reiteration with Mamba for Referring Image Segmentation
Zhenjie Mao
Yuhuan Yang
Chaofan Ma
Dongsheng Jiang
Jiangchao Yao
Ya Zhang
Yanfeng Wang
150
1
0
11 Oct 2025
Fitzpatrick Thresholding for Skin Image Segmentation
Fitzpatrick Thresholding for Skin Image Segmentation
Duncan Stothers
Sophia Xu
Carlie Reeves
Lia Gracey
128
1
0
08 Oct 2025
FlexiQ: Adaptive Mixed-Precision Quantization for Latency/Accuracy Trade-Offs in Deep Neural Networks
FlexiQ: Adaptive Mixed-Precision Quantization for Latency/Accuracy Trade-Offs in Deep Neural Networks
Jaemin Kim
Hongjun Um
Sungkyun Kim
Yongjun Park
Jiwon Seo
MQ
264
0
0
03 Oct 2025
Consistent Assistant Domains Transformer for Source-free Domain Adaptation
Consistent Assistant Domains Transformer for Source-free Domain AdaptationIEEE Transactions on Image Processing (IEEE TIP), 2025
Renrong Shao
Wei Zhang
Kangyang Luo
Qin Li
and Jun Wang
213
0
0
02 Oct 2025
PyramidStyler: Transformer-Based Neural Style Transfer with Pyramidal Positional Encoding and Reinforcement Learning
PyramidStyler: Transformer-Based Neural Style Transfer with Pyramidal Positional Encoding and Reinforcement Learning
Raahul Krishna Durairaju
K. Saruladha
250
0
0
02 Oct 2025
ClustViT: Clustering-based Token Merging for Semantic Segmentation
ClustViT: Clustering-based Token Merging for Semantic Segmentation
Fabio Montello
Ronja Güldenring
Lazaros Nalpantidis
VLM
156
0
0
02 Oct 2025
Transformer Classification of Breast Lesions: The BreastDCEDL_AMBL Benchmark Dataset and 0.92 AUC Baseline
Transformer Classification of Breast Lesions: The BreastDCEDL_AMBL Benchmark Dataset and 0.92 AUC Baseline
Naomi Fridman
Anat Goldstein
MedIm
177
0
0
30 Sep 2025
Causally Guided Gaussian Perturbations for Out-Of-Distribution Generalization in Medical Imaging
Causally Guided Gaussian Perturbations for Out-Of-Distribution Generalization in Medical Imaging
Haoran Pei
Yuguang Yang
Kexin Liu
Baochang Zhang
OODOODDCMLMedIm
242
0
0
30 Sep 2025
RetoVLA: Reusing Register Tokens for Spatial Reasoning in Vision-Language-Action Models
RetoVLA: Reusing Register Tokens for Spatial Reasoning in Vision-Language-Action Models
Jiyeon Koo
Taewan Cho
Hyunjoon Kang
Eunseom Pyo
Tae Gyun Oh
Taeryang Kim
Andrew Jaeyong Choi
113
2
0
25 Sep 2025
Weakly Supervised Food Image Segmentation using Vision Transformers and Segment Anything Model
Weakly Supervised Food Image Segmentation using Vision Transformers and Segment Anything Model
I. Sarafis
Alexandros Papadopoulos
A. Delopoulos
ViT
194
1
0
23 Sep 2025
Prototype-Based Pseudo-Label Denoising for Source-Free Domain Adaptation in Remote Sensing Semantic Segmentation
Prototype-Based Pseudo-Label Denoising for Source-Free Domain Adaptation in Remote Sensing Semantic Segmentation
Bin Wang
Fei Deng
Zeyu Chen
Zhicheng Yu
Y. Liu
137
1
0
21 Sep 2025
Uncertainty-Gated Deformable Network for Breast Tumor Segmentation in MR Images
Uncertainty-Gated Deformable Network for Breast Tumor Segmentation in MR Images
Yue Zhang
Jiahua Dong
Chengtao Peng
Qiuli Wang
Dan Song
Guiduo Duan
215
0
0
19 Sep 2025
[Re] Improving Interpretation Faithfulness for Vision Transformers
[Re] Improving Interpretation Faithfulness for Vision Transformers
Izabela Kurek
Wojciech Trejter
Stipe Frkovic
Andro Erdelez
182
0
0
18 Sep 2025
Where Do Tokens Go? Understanding Pruning Behaviors in STEP at High Resolutions
Where Do Tokens Go? Understanding Pruning Behaviors in STEP at High Resolutions
Michal Szczepanski
Martyna Poreba
Karim Haroun
ViT
170
0
0
17 Sep 2025
Masked Feature Modeling Enhances Adaptive Segmentation
Masked Feature Modeling Enhances Adaptive Segmentation
Wenlve Zhou
Zhiheng Zhou
Tiantao Xian
Yikui Zhai
Weibin Wu
Biyun Ma
144
0
0
17 Sep 2025
MAFS: Masked Autoencoder for Infrared-Visible Image Fusion and Semantic Segmentation
MAFS: Masked Autoencoder for Infrared-Visible Image Fusion and Semantic SegmentationIEEE Transactions on Image Processing (IEEE TIP), 2025
Liying Wang
Xiaoli Zhang
Chuanmin Jia
Siwei Ma
216
2
0
15 Sep 2025
Geometric Analysis of Magnetic Labyrinthine Stripe Evolution via U-Net Segmentation
Geometric Analysis of Magnetic Labyrinthine Stripe Evolution via U-Net Segmentation
Vinícius Yu Okubo
Kotaro Shimizu
B.S. Shivaran
Gia-Wei Chern
Hae Yong Kim
111
1
0
15 Sep 2025
Exploring Efficient Open-Vocabulary Segmentation in the Remote Sensing
Exploring Efficient Open-Vocabulary Segmentation in the Remote Sensing
Bingyu Li
Haocheng Dong
Da Zhang
Zhiyuan Zhao
Junyu Gao
Xuelong Li
218
13
0
15 Sep 2025
U-Mamba2: Scaling State Space Models for Dental Anatomy Segmentation in CBCT
U-Mamba2: Scaling State Space Models for Dental Anatomy Segmentation in CBCT
Zhi Qin Tan
Xiatian Zhu
Owen Addison
Yunpeng Li
MambaAI4CE
317
1
0
15 Sep 2025
I-Segmenter: Integer-Only Vision Transformer for Efficient Semantic Segmentation
I-Segmenter: Integer-Only Vision Transformer for Efficient Semantic Segmentation
Jordan Sassoon
Michal Szczepanski
Martyna Poreba
MQVLM
264
0
0
12 Sep 2025
Differential Morphological Profile Neural Networks for Semantic Segmentation
Differential Morphological Profile Neural Networks for Semantic Segmentation
David Huangal
J. Alex Hurt
143
0
0
04 Sep 2025
TransForSeg: A Multitask Stereo ViT for Joint Stereo Segmentation and 3D Force Estimation in Catheterization
TransForSeg: A Multitask Stereo ViT for Joint Stereo Segmentation and 3D Force Estimation in Catheterization
Pedram Fekri
M. Zadeh
Javad Dargahi
MedIm
118
0
0
01 Sep 2025
VoCap: Video Object Captioning and Segmentation from Any Prompt
VoCap: Video Object Captioning and Segmentation from Any Prompt
J. Uijlings
Xingyi Zhou
Xiuye Gu
Arsha Nagrani
Anurag Arnab
Alireza Fathi
David A. Ross
Cordelia Schmid
VOSVLM
304
1
0
29 Aug 2025
WaveHiT-SR: Hierarchical Wavelet Network for Efficient Image Super-Resolution
WaveHiT-SR: Hierarchical Wavelet Network for Efficient Image Super-Resolution
Fayaz Ali
Muhammad Zawish
Steven Davy
Radu Timofte
144
1
0
27 Aug 2025
ISALux: Illumination and Segmentation Aware Transformer Employing Mixture of Experts for Low Light Image Enhancement
ISALux: Illumination and Segmentation Aware Transformer Employing Mixture of Experts for Low Light Image Enhancement
Raul Balmez
Alexandru Brateanu
Ciprian Orhei
C. Ancuti
Cosmin Ancuti
155
0
0
25 Aug 2025
GazeProphet: Software-Only Gaze Prediction for VR Foveated Rendering
GazeProphet: Software-Only Gaze Prediction for VR Foveated Rendering
Farhaan Ebadulla
Chiraag Mudlapur
Gaurav BV
182
0
0
19 Aug 2025
SCRNet: Spatial-Channel Regulation Network for Medical Ultrasound Image Segmentation
SCRNet: Spatial-Channel Regulation Network for Medical Ultrasound Image Segmentation
Weixin Xu
Ziliang Wang
ViTMedIm
205
1
0
19 Aug 2025
Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment
Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment
Shi-Chen Zhang
Yunheng Li
Yu-Huan Wu
Qibin Hou
Ming-Ming Cheng
SSeg
247
4
0
12 Aug 2025
SMOL-MapSeg: Show Me One Label as prompt
SMOL-MapSeg: Show Me One Label as prompt
Yunshuang Yuan
Frank Thiemann
Thorsten Dahms
Monika Sester
157
1
0
07 Aug 2025
Glass Surface Segmentation with an RGB-D Camera via Weighted Feature Fusion for Service Robots
Glass Surface Segmentation with an RGB-D Camera via Weighted Feature Fusion for Service Robots
Henghong Lin
Zihan Zhu
Tao Wang
Anastasia Ioannou
Yuanshui Huang
178
3
0
03 Aug 2025
Representation Shift: Unifying Token Compression with FlashAttention
Representation Shift: Unifying Token Compression with FlashAttention
Joonmyung Choi
S. Lee
Byungoh Ko
Eunseo Kim
Jihyung Kil
Hyunwoo J. Kim
241
2
0
01 Aug 2025
EIFNet: Leveraging Event-Image Fusion for Robust Semantic Segmentation
EIFNet: Leveraging Event-Image Fusion for Robust Semantic Segmentation
Zhijiang Li
Haoran He
142
0
0
29 Jul 2025
ModalFormer: Multimodal Transformer for Low-Light Image Enhancement
ModalFormer: Multimodal Transformer for Low-Light Image Enhancement
Alexandru Brateanu
Raul Balmez
Ciprian Orhei
C. Ancuti
Cosmin Ancuti
ViTOffRL
279
0
0
27 Jul 2025
SeeDiff: Off-the-Shelf Seeded Mask Generation from Diffusion Models
SeeDiff: Off-the-Shelf Seeded Mask Generation from Diffusion ModelsAAAI Conference on Artificial Intelligence (AAAI), 2025
J. Park
Kumju Jo
Sungyong Baik
DiffM
255
2
0
26 Jul 2025
MambaVesselNet++: A Hybrid CNN-Mamba Architecture for Medical Image Segmentation
MambaVesselNet++: A Hybrid CNN-Mamba Architecture for Medical Image SegmentationACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) (TOMM), 2025
Qing Xu
Yanming Chen
Yue Li
Ziyu Liu
Zhenye Lou
Yixuan Zhang
Xiangjian He
Mamba
185
3
0
26 Jul 2025
EA-ViT: Efficient Adaptation for Elastic Vision Transformer
EA-ViT: Efficient Adaptation for Elastic Vision Transformer
Chen Zhu
Wangbo Zhao
Huiwen Zhang
Samir Khaki
Yuhao Zhou
...
Zhihang Yuan
Yuzhang Shang
Xiaojiang Peng
Kai Wang
Dawei Yang
229
3
0
25 Jul 2025
Iwin Transformer: Hierarchical Vision Transformer using Interleaved Windows
Iwin Transformer: Hierarchical Vision Transformer using Interleaved Windows
Simin Huo
Ning Li
ViT
296
0
0
24 Jul 2025
1234...232425
Next
Page 1 of 25
Pageof 25