A Dual-Branch Local-Global Framework for Cross-Resolution Land Cover Mapping
- AI4ClViTMU3DVWSOLBDLOffRLAI4TSAI4CELMTDMedImMQSupRUQCVOTDMLSSLMoE3DGSKELMFedMLPINNISegCLLVGenAAMLSLRWSODAIFinRALMALMCLIPUQLMHAILRMPEROCLFAttAILawSSegHILMMLAULM&RoWaLMVLMSyDa3DPCPICVTTAReLMOSLMLM&MAAI4MHVOTCoGeELMMDEPILMMILMNAIUDReCodOnRLAI4EdSILM3DHMLTGNNMIALMGPMGenLLMSVXAICMLAuLLM
Cross-resolution land cover mapping aims to produce high-resolution semantic predictions from coarse or low-resolution supervision, yet the severe resolution mismatch makes effective learning highly challenging. Existing weakly supervised approaches often struggle to align fine-grained spatial structures with coarse labels, leading to noisy supervision and degraded mapping accuracy. To tackle this problem, we propose DDTM, a dual-branch weakly supervised framework that explicitly decouples local semantic refinement from global contextual reasoning. Specifically, DDTM introduces a diffusion-based branch to progressively refine fine-scale local semantics under coarse supervision, while a transformer-based branch enforces long-range contextual consistency across large spatial extents. In addition, we design a pseudo-label confidence evaluation module to mitigate noise induced by cross-resolution inconsistencies and to selectively exploit reliable supervisory signals. Extensive experiments demonstrate that DDTM establishes a new state-of-the-art on the Chesapeake Bay benchmark, achieving 66.52\% mIoU and substantially outperforming prior weakly supervised methods. The code is available atthis https URL.
View on arXiv