Weakly Supervised Semantic Segmentation Using Visual Explainability

Das, Niladri

Weakly Supervised Semantic Segmentation Using Visual Explainability

Files

Niladri Das-Cs2219-Mtech2024.pdf (8.08 MB)

Date

2024-06

Authors

Das, Niladri

Publisher

Indian Statistical Institute, Kolkata

Abstract

In the realm of remote sensing, the task of semantically segmenting landslide images is traditionally reliant on supervised learning techniques. These methods necessitate extensive training datasets and meticulous pixel-level annotations—a process that demands considerable human labor and incurs high costs. To mitigate these challenges, we introduce an innovative approach that employs weakly supervised learning, integrating Class Activation Maps (CAMs) with a Cycle Generative Adversarial Network (CycleGAN). This novel methodology leverages image-level labels in lieu of pixel-level annotations. Initially, CAMs are utilized to locate the landslide’s rough area. Subsequently, CycleGAN generates a synthetic image devoid of landslides, which, when contrasted with the original, yields precise segmentation results. The efficacy of our approach is quantified using the mean Intersection-over-Union (mIOU) metric, demonstrating a superior performance with an mIOU of 0.228. Additionally, when juxtaposed with a U-Net network’s supervised learning technique, which scored an mIOU of 0.408, our results affirm the viability of weakly supervised learning for accurate landslide semantic segmentation in remote sensing imagery. This method significantly alleviates the burden of data annotation. Incorporating the advancements of Score-CAM [6], which surpasses Grad-CAM in object localization accuracy, we further refine our model. Score-CAM’s enhanced precision in identifying relevant features contributes to the improved segmentation of landslide areas, promising a new frontier in remote sensing image analysis.

Description

Dissertation under the supervision of Ujjwal Bhattacharya

Keywords

Class Activation Maps (CAMs), Cycle Generative Adversarial Network (CycleGAN), Score-CAM

Citation

45p.

URI

http://hdl.handle.net/10263/7505

Collections

Dissertations - M Tech (CS)

Full item page

Weakly Supervised Semantic Segmentation Using Visual Explainability

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By