Degraded Document Binarisation

dc.contributor.authorAjith, Miriyala
dc.date.accessioned2025-07-21T10:31:34Z
dc.date.available2025-07-21T10:31:34Z
dc.date.issued2025-06
dc.descriptionDissertation under the supervision of Dr. Ujjwal Bhattacharyaen_US
dc.description.abstractIn this study, I explored degraded document binarization by reviewing two recent model frameworks and implementing their models using PyTorch. The first model is based on cGANs, specifically the DE-GAN [41] framework, which enhances degraded documents by restoring their quality prior to binarization. The second model employs vision transformers [40], inspired by the DocBinFormer architecture, which uses an autoencoder in both the encoder and decoder for effective binarization. Both models were evaluated on the ISI-Bengali dataset. Experimental results demonstrate that DE-GAN improved document quality by 4% compared to the degraded input, while the vision transformer model achieved a 14% improvement, highlighting the effectiveness of transformer-based approaches for document enhancement and binarization.en_US
dc.identifier.citation31p.en_US
dc.identifier.urihttp://hdl.handle.net/10263/7585
dc.language.isoenen_US
dc.publisherIndian Statistical Institute, Kolkataen_US
dc.relation.ispartofseriesMTech(CS) Dissertation;23-10
dc.titleDegraded Document Binarisationen_US
dc.typeOtheren_US

Files

Original bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
Dissertation_CS2310_Report_compressed (1).pdf
Size:
485.08 KB
Format:
Adobe Portable Document Format
Description:
Dissertations - M Tech (CS)
No Thumbnail Available
Name:
Dissertation_CS2310_similarity_report.pdf
Size:
535.89 KB
Format:
Adobe Portable Document Format
Description:
Plagiarism_report

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: