Exploring Character-level Attacks on Neural Ranking Models

dc.contributor.authorHalder, Surjyanee
dc.date.accessioned2025-07-15T10:26:21Z
dc.date.available2025-07-15T10:26:21Z
dc.date.issued2025-06
dc.descriptionDissertation under the supervision of Dr. Debapriyo Majumdaren_US
dc.description.abstractNeural ranking models (NRMs) have achieved state-of-the-art performance in information retrieval, yet they remain highly susceptible to subtle adversarial inputs such as character-level typos. This project explores the robustness of such systems by introducing a reinforcement learning (RL)-based query perturbation framework. RL agents—PPO, DQN, and A2C—were trained to minimally modify user queries (e.g., through character deletions or swaps) with the goal of significantly altering the resulting document rankings, as measured by Kendall’s Tau. Experiments were conducted on the TREC DL 2019 and 2020 benchmarks using two different neural rankers: MiniLM and a fine-tuned CharacterBERT model. The perturbation attacks were shown to succeed in over 85% of cases for MiniLM and approximately 40% for CharacterBERT, indicating varying degrees of vulnerability. To mitigate these effects, a set of pretrained query recovery models—such as T5-large-spell, spelling-correction-base, and grammar correction modules—were applied to restore the original query form. When used in combination, these recovery mechanisms reduced the MiniLM attack success rate to around 52%, demonstrating partial robustness. This study underscores both the fragility of neural rankers to character-level noise and the value of lightweight correction pipelines in improving retrieval resilience.en_US
dc.identifier.citation45p.en_US
dc.identifier.urihttp://hdl.handle.net/10263/7569
dc.language.isoenen_US
dc.publisherIndian Statistical Institute, Kolkataen_US
dc.relation.ispartofseriesMTech(CS) Dissertation;23-31
dc.subjectNeural Ranking Modelsen_US
dc.subjectReinforcement learning (RL)en_US
dc.subjectTREC DL 2019en_US
dc.subjectMiniLMen_US
dc.subjectCharacterBERT modelen_US
dc.titleExploring Character-level Attacks on Neural Ranking Modelsen_US
dc.typeOtheren_US

Files

Original bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
Surjyaneeh_dissertation (2) (1).pdf
Size:
1.11 MB
Format:
Adobe Portable Document Format
Description:
Dissertations - M Tech (CS)
No Thumbnail Available
Name:
CS2331.pdf
Size:
959.39 KB
Format:
Adobe Portable Document Format
Description:
Plagiarism_report

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: