My research encompass training, alignment, and benchmarking of language models. With the growing integration of AI into our daily lives, I am passionate about developing safe and robust AI systems.

I am currently part of the research group at ServiceNow where my work focuses on enhancing LLMs in post-training phase through data curation, alignment, tool-augmented reasoning, and benchmarking. Prior to that, I was an AI Resident at Facebook AI Research, where I worked with Marcus Rohrbach on developing models with Multimodal understanding of vision and language content. During my Masters, I worked on evaluating the robustness and generalization of language models as part of my thesis at IIIT Hyderabad.
More details are available in my cover letter.

Research

Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA
Rishabh Maheshwary, Masoud Hashemi, Khyati Mahajan, Shiva Krishna Reddy Malay, Sai Rajeswar, Sathwik Tejaswi Madhusudhan, Spandana Gella, Vikas Yadav
Under Submission in ARR
M-RewardBench: Evaluating Reward Models in Multilingual Settings
Srishti Gureja*, Lester James V. Miranda*, Shayekh Bin Islam*, Rishabh Maheshwary*, Drishti Sharma, Gusti Winata, Nathan Lambert, Sebastian Ruder, Sara Hooker, Marzieh Fadaee
ACL 2025
Enhancing Alignment using Curriculum Learning & Ranked Preferences
Pulkit Pattnaik*, Rishabh Maheshwary*, Kelechi Ogueji, Vikas Yadav, Sathwik Tejaswi Madhusudhan
EMNLP 2024
M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models
Rishabh Maheshwary, Vikas Yadav, Hoang Nguyen, Khyati Mahajan, Sathwik Tejaswi Madhusudhan
NAACL 2025
Improving Selective Visual Question Answering by Learning from Your Peers
Corentin Dancette, Spencer Whitehead, Rishabh Maheshwary, Ramakrishna Vedantam, Stefan Scherer, Xinlei Chen, Matthieu Cord, Marcus Rohrbach
CVPR 2023
Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers
Vivek Kumar, Rishabh Maheshwary, Vikram Pudi
NAACL 2022 (Oral)
A Strong Baseline for Query Efficient Attacks in a Black Box Setting
Rishabh Maheshwary*, Saket Maheshwary*, Vikram Pudi
EMNLP 2021
Adversarial Examples for Evaluating Math Word Problem Solvers
Vivek Kumar*, Rishabh Maheshwary*, Vikram Pudi
EMNLP 2021, Findings
Generating natural language attacks in a hard label black box setting
Rishabh Maheshwary, Saket Maheshwary, Vikram Pudi
AAAI 2021
A context aware approach for generating natural language attacks
Rishabh Maheshwary, Saket Maheshwary, Vikram Pudi
AAAI 2021, Student Poster