Aviral Gupta

Hey, thanks for stopping by!

I’m a junior undergraduate in the Computer Science Department at BITS Pilani, deeply passionate about deep learning :). My research spans various areas in AI and DL, including diffusion models, fairness, and interpretability.

I am currently a research intern at the Vision and AI Lab, IISc Bangalore, where I work on ML fairness, bias mitigation, and domain generalization using synthetic data-driven approaches. My current focus is on unsupervised debiasing techniques through counterfactual dataset generation with diffusion models. Independently, my recent work and publications explore how interpretability methods can be enhanced using formal languages and structured representations.

More recently, I have been interested in multi-modal intepretability and mechanistic interpretability in vision. I am exploring principled methods to peer inside vision models leveraging MI techniques to understand and mitigate bias through interpretability-driven solutions.

This summer, I will also be interning at Trexquant as a ML Quant Researcher, where I will work on developing alphas for trading equities and futures markets using ML methods.

Always looking to collaborate on interesting problems, drop a mail!

news

Jun 20, 2025	Paper on steering vectors for bias correction at inference time accepted at the Actionable Interpretability Workshop at ICML 2025!!
May 19, 2025	Started my summer internship at Trexquant!
May 06, 2025	Paper on harnessing diffusion-generated synthetic images for fair image classification accepted at the Synthetic Data for Computer Vision Workshop at CVPR 2025!!
Mar 13, 2025	Paper on emergent stacks in transformers also accepted at NAACL Student Research Workshop 2025!
Mar 06, 2025	First paper on emergent stacks in transformers accepted at the ICLR World Models Workshop 2025!

selected publications

ICML Workshop

No Training Wheels: Steering Vectors for Bias Correction at Inference Time

Aviral Gupta^*, Armaan Sethi^*, and Ameesh Sethi

ICML Actionable Interpretability Workshop, 2025

arXiv
CVPR Workshop

Harnessing Diffusion-Generated Synthetic Images for Fair Image Classification

Abhipsa Basu^*, Aviral Gupta^*, Abhijnya Bhatt, and 1 more author

CVPR Synthetic Data for Computer Vision Workshop, 2025
ICLR Workshop

Emergent Stack Representations in Modeling Counter Languages Using Transformers

Aviral Gupta^*, Utkarsh Tiwari^*, and Michael Hahn

ICLR World Models Workshop, 2025
NAACL Student Research Workshop, 2025

arXiv