Vijay Gohil

Engineer · Researcher, NYU AI4CE Lab

prof_pic.jpg

I’m an ML/CV Engineer at 2K Games and Visiting Researcher at the NYU AI4CE Lab. I work spans from building 3D computer vision systems with a focus on building systems that see and understand the 3D world to Vision Interpretability research.

My research interests include computer vision, sparse autoencoders for vision transformer interpretability, and multi-camera 3D reconstruction. I’m particularly interested in mechanistic interpretability for vision models — understanding what and how vision transformers represent the world internally.

Currently, I’m leading visaebench, a cross-architecture Sparse Autoencoder evaluation study training TopK SAEs on ImageNet across DINOv2-B, CLIP ViT-B/16, SigLIP, MAE ViT-B, and supervised DeiT, targeting NeurIPS 2026.

On the applied side, I work on object detection, pose estimation, and 3D triangulation, animation generation for Video Games— problems where geometric understanding meets real-time constraints.

news

Aug 11, 2025 Invited as speaker at Ai4 2025 conference at MGM Grand, Las Vegas (August 11–13, 2025) — North America’s premier AI industry event with 8,000+ attendees from 85+ countries.
Jul 09, 2025 Featured by iMerit on LinkedIn in connection with the upcoming Ai4 2025 conference.
Jun 11, 2025 Featured by Ai4 on LinkedIn as a confirmed speaker at Ai4 2025.
Jan 26, 2025 Started research on Vision-SAEBench — cross-architecture SAE evaluation study training Sparse Autoencoders on ImageNet across DINOv2, CLIP, SigLIP, MAE, and DeiT ViT backbones, targeting NeurIPS 2026.

latest posts

selected publications

  1. arXiv
    Scene Change Detection with Vision-Language Representation Learning
    Diwei Sheng, Vijayraj Gohil, Satyam Gaba, and 5 more authors
    2026