Vijay Gohil
Engineer · Researcher, NYU AI4CE Lab
I’m an ML/CV Engineer at 2K Games and Visiting Researcher at the NYU AI4CE Lab. I work spans from building 3D computer vision systems with a focus on building systems that see and understand the 3D world to Vision Interpretability research.
My research interests include computer vision, sparse autoencoders for vision transformer interpretability, and multi-camera 3D reconstruction. I’m particularly interested in mechanistic interpretability for vision models — understanding what and how vision transformers represent the world internally.
Currently, I’m leading visaebench, a cross-architecture Sparse Autoencoder evaluation study training TopK SAEs on ImageNet across DINOv2-B, CLIP ViT-B/16, SigLIP, MAE ViT-B, and supervised DeiT, targeting NeurIPS 2026.
On the applied side, I work on object detection, pose estimation, and 3D triangulation, animation generation for Video Games— problems where geometric understanding meets real-time constraints.
news
| Aug 11, 2025 | Invited as speaker at Ai4 2025 conference at MGM Grand, Las Vegas (August 11–13, 2025) — North America’s premier AI industry event with 8,000+ attendees from 85+ countries. |
|---|---|
| Jul 09, 2025 | Featured by iMerit on LinkedIn in connection with the upcoming Ai4 2025 conference. |
| Jun 11, 2025 | Featured by Ai4 on LinkedIn as a confirmed speaker at Ai4 2025. |
| Jan 26, 2025 | Started research on Vision-SAEBench — cross-architecture SAE evaluation study training Sparse Autoencoders on ImageNet across DINOv2, CLIP, SigLIP, MAE, and DeiT ViT backbones, targeting NeurIPS 2026. |
latest posts
| Nov 06, 2025 | Should Developers Care about Interpretability? |
|---|---|
| Feb 11, 2025 | LLM-Powered Sorting with TrueSkill |
| May 14, 2024 | Google Gemini updates: Flash 1.5, Gemma 2 and Project Astra |