Vijay Gohil

I’m a machine learning and computer vision engineer at 2K Games and a visiting researcher at the NYU AI4CE Lab. My work sits at the intersection of production ML systems and research, with a focus on 3D perception, multiview reasoning, and vision interpretability.

My research interests include computer vision, vision transformer interpretability, and multi-camera 3D reconstruction. I’m particularly interested in mechanistic interpretability for vision models: understanding what vision transformers represent internally and how those representations can be measured, compared, and improved.

On the applied side, I work on object detection, pose estimation, 3D triangulation, and animation-adjacent ML tooling for video games, where geometric understanding has to meet production reliability and real-time constraints.

news

Aug 11, 2025	Invited as speaker at Ai4 2025 conference at MGM Grand, Las Vegas (August 11–13, 2025) — North America’s premier AI industry event with 8,000+ attendees from 85+ countries.
Jul 09, 2025	Featured by iMerit on LinkedIn in connection with the upcoming Ai4 2025 conference.
Jun 11, 2025	Featured by Ai4 on LinkedIn as a confirmed speaker at Ai4 2025.

latest posts

Nov 06, 2025	Should Developers Care about Interpretability?
Feb 11, 2025	LLM-Powered Sorting with TrueSkill

selected publications

arXiv

Scene Change Detection with Vision-Language Representation Learning

Diwei Sheng, Vijayraj Gohil, Satyam Gaba, and 5 more authors

2026

arXiv Bib HTML

@misc{sheng2026scenechange,
  title = {Scene Change Detection with Vision-Language Representation Learning},
  author = {Sheng, Diwei and Gohil, Vijayraj and Gaba, Satyam and Liu, Zihan and Hamilton-Fletcher, Giles and Rizzo, John-Ross and Liang, Yongqing and Feng, Chen},
  year = {2026},
  archiveprefix = {arXiv},
  primaryclass = {cs.CV},
}