About

As a Senior Data Scientist at NVIDIA, I help cloud customers train and deploy conversational AI models at scale, using NVIDIA’s GPUs and software. I have a passion for speech and audio data, and I specialize in text-to-speech (TTS) algorithms and applications. I have contributed to multiple TTS projects, such as scaling NVIDIA’s Multi-speaker Multi-lingual TTS systems with Zero-shot TTS to Indic Languages.

I have a Master of Science in Computer Science from the University of Colorado Boulder, where I explored and published papers on various topics, such as cognitive science, climate prediction and AI for education. My academic background and professional experience have taught me how to apply deep learning to solve real-world problems and deliver impactful solutions. I am always eager to learn new skills and technologies, and to collaborate with other experts and innovators in the field of conversational AI.

If you are interested in collaborating, feel free to drop me a note. If you are looking to connect 1:1 with me, book time on my calendar.

Research


Scaling NVIDIA’s Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages

Akshit Arora, Rohan Badlani, Sungwon Kim, Rafael Valle, Bryan Catanzaro

project page, pdf, blog

VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation

Rohan Badlani, Akshit Arora, Subhankar Ghosh, Rafael Valle, Kevin J. Shih, João Felipe Santos, Boris Ginsburg, Bryan Catanzaro

project page, pdf, code, blog, talk

Does Deep Knowledge Tracing Model Interactions among Skills?

Shirly Montero, Akshit Arora, Sean Kelly, Brent Milne, Michael Mozer

project page, pdf, code

Interactive landslide simulator: a tool for landslide risk assessment and communication

Pratik Chaturvedi, Akshit Arora, Varun Dutt

project page, pdf, journal

Recent News


  • [February 2024] I launched my personal blog, The Curious Perspective, on Substack. Check it out here! #personal

More updates here.