Home

As a Senior Solutions Architect at NVIDIA, I help large enterprises train and deploy Scalable Generative AI systems at scale, using NVIDIA hardware and software platforms available in cloud. I have a passion for language, speech and audio data, and I specialize in large language models (LLM) and text-to-speech (TTS) algorithms and applications. I have contributed to multiple TTS projects, such as scaling NVIDIA’s Multi-speaker Multi-lingual TTS systems with Zero-shot TTS to Indic Languages.

I have a Master of Science in Computer Science from the University of Colorado Boulder, where I explored and published papers on various topics, such as cognitive science, climate prediction and AI for education. My academic background and professional experience have taught me how to apply deep learning to solve real-world problems and deliver impactful solutions. I am always eager to learn new skills and technologies, and to collaborate with other experts and innovators in the field of conversational AI.

If you are interested in collaborating, feel free to drop a note. If you are looking connect 1-1 with me, book time on my calendar.

Research

	Scaling NVIDIA’s Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages Akshit Arora, Rohan Badlani, Sungwon Kim, Rafael Valle, Bryan Catanzaro project page, pdf, blog
	VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation Rohan Badlani, Akshit Arora, Subhankar Ghosh, Rafael Valle, Kevin J. Shih, João Felipe Santos, Boris Ginsburg, Bryan Catanzaro project page, pdf, code, blog, talk
	Does Deep Knowledge Tracing Model Interactions among Skills? Shirly Montero, Akshit Arora, Sean Kelly, Brent Milne, Michael Mozer project page, pdf, code
	Interactive landslide simulator: a tool for landslide risk assessment and communication Pratik Chaturvedi, Akshit Arora, Varun Dutt project page, pdf, journal

Akshit Arora

Home

Research

Scaling NVIDIA’s Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages

VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation

Does Deep Knowledge Tracing Model Interactions among Skills?

Interactive landslide simulator: a tool for landslide risk assessment and communication

Recent News

Error

Research

Scaling NVIDIA’s Multi-speaker Multi-lingual TTS Systems with Zero-Shot TTS to Indic Languages

VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation

Does Deep Knowledge Tracing Model Interactions among Skills?

Interactive landslide simulator: a tool for landslide risk assessment and communication

Recent News

Templates (for web app):

Error