Siddharth Dalmia is a Member of Technical Staff at WaveForms AI with eight years of experience building audio LLMs and multimodal long-context systems. He was previously a Research Scientist at Google DeepMind, where he worked on multimodal audio and long-context capabilities for Gemini. Siddharth holds a Ph.D. from Carnegie Mellon University’s Language Technologies Institute, where his research made sequence models more practical in resource-constrained settings by applying compositional principles like task simplification, reusability, transferability, and data pooling. He pairs research depth with production engineering, contributing to high-profile open-source speech tooling (notably backend and DevOps fixes for espnet) and automating tasks such as audio resampling and model logging. Based in New York, he specializes in translating advanced speech and language research into robust, deployable systems.
8 years of coding experience
9 years of employment as a software developer
Bachelor’s Degree, Computer Science at Birla Institute of Technology and Science
Doctor of Philosophy - PhD, Language Technologies, Computer Science at Carnegie Mellon University
Contributions: 26 reviews, 53 commits, 30 PRs in 1 year 4 months
Contributions summary: Siddharth primarily focused on improving the code's compatibility, debugging, and expanding the functionality of the core components. He addressed issues related to PyTorch versions and run script errors, and automated the resampling of audio files. He also improved model parameter logging and made several modifications to the configuration options of the language model.
Siddharth Dalmia - Member Of Technical Staff at WaveForms AI