Summary
Apala Guha is a Principal Engineer with a decade of experience designing and optimizing AI infrastructure that bridges hardware, software, and cloud systems to accelerate large models in production. She has led hardware-software co-design and inference platform delivery at Microsoft Azure and optimized model runtimes across AI accelerator startups including Tenstorrent, Lightmatter, and now Cerebras. Her background spans academic research in energy-efficient architectures and compilers to hands-on engineering work on custom accelerators, giving her a rare combination of theoretical rigor and product-focused execution. Based in Greater Boston, she consistently translates deep technical ideas into cross-team impact, from compiler-level optimizations to end-to-end inference pipelines. An unexpected throughline in her career is sustained focus on energy- and performance-aware systems, dating back to postdoctoral work on exascale efficiency.
9 years of coding experience
16 years of employment as a software developer
B.E Computer Science and Engineering, B.E Computer Science and Engineering at Jadavpur University
M.E Computer Engineering, M.E Computer Engineering at University of Virginia
English, Hindi