Jared Casper is a Senior Deep Learning Research Scientist with 17 years of experience combining foundational research and production-grade engineering at organizations including NVIDIA, Baidu, and Oracle Labs. He holds a Ph.D. from Stanford and a B.S. from MIT and focuses on scaling transformer training and building dependable ML infrastructure. His contributions to NVIDIA’s Megatron-LM include work on PyTorch RNG state handling and large-model sampling. Jared also has a strong systems and compiler background, shown by substantive back-end contributions to the Icarus Verilog project that improved compiler compatibility and added SystemVerilog 2009 features. Based in Palo Alto, he works at the intersection of deep learning research and low-level software reliability, bringing academic rigor to production systems.
17 years of coding experience
11 years of employment as a software developer
B.S., Electrical Engineering and Computer Science at Massachusetts Institute of Technology
Ph.D., Computer Science at Stanford University
Ongoing research training transformer models at scale
Role in this project:
ML Engineer & Data Scientist
Contributions: 6 releases, 2 reviews, 351 commits in 3 years 4 months
Contributions summary: Jared's contributions focused on supporting and improving the PyTorch RNG state API and keeping the codebase up to date with the latest changes, which suggests a role in maintaining or optimizing the codebase. He also fixed a specific bug and added code to generate samples. Together, these contributions indicate involvement in both model functionality and the software engineering side of the project.
Contributions summary: Jared primarily contributed to improving the Icarus Verilog compiler. He fixed compatibility issues with different GCC versions by including the necessary header files, and he fixed memory leaks in specific functions. He also added new features, such as the ability to generate SystemVerilog 2009 constructs, along with the corresponding tokens and language elements, and he corrected system tasks and some of their arguments.
icarus, icarus-verilog, verilog
Jared Casper - Senior Deep Learning Research Scientist at NVIDIA