Fei Kou

Member Of Technical Staff at Anthropic

San Francisco Bay Area, United States

Summary

🤩 Rockstar
🎓 Top School
Fei Kou is a Member of Technical Staff at Anthropic in the San Francisco Bay Area with eight years of engineering experience spanning financial technology and GPU inference optimization. He spent an extended tenure at Facebook driving GPU inference enablement and performance, and now applies that low-level, performance-first mindset to production LLM infrastructure. Early roles in reference-data and banking technology at Nomura and JPMorgan gave him a discipline for data integrity and operational reliability that informs his systems design. Fei holds a BS in Computer Science from SUNY Stony Brook and is adept at turning GPU and systems expertise into scalable, production-ready inference pipelines.
8 years of coding experience
13 years of employment as a software developer
High School
Bachelor of Science (BS), Computer Science at State University of New York at Stony Brook
Languages: Chinese, English

Stackoverflow

Stats
1 reputation
0 reached
0 answers
0 questions

Github Skills (33)

python 10
acceleration 10
machine-learning 10
numpy 10
deep-learning 10
tensorflow 10
neural-networks 10
gpu 10
autograd 10
neural-network 10
tensor 10
gpu-acceleration 10
analyser 9
multiplication 9
amd-gpu 8

Programming languages (2)

C++, Python

Github contributions (5)

feikou/pytorch

Oct 2023 - Apr 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration
Contributions: 25 pushes, 7 branches in 6 months
amateurcoffee/AITemplate

Apr 2023 - Apr 2023

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Contributions: 6 pushes, 1 branch in 1 day