Summary
Di Feng is a Senior Research Scientist with 11 years of experience building multimodal AI and perception systems, currently leading efforts on vision-language foundation models and autonomous digital agents at Apple. His background spans industry research and production—developing multi-view 3D perception at Argo AI scaled to millions of images and driving robust multi-modal 3D detection, fusion, and uncertainty estimation at Bosch Research, with outcomes including publications and patents. Comfortable in Python and C++ and trained at TUM and Ulm (PhD), he blends deep academic rigor with hands-on engineering to move models from concept to production. Notably, he has supervised students and interns and helped onboard teams, reflecting a track record of mentoring alongside technical leadership. Based in Beijing with experience across the US and Europe, he focuses on post-training evaluation and robustness for foundation models, bringing autonomy-grade reliability to multimodal AI.
11 years of coding experience
6 years of employment as a software developer
Visiting Researcher, Visiting Researcher at University of California, Berkeley
Bachelor's degree Engineering, Bachelor's degree Engineering at Tongji University
Doctor of Philosophy - PhD (Dr. -Ing.) Computer Science, Doctor of Philosophy - PhD (Dr. -Ing.) Computer Science at Ulm University
Master of Science - MS Electrical and Computer Engineering, Master of Science - MS Electrical and Computer Engineering at Technische Universität München (Technical University of Munich)
English, German, Chinese