Lu-hsuan Chen

Independent System Performance Engineer at 國立中央大學

New Taipei, Taiwan
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
🎓
Top School
Lu-hsuan Chen is a systems software engineer with a decade of hands-on experience and 4+ years focused on low-level performance engineering, CPU architecture, SIMD and Linux internals. He has shipped kernel and driver patches at scale, optimized networking and storage components, and improved production clustering performance (notably a CRC16 multi-byte lookup optimization that boosted RPS and latency). An active open-source contributor, he has improved simdjson, sse2neon and semu—implementing cross-ISA intrinsics like _rdtsc and adding C++20 DOM ranges tests—bridging deep systems knowledge with reproducible community-grade engineering. His background spans virtualization, USB/guest audio device VirtIO work, distributed infrastructure automation and real‑time analytics dashboards, showing a rare mix of firmware-level debugging and higher-level cloud tooling. Based in New Taipei, Taiwan, he’s open to roles in system software, performance engineering and infrastructure where low-latency, platform-aware optimization matters.
code10 years of coding experience
job4 years of employment as a software developer
bookMaster of Science - MS, Computer Science and Information Engineering, Information Technology, Master of Science - MS, Computer Science and Information Engineering, Information Technology at National Central University
languagesEnglish, Chinese, Mandarin
github-logo-circle

Github Skills (11)

arm10
c-language10
intrinsics10
cprogramming-language10
neon10
assemble9
assembler9
x86-649
x869
assembly9
testing8

Programming languages (8)

JuliaC++CJupyter NotebookRubyPythonCrystalCuda

Github contributions (5)

github-logo-circle
DLTcollab/sse2neon

May 2022 - Jan 2023

A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
Role in this project:
userBack-end Developer
Contributions:52 reviews, 9 commits, 20 PRs in 7 months
Contributions summary:Lu-hsuan implemented and tested the `_rdtsc` function, which retrieves the processor's time-stamp counter. They modified existing test cases and added new ones for `_rdtsc` to ensure its functionality. Additionally, the user refactored the code by changing the style of the `asm volatile` keywords to align with the project's coding style. They also simplified some test cases.
armv8-asimdsse-intrinsicsintrinsicsneon-intrinsics
Contributions:1 PR, 8 pushes, 1 branch in 3 years 7 months
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial