Cahya Wirawan

Systems Engineer, Software Developer at CTBTO

Vienna, Austria
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

👤
Senior
🎓
Top School
Cahya Wirawan is a seasoned systems engineer and software developer with 19 years of experience building secure, production-ready systems at international organizations like CTBTO and IAEA. He combines a strong IT security and Red Hat systems background with deep expertise in embedded systems, DSP, and machine learning, and actively experiments with NLP, computer vision, and speech recognition for curiosity-driven projects. Cahya has contributed Indonesian language datasets to the widely used Hugging Face datasets hub, reflecting both practical data engineering skills and a commitment to language-specific ML resources. Based in Vienna and educated in information and communication systems, he brings a rare blend of operational systems know-how and hands-on ML dataset curation that helps bridge research prototypes and secure deployments.
code19 years of coding experience
job7 years of employment as a software developer
bookInformation and Communication System, Bachelor, Information and Communication System, Bachelor at Fachhochschule Technikum Wien
languagesGerman, English, Indonesian
github-logo-circle

Github Skills (7)

machine-learning10
nlp10
python10
datasets10
data-engineering9
pandas9
computer-vision4

Programming languages (12)

TypeScriptShellC++RustSCSSJavaScriptGoHTML

Github contributions (5)

github-logo-circle
huggingface/datasets

Dec 2020 - Sep 2021

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Role in this project:
userData Scientist
Contributions:16 reviews, 14 commits, 16 PRs in 9 months
Contributions summary:Cahya contributed to the `huggingface/datasets` repository by adding and modifying datasets related to Indonesian language processing. Their work involved creating dataset configurations, defining features, and generating examples, as seen in the addition of the `id_nergrit_corpus`, `id_newspapers_2018`, `id_clickbait`, and `id_panl_bppt` datasets. The user also updated existing dataset links, and corrected dataset structures by making various updates. The primary focus appears to be in curating and integrating Indonesian language datasets.
ml-modelstensorflownatural-language-processingmanipulationdata-science
Contributions:78 commits, 63 pushes, 1 branch in 6 months
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Cahya Wirawan - Systems Engineer, Software Developer at CTBTO