Leon Weber-genzel is a Machine Learning Research Engineer with a decade of professional experience and 6+ years focused on ML and NLP research and engineering. Currently at smedo GmbH after a postdoc at LMU and a summa cum laude Doctor of Science from Humboldt-Universität, he bridges rigorous academic research with production-minded engineering. He specializes in information extraction and biomedical NLP, building models, evaluation pipelines and application software that emphasize data quality and runtime efficiency. An active open-source contributor to the well-known flair NLP framework, he implemented multiple biomedical datasets, improved entity-overlap checks and helped develop a multi-tagger model. Combining a technical CS pedigree with a BA in Philosophy, he brings analytical rigor and clear communication to projects that aim to directly improve people's lives from his base in Nuremberg.
11 years of coding experience
9 years of employment as a software developer
Bachelor of Science - BS, Computer Science, Bachelor of Science - BS, Computer Science at Freie Universität Berlin
Doctor of Science, Computer Science, summa cum laude, Doctor of Science, Computer Science, summa cum laude at Humboldt-Universität zu Berlin
Master of Science - MS, Computer Science, Master of Science - MS, Computer Science at Humboldt University of Berlin
Bachelor of Arts - BA, Philosophy, Bachelor of Arts - BA, Philosophy at Otto-Friedrich-Universität Bamberg
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Role in this project:
Back-end Developer & Data Scientist
Contributions:50 commits, 3 PRs, 41 pushes in 5 months
Contributions summary:Leon contributed significantly to the `flairnlp/flair` repository, focusing on the integration and implementation of various biomedical datasets and NER functionalities. Their commits involved adding new datasets like BioInfer, CellFinder, CHEMDNER, and others, demonstrating a strong understanding of dataset integration and data processing. They also worked on improving the efficiency of entity overlap checks and mapping corpora-specific tags to canonical tags, indicating a focus on both data quality and performance optimization. Additionally, the user contributed to the development of a multi-tagger model within the framework.
Contributions:40 commits, 7 pushes in 2 years 9 months
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.