Haoran Liu is a PhD candidate in computer science at Texas A&M (TAMU CSE) with five years of software engineering and data science experience focused on graph deep learning for molecular applications. He contributes to the divelab/DIG open-source library as a back-end developer and data scientist, building dataset interfaces and preprocessing pipelines for QM9, ZINC, and MOSES using RDKit and PyTorch Geometric. Haoran excels at bridging cheminformatics and ML engineering—handling SMILES parsing, molecular property targets, and graph representations to make graph-based experiments reproducible and scalable. He brings research rigor to production-style data infrastructure, accelerating model development for molecular discovery.
5 years of coding experience
7 years of employment as a software developer
Doctor of Philosophy - PhD, Geophysics and oceanography, Doctor of Philosophy - PhD, Geophysics and oceanography at Louisiana State University
Contributions:27 commits, 24 pushes, 4 branches in 3 months
Contributions summary:Haoran primarily contributed to the development of a dataset interface within the `dig/ggraph` library, focusing on molecular data processing. They added a dataset class (`PygDataset`) and implemented the necessary processing steps for several molecular datasets, including QM9, ZINC, and MOSES. This involved handling SMILES strings, molecular property targets, and graph representations using RDKit and PyTorch Geometric. The commits also indicate modifications to data loading and pre-processing pipelines for machine learning tasks involving graph data.
Contributions:59 pushes, 4 branches in 2 years 4 months
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.