Haoran Liu

Data Scientist - IC5

San Francisco Bay Area United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
🎓
Top School
Haoran Liu is a PhD candidate in computer science at Texas A&M (TAMU CSE) with five years of software engineering and data science experience focused on graph deep learning for molecular applications. He contributes to the divelab/DIG open-source library as a back-end developer and data scientist, building dataset interfaces and preprocessing pipelines for QM9, ZINC, and MOSES using RDKit and PyTorch Geometric. Haoran excels at bridging cheminformatics and ML engineering—handling SMILES parsing, molecular property targets, and graph representations to make graph-based experiments reproducible and scalable. He brings research rigor to production-style data infrastructure, accelerating model development for molecular discovery.
code5 years of coding experience
job7 years of employment as a software developer
bookDoctor of Philosophy - PhD, Geophysics and oceanography, Doctor of Philosophy - PhD, Geophysics and oceanography at Louisiana State University
stackoverflow-logo

Stackoverflow

Stats
1reputation
0reached
0answers
0questions
github-logo-circle

Github Skills (11)

pytorch10
rdkit10
deep-learning10
python10
data-science10
data-processing10
pandas9
3d-graphics8
graph8
graphs8
3d8

Programming languages (2)

C++Python

Github contributions (5)

github-logo-circle
divelab/DIG

Apr 2021 - Aug 2021

A library for graph deep learning research
Role in this project:
userBack-end Developer & Data Scientist
Contributions:27 commits, 24 pushes, 4 branches in 3 months
Contributions summary:Haoran primarily contributed to the development of a dataset interface within the `dig/ggraph` library, focusing on molecular data processing. They added a dataset class (`PygDataset`) and implemented the necessary processing steps for several molecular datasets, including QM9, ZINC, and MOSES. This involved handling SMILES strings, molecular property targets, and graph representations using RDKit and PyTorch Geometric. The commits also indicate modifications to data loading and pre-processing pipelines for machine learning tasks involving graph data.
explainable-mlpytorchdataminingdeep-learninggraph-deep-learning
homepage
Contributions:59 pushes, 4 branches in 2 years 4 months
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Haoran Liu - Data Scientist - IC5