WHERE I STUDIED

MY EDUCATION

Duke University

Data Science (M.S.)

2020 - Present

University of Toronto

Biomedical Engineering (M.Eng.)

2017 - 2018

McMaster University

Electrical & Biomedical Engineering (B.Eng.)

2011 - 2017

WHERE I WORKED

MY EXPERIENCES

  • Data Scientist

    Red Venture - Healthline Media
    08.2021 – Present
    • Build an article tagger with clustering techniques (K-means, hierarchical) to enable efficient inventory search.
    • Embed 70,000+ articles with techniques including Doc2Vec, Universal Sentence Encoder, and TF-IDF.
    • Create a classification model that automatically assigns a new article with multiple tags to help the business team place relevant advertisements and identify early healthcare-related trends.
  • Data Scientist

    Duke Anesthesiology
    08.2021 – Present
    • Leverage ensemble methods (Random Forest, XGBoost) on anesthesia case data to predict the likelihood of drug addiction and substance abuse.
    • Encode high-cardinality (200+) categorical features with Bio-ClinicalBERT and K-means clustering.
    • Launch a dashboard in Tableau to provide model interpretability and help clinicians identify at-risk providers.
  • Machine Learning Engineer Intern

    Bosch - Center for Artificial Intelligence
    05.2021 - 08.2021
    • Developed a Root Cause (RC) detection model by quantifying the statistical relationship between root cause measurements and manufacturing failures with hypothesis tests including t-test, chi-square.
    • Designed and deployed a data simulator in Hadoop to augment the RC model’s applicability in complex processes.
    • Produced behavioural-driven tests to evaluate model performance; captured and resolved false-positive cases to increase production efficiency at customer sites.
  • Graduate Research Assistant

    Duke Cancer Institute
    08.2020 - 04.2021
    • Collaborated with cross-functional teams to identify discrepancies among patient experience patterns and optimized questionnaire delivery timing & frequency for tele-visits; increased questionnaire click-through rate by 10%.
    • Visualized historical symptom trends and clustered self-reported symptoms to facilitate cancer patient triage and diagnosis.
  • Machine Learning Research Intern

    UXSINO Software Co.
    05.2020 - 08.2020
    • Queried network operation data with PostgreSQL; cleaned and analyzed data to identify failing hardware components in a network topology.
    • Created a Graph Convolutional Network-based Anomaly Detector to predict node status; outperformed previous Gaussian-based model by 12% in F1 score.
  • Product Development Engineer

    Innovere Medical Inc.
    05.2018 - 04.2020
    • Generated measurable and practical design inputs by collaborating with stakeholders to identify and attain user, regulatory, and marketing-related requirements.
    • Directed software development lifecycle activities such as software validation through an agile process; accelerated the product delivery by 2 weeks.

WHAT I DID

MY PROJECTS