What cardiovascular conditions are associated with different features of ECG readouts?
Recently, the Patient Contrastive Learning of Representations (PCLR) featurization—a vector of ~320 numbers—was found to more "expressive, performant, and practical" than raw electrocardiogram (ECG) data for cardiovascular machine learning tasks (Diamant et al., 2022). Traditionally, cardiologists read an ECG to diagnose adverse cardiac events, monitor drug toxicity, and screen patients who might be at risk for certain conditions. Using XAI and generative learning techniques on large datasets from MIMIC-IV (MIT) and the UK Biobank (NHS), I seek to understand internal feature relationships in PCLR and associate feature perturbations with clinical cardiac conditions.
Goal: To develop a best-in-class AI-assisted clinical decision making tool for maternal preeclampsia.
Preeclampsia is a pregnancy disorder characterized by hypertension that affects 1 in 10 women, claiming the lives of 50,000 mothers and 500,000 babies annually. Because its warning signs are difficult to detect and may only appear close to delivery, preeclampsia often goes undetected until delivery, when complications can become serious. Following the discovery of novel blood biomarkers, the AI_PREMie team began developing an AI risk stratification tool to assist clinicians. I am running various machine learning optimization and feature engineering techniques to improve the performance of ensemble models at diagnostic classification. My optimization techniques, in addition to novel biomarker data, enables our model to outperform market competitors by large margins.
Can protein structure inform strategic priorities for discovery biology and pharmacogenomics?
Performed structural alignment and hierarchical clustering (Dali, AlphaFold) of 600+ cancer driver proteins identified by OncoKB. Constructed a polar dendrogram where proteins were clustered by structural similarity, and encoded patient mutation frequency (AACR GENIE) with node radius. By clustering cancer driver proteins by structural homology, I informed the decision-making for next generation drug priorities and enabled efficient cancer targeting strategies. I documented this project's workflow over here.
Goals: develop generalizable tools to increase pharmacogenomics throughput and expidite ADME screening.
Goal: To be updated soon!
Description coming soon!
Full stack web developer for Global Biologics team in Cambridge, MA. Two of my major projects included:
I will be beginning my doctoral studies in New York City in August, 2024. I'm excited to become a part of the cutting-edge at Sinai, learning from the best minds in computational biomedicine and clinical big data!
I love to stay active. I enjoy playing basketball, tennis, pickleball, and frisbee. I'm a marathoner and recently began on-season training for the Connemarathon Ultra (39.3 miles). 2024 will be the year I break 3:15, I just know it! I enjoy reading non-fiction and memoirs; I just finished And Finally and now I'm reading The Henna Artist, and I stay up to date with Nature and The New York Times as best I can.
I can't hold a conversation about TV shows, but I can talk your ear off about music. Bad Bunny and 6LACK had a chokehold on my Spotify playlists this year. I casually produce music and I host a weekly radio show (PM.fm! Wednesdays at 3pm EST), and a close friend and I are debuting our music podcast in January 2024. Check out my original music and some of my favorite songs!
And finally, here's my reading bucket list for 2024. If you have any recommendations, please let me know :)