About
Back in 2020, I began my journey in data engineering, diving deep into the world of data pipelines and large-scale analytics. What started as a passion for data soon turned into a career where I’ve had the opportunity to build robust data solutions for an e-commerce giant like Wayfair and now, a leading healthcare analytics company, Arcadia.
Today, I’m focused on transforming complex healthcare data into actionable insights at Arcadia, helping to improve patient care through innovative data integration and analytics. I thrive at the intersection of data and technology, where I tackle challenges in data scalability and architecture to create seamless, impactful solutions.
When I’m not optimizing data workflows, you’ll find me exploring new technologies, playing with my puppy, or digging into a good book.
Experience
May 2022 - Present Data Engineer • Arcadia
Built and optimized ETL pipelines, integrating diverse healthcare data into Arcadia’s analytics platform. Developed tools to enhance data quality, reduce errors, and streamline massive datasets. Collaborated with clinical teams to implement custom solutions, delivering actionable insights for at-risk populations.
PythonSparkSQLScalaAWS S3AWS AthenaJan 2021 - Sept 2021 Data Engineer Co-op • Wayfair
Developed analytics solutions for Wayfair’s financial products, making data process migrations that doubled performance. Automated data schema monitoring, reducing failures and costs. Improved customer data visibility, driving informed decision-making.
BigQueryLookerSQLPythonJavaScriptData Modeling
Projects
Automated Essay Scoring
An automated essay scoring website built with Python, Django, and Keras, utilizing LSTM models to evaluate and score essays.DjangoKerasDeep LearrningEthereum Analytics Dashboard
A SparkStreaming application that reads CSV data from Kafka producers and writes to a MySQL database in real-time.KafkaScalaSparkStreamingMySQLMusic Recommendation System
A music recommendation system using collaborative filtering with Spark’s MLLib, deployed on AWS EMR and sourcing data from S3 buckets.PySparkAWS EMRCollaborative Filtering
Blog Posts
6/20/2019 Differential Privacy for Deep Learning
Using Differential Privacy to classify MNIST Digits and perform PATE Analysis on the model
7/15/2019 A 5-Step Guide on incorporating Differential Privacy into your Deep Learning models
Using PyTorch and Differential Privacy to classify MNIST digits