About

Back in 2020, I began my journey in data engineering, diving deep into the world of data pipelines and large-scale analytics. What started as a passion for data soon turned into a career where I’ve had the opportunity to build robust data solutions for an e-commerce giant like Wayfair and now, a leading healthcare analytics company, Arcadia.

Today, I’m focused on transforming complex healthcare data into actionable insights at Arcadia, helping to improve patient care through innovative data integration and analytics. I thrive at the intersection of data and technology, where I tackle challenges in data scalability and architecture to create seamless, impactful solutions.

When I’m not optimizing data workflows, you’ll find me exploring new technologies, playing with my puppy, or digging into a good book.

Experience

  1. May 2022 - Present

    Data Engineer Arcadia

    Built and optimized ETL pipelines, integrating diverse healthcare data into Arcadia’s analytics platform. Developed tools to enhance data quality, reduce errors, and streamline massive datasets. Collaborated with clinical teams to implement custom solutions, delivering actionable insights for at-risk populations.

    PythonSparkSQLScalaAWS S3AWS Athena
  2. Jan 2021 - Sept 2021

    Data Engineer Co-op Wayfair

    Developed analytics solutions for Wayfair’s financial products, making data process migrations that doubled performance. Automated data schema monitoring, reducing failures and costs. Improved customer data visibility, driving informed decision-making.

    BigQueryLookerSQLPythonJavaScriptData Modeling

Projects

  1. Automated Essay Scoring

    Automated Essay Scoring

    An automated essay scoring website built with Python, Django, and Keras, utilizing LSTM models to evaluate and score essays.
    DjangoKerasDeep Learrning
  2. Ethereum Analytics Dashboard

    Ethereum Analytics Dashboard

    A SparkStreaming application that reads CSV data from Kafka producers and writes to a MySQL database in real-time.
    KafkaScalaSparkStreamingMySQL
  3. Music Recommendation System

    Music Recommendation System

    A music recommendation system using collaborative filtering with Spark’s MLLib, deployed on AWS EMR and sourcing data from S3 buckets.
    PySparkAWS EMRCollaborative Filtering

Blog Posts

  1. 6/20/2019

    Differential Privacy for Deep Learning

    Using Differential Privacy to classify MNIST Digits and perform PATE Analysis on the model

  2. 7/15/2019

    A 5-Step Guide on incorporating Differential Privacy into your Deep Learning models

    Using PyTorch and Differential Privacy to classify MNIST digits