Data Engineer Aug 2022- Present
Contributed to Data Warehouse 2.0 re-architecture, reducing on-call effort by 15 hours/week, improving Redshift query wait time by 95%, and eliminating monolithic (1K+ line) jobs through team-wide pipeline modularization. (Amazon Inc. - February 2024-March 2025)
Led GDPR and COPPA compliance onboarding for Alexa data, enabling child-data deletion and privacy safeguards and reducing potential legal exposure by an estimated $25M. (Amazon Inc. - January 2025-April 2025)
Built a proactive data quality framework across 50 raw pipelines, implementing ingestion trend analysis, schema drift detection, value-range validation, null/duplicate thresholds, and freshness SLAs with automated alerts—reducing downstream failures and cutting Weekly Business Review delays by 75%. (Amazon Inc. - April 2025- December 2025)
Drove telemetry strategy and data pipeline design for Alexa+ AI communication features, enabling 300+ telemetry points across SendToPhone, EmailToContacts, Calling, Messaging, Announcements, Drop-In, and Bluetooth Texting; built and owned end-to-end ingestion and transformation pipelines for 30+ raw sources, delivering high-quality analytics-ready datasets used by BIEs and Data Scientists
Owned end-to-end setup of a secure, isolated data environment for the ML team, provisioning schemas, S3, IAM roles, datanet access, and guardrails to protect production data; Enabled self-serve ML and analytics by training the team on Redshift–QuickSight workflows, metric transformations, and reporting, while adding usage tracking to improve cost visibility and compliance
Led end-to-end revamp of Alexa Calling & Drop-In data pipeline, simplifying undocumented legacy architecture, consolidating 6 datanet jobs into 2, standardizing 7 customer ID types to ECID, introducing incremental processing (350 → 20 mins, ~94% runtime reduction), and expanding feature analytics (in-home/out-of-home, call duration, Drop-In vs Calling, child vs adult profiles) while maintaining metric parity with PM expectations
Led end-to-end Application Recertification to close a multi-year security gap (2017-2022), driving 30% stronger security posture, fewer code vulnerabilities, and fewer architectural flaws, while collaborating cross-org and reporting to L10 leadership
Data Engineer January 2021- Present
Developed and automated time saving ETL data pipelines in Matillion to ingest over 10 healthcare databases and from multiple cloud sources using Python and SnowSQL
Independently modified and enhanced existing pipelines which directly improved the load time by 95%
Served as a DBA in 80+ medium to large scale project implementations for over 30+ users in team
Worked with domain experts, engineers, and other data scientists to develop, implement, and improve upon existing systems and create new tools
Participated and contributed to design meetings for creation of the Data Model and provide guidance on best data architecture practices
Mentored users on understanding of database systems by conducting pre-implementation workshops, delivering group and individual training sessions, and creating user-friendly workshop materials
Junior Data Engineer October 2019- December 2020
Assembled large, complex data sets that meet functional / non-functional business requirements
Played an integral role in team wide efforts to migrate existing architecture using Microsoft Azure Cloud computing services such as Database, Datawarehouse, Data factory, and blob storage accounts and, Snowflake and Matillion
ETL (Extract, Transform, Load) - Extract data from different data sources, transform and load to database or data warehouse using PolyBase and external tables
Product Developer June 2018- December 2018
Wrote over 10,000+ lines of production level code in python and worked with cyber security data in JSON and XML data structures
Development, debugging and documenting complex JSON flows for company product called Fusion
Interacted with partner companies directly to understand the technical aspects of the services provided by them and their integration into the company product
Developed security use cases for a cyber security and threat intelligence platform Fusion
Programmed integration extensions for partners such as Palo Alto, Reversing Labs, Virus Total, DomainTools etc.
Developed easy to use documentation for the tools for integration
Teaching Assistant, Operations Research January 2019- April 2019
Linear Programming, Big M, Duality Theory, Transportation and Assignment Problems, Network Analysis
Held Office hours to help students with queries and graded assignments
Operations Supervisor, Campus Recreation October 2017- Present
Conduct inventory, gather data and make reports that assist in process optimization
Create excel reports of data collected for easy decision making
Oversee implementation of procedures, goals and objectives within department
Supervise facility assistants and cater to patrons regarding all queries and inquiries
Oversee, process and be accountable for all payments, membership and facility reservations