Featured Projects
A selection of data engineering projects showcasing end-to-end pipeline development, automation, and cloud infrastructure expertise.
Enterprise Data Sync Pipeline
End-to-end data synchronization pipeline for employee, member-role, and account data, with comprehensive failure logging.
- Designed Role/Member-Role Entity schema
- Built Employee & Account Role sync pipeline
- Implemented failure logging system
ETL, Database Design, Data Sync
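The failure-logging idea in this project can be sketched roughly as follows. This is an illustrative sketch only, not the project's actual code: `SyncResult`, `sync_records`, `demo_upsert`, and the record fields are hypothetical names chosen for the example. The point is that a failed record is logged and collected rather than aborting the whole sync run.

```python
import logging
from dataclasses import dataclass, field

logger = logging.getLogger("sync")


@dataclass
class SyncResult:
    synced: list = field(default_factory=list)    # ids written successfully
    failures: list = field(default_factory=list)  # (id, error message) pairs


def sync_records(records, upsert):
    """Apply `upsert` to each record; log and collect failures instead of aborting."""
    result = SyncResult()
    for record in records:
        record_id = record.get("id")
        try:
            upsert(record)
            result.synced.append(record_id)
        except Exception as exc:
            # One bad record should not stop the rest of the batch.
            logger.error("sync failed for record %s: %s", record_id, exc)
            result.failures.append((record_id, str(exc)))
    return result


def demo_upsert(record):
    # Hypothetical validation rule standing in for a real database write.
    if not record.get("role"):
        raise ValueError("missing role")


result = sync_records(
    [{"id": 1, "role": "admin"}, {"id": 2, "role": ""}],
    demo_upsert,
)
```

In a real pipeline the collected failures would typically be written to a dedicated table or log store so they can be retried or audited later.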
ELT Pipeline with DBT
Modern Data Stack implementation with a containerized ELT system, automated workflows, and real-time data quality monitoring.
- 100% automated daily workflows via Airflow
- Star Schema transformation with DBT
- Data Quality Framework (6 dimensions)
DBT, Airflow, Docker, Star Schema
CMS Migration Automation
Test automation framework for CMS migration using SeleniumBase, enabling automated QC and content verification.
- Automated login & page creation flows
- QC scripts for CMS migration
- Content consistency verification
SeleniumBase, Python, QA Automation
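A content consistency check of the kind listed above might look something like the sketch below. It assumes the page text has already been extracted from both the old and the new CMS (for example by browser automation) into dictionaries keyed by page path; `normalize` and `find_mismatches` are hypothetical helper names, and no SeleniumBase calls are shown.

```python
import re


def normalize(text):
    """Collapse whitespace and lowercase so cosmetic differences are ignored."""
    return re.sub(r"\s+", " ", text).strip().lower()


def find_mismatches(source_pages, migrated_pages):
    """Return the sorted page keys that are missing or differ after migration."""
    mismatches = []
    for key, source_text in source_pages.items():
        migrated_text = migrated_pages.get(key)
        if migrated_text is None or normalize(source_text) != normalize(migrated_text):
            mismatches.append(key)
    return sorted(mismatches)
```

How strict the normalization should be is a design choice: collapsing whitespace and case avoids false alarms from template changes, at the cost of missing purely typographic regressions.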
Legal Data Platform
Backend development and refactoring for an EU legal data platform, including web crawlers, authentication, and subscription systems.
- Refactored EU legal web crawlers
- Improved authentication & billing
- Role-based access control system
Python, Web Scraping, Backend, RBAC
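For readers unfamiliar with role-based access control, the core idea can be sketched in a few lines. The role names, permission names, and the `is_allowed` helper below are hypothetical and are not taken from the platform itself.

```python
# Hypothetical mapping of roles to the permissions they grant.
ROLE_PERMISSIONS = {
    "admin": {"read", "write", "manage_billing"},
    "subscriber": {"read"},
}


def is_allowed(user_roles, permission):
    """Return True if any of the user's roles grants the permission."""
    return any(
        permission in ROLE_PERMISSIONS.get(role, set())
        for role in user_roles
    )
```

Unknown roles grant nothing here, so access is denied by default.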
Cloud Analytics Audit
Data stack audit and optimization for analytics infrastructure, including BigQuery, Fivetran, and BI tool evaluation.
- Audited data architecture
- Resolved cloud data warehouse connectivity issues
- Hex vs Looker Studio comparison
BigQuery, Fivetran, Looker, Mixpanel