Data Engineer building enterprise data platforms across Azure, Databricks, Microsoft Fabric, AWS, and Snowflake. I own the full pipeline — ingestion, transformation, quality, monitoring, and delivery.
- Delivered ~9x pipeline throughput improvement via a single SQL ordering change on a financial data pipeline
- Built and maintain 80+ ADF pipelines across dev/staging/prod with ARM-based CI/CD
- Designed row-level validation frameworks using PySpark
exceptAlldiffs for Databricks → Synapse migration - Built IcM automation and SOX control narratives for daily financial reconciliation
| Certification | Issuer | |
|---|---|---|
| ✅ | Azure Data Engineer Associate (DP-203) | Microsoft |
| ✅ | Fabric Analytics Engineer Associate (DP-600) | Microsoft |
| ✅ | Databricks Certified Data Engineer Associate | Databricks |
| ✅ | Databricks Certified Associate Developer for Apache Spark | Databricks |
| ✅ | AWS Certified Data Engineer – Associate (DEA-C01) | AWS |
| ✅ | SnowPro Core Certification | Snowflake |
Cloud & Platforms
Data Engineering
Languages & Tools
| Project | Stack | |
|---|---|---|
| 🏗️ | Azure End-to-End Data Engineering | ADF · ADLS Gen2 · Databricks · Synapse |
| 🔥 | Databricks Lakehouse Platform | PySpark · Delta Lake · DLT · Unity Catalog |
| 🧵 | Microsoft Fabric Analytics | Fabric · OneLake · Dataflow Gen2 · Power BI |
| 🎬 | Netflix Data Engineering Pipeline | Azure · Databricks · Airflow · Streaming |
| 🌩️ | AWS Data Engineering Pipeline | Glue · Lambda · Kinesis · S3 · Redshift |
| ❄️ | Snowflake & dbt Analytics | Snowflake · dbt Core · Streams & Tasks |
| 🌀 | Apache Airflow Pipelines | Airflow · dbt · Databricks · Docker |
| 🤖 | Azure AI for Data Engineering | Azure OpenAI · LangChain · RAG · GPT-4o |
| ⚙️ | Azure DevOps CI/CD | ADF · Databricks Asset Bundles · dbt |
| 🐍 | PySpark Interview Prep | PySpark · Delta Lake · Performance Tuning |
| 🗺️ | Data Engineer Roadmap 2025 | Azure · AWS · Databricks · Snowflake · dbt |
| 🏅 | Tokyo Olympics — Azure Pipeline | ADF · ADLS Gen2 · Databricks · Synapse · Power BI |
| 🌊 | Live Streaming — NiFi + Snowpipe | Apache NiFi · Snowpipe · AWS S3 · Azure |
| 🧠 | AI Pipeline Anomaly Detection | Databricks · Isolation Forest · PySpark |
| 📊 | Power BI Analytics Dashboards | Power BI · DAX · Microsoft Fabric · DirectLake · Synapse |
| 📈 | Tableau Data Visualizations | Tableau · Snowflake · Redshift · LOD Expressions · Prep |