Skip to content
View donthula9908's full-sized avatar

Block or report donthula9908

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
donthula9908/README.md
Naveen Donthula — Data Engineer

LinkedIn Email GitHub


About

Data Engineer building enterprise data platforms across Azure, Databricks, Microsoft Fabric, AWS, and Snowflake. I own the full pipeline — ingestion, transformation, quality, monitoring, and delivery.

  • Delivered ~9x pipeline throughput improvement via a single SQL ordering change on a financial data pipeline
  • Built and maintain 80+ ADF pipelines across dev/staging/prod with ARM-based CI/CD
  • Designed row-level validation frameworks using PySpark exceptAll diffs for Databricks → Synapse migration
  • Built IcM automation and SOX control narratives for daily financial reconciliation

Certifications

Certification Issuer
Azure Data Engineer Associate (DP-203) Microsoft
Fabric Analytics Engineer Associate (DP-600) Microsoft
Databricks Certified Data Engineer Associate Databricks
Databricks Certified Associate Developer for Apache Spark Databricks
AWS Certified Data Engineer – Associate (DEA-C01) AWS
SnowPro Core Certification Snowflake

Tech Stack

Cloud & Platforms

Azure Databricks Microsoft Fabric AWS Snowflake

Data Engineering

ADF PySpark Delta Lake dbt Airflow Kafka

Languages & Tools

Python SQL Scala Azure DevOps Docker Power BI


Projects

Project Stack
🏗️ Azure End-to-End Data Engineering ADF · ADLS Gen2 · Databricks · Synapse
🔥 Databricks Lakehouse Platform PySpark · Delta Lake · DLT · Unity Catalog
🧵 Microsoft Fabric Analytics Fabric · OneLake · Dataflow Gen2 · Power BI
🎬 Netflix Data Engineering Pipeline Azure · Databricks · Airflow · Streaming
🌩️ AWS Data Engineering Pipeline Glue · Lambda · Kinesis · S3 · Redshift
❄️ Snowflake & dbt Analytics Snowflake · dbt Core · Streams & Tasks
🌀 Apache Airflow Pipelines Airflow · dbt · Databricks · Docker
🤖 Azure AI for Data Engineering Azure OpenAI · LangChain · RAG · GPT-4o
⚙️ Azure DevOps CI/CD ADF · Databricks Asset Bundles · dbt
🐍 PySpark Interview Prep PySpark · Delta Lake · Performance Tuning
🗺️ Data Engineer Roadmap 2025 Azure · AWS · Databricks · Snowflake · dbt
🏅 Tokyo Olympics — Azure Pipeline ADF · ADLS Gen2 · Databricks · Synapse · Power BI
🌊 Live Streaming — NiFi + Snowpipe Apache NiFi · Snowpipe · AWS S3 · Azure
🧠 AI Pipeline Anomaly Detection Databricks · Isolation Forest · PySpark
📊 Power BI Analytics Dashboards Power BI · DAX · Microsoft Fabric · DirectLake · Synapse
📈 Tableau Data Visualizations Tableau · Snowflake · Redshift · LOD Expressions · Prep

GitHub Stats

 

@donthula9908's activity is private