SubhrjiT/Build-With-Data

⚡ Build with Data

Modern Data Engineering Projects • Azure • Databricks • Microsoft Fabric • PySpark • SQL

A curated collection of real-world Data Engineering projects demonstrating modern data platforms, scalable ETL/ELT pipelines, Lakehouse Architecture, Delta Lake, and analytics engineering.

Azure Databricks Apache Spark PySpark Delta Lake Python SQL


Learn. Build. Scale.

Building modern data platforms through practical, production-inspired projects.


📖 About

⚡ Build with Data is a collection of practical Data Engineering projects demonstrating modern data platforms, ETL/ELT pipelines, Lakehouse Architecture, and analytics engineering using Azure, Databricks, Microsoft Fabric, PySpark, and SQL.


🎯 Topics

Azure • Databricks • Microsoft Fabric • Apache Spark • PySpark • SQL • Delta Lake • Unity Catalog • Lakehouse • ETL • ELT • Data Engineering • Data Warehousing • Analytics Engineering


🚀 Featured Projects

| Project | Platform | Description | Repository |
|---|---|---|---|
| 🤖 AI-Assisted Data Platform | Databricks | Enterprise-grade Medallion Architecture using Databricks, Delta Lake, Unity Catalog, PySpark, and AI-assisted development. | View Repository |
| ♻️ Waste Analytics Lakehouse Platform | Microsoft Fabric | End-to-end Microsoft Fabric Lakehouse implementing Bronze, Silver, and Gold architecture using OneLake, Data Factory, Lakehouse, and Warehouse. | View Repository |
| 📊 Databricks Enterprise Analytics Platform | Databricks | Enterprise analytics platform demonstrating scalable ETL pipelines, Delta Lake, Medallion Architecture, and production-ready data engineering practices. | View Repository |
| 🎵 Music Store Analysis SQL | SQL | SQL analytics project demonstrating joins, CTEs, window functions, aggregations, and business reporting techniques. | View Repository |
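
To give a flavor of the SQL techniques named for the Music Store Analysis project (joins, CTEs, window functions, aggregations), here is a minimal sketch run through Python's built-in `sqlite3` module. The `customer` and `invoice` tables, their columns, and the sample rows are all hypothetical; they are not taken from the project repository. Window functions need SQLite 3.25 or newer, which recent Python builds normally bundle.

```python
import sqlite3

# Hypothetical schema and rows for illustration only; the real project's
# tables are not shown in this README.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customer (customer_id INTEGER PRIMARY KEY, country TEXT);
    CREATE TABLE invoice (invoice_id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL);
    INSERT INTO customer VALUES (1, 'USA'), (2, 'USA'), (3, 'India');
    INSERT INTO invoice VALUES (1, 1, 10.0), (2, 1, 5.0), (3, 2, 20.0), (4, 3, 8.0);
""")

query = """
WITH customer_spend AS (              -- CTE: one row per customer
    SELECT c.customer_id,
           c.country,
           SUM(i.total) AS total_spent -- aggregation
    FROM customer AS c
    JOIN invoice AS i                  -- join
      ON i.customer_id = c.customer_id
    GROUP BY c.customer_id, c.country
)
SELECT country,
       customer_id,
       total_spent,
       RANK() OVER (                   -- window function: rank within country
           PARTITION BY country
           ORDER BY total_spent DESC
       ) AS spend_rank
FROM customer_spend
ORDER BY country, spend_rank;
"""

rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```

Each output row holds the country, the customer, that customer's total spend, and the customer's rank within the country.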

📚 What You'll Learn

✅ Enterprise ETL & ELT Pipelines
✅ Databricks Lakehouse
✅ Microsoft Fabric
✅ Azure Data Engineering
✅ Delta Lake
✅ Apache Spark
✅ PySpark Transformations
✅ SQL Analytics
✅ Data Warehousing
✅ Medallion Architecture
✅ Incremental Loading
✅ Data Validation
✅ Performance Optimization
✅ Production Data Engineering Practices
✅ Analytics Engineering
✅ Data Governance
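
As a rough illustration of three topics from the list above (Medallion Architecture, Incremental Loading, and Data Validation), the sketch below moves records through Bronze, Silver, and Gold layers using plain Python lists in place of Delta tables. The record fields, the function names `to_silver` and `to_gold`, and the watermark approach are assumptions made for this example; the project repositories may implement these ideas differently, typically with PySpark and Delta Lake.

```python
from datetime import datetime

# Bronze: raw records exactly as ingested (hypothetical sample data).
bronze = [
    {"id": 1, "site": "north", "kg": "12.5", "loaded_at": datetime(2024, 1, 1)},
    {"id": 2, "site": "north", "kg": "bad", "loaded_at": datetime(2024, 1, 2)},
    {"id": 3, "site": "south", "kg": "7.0", "loaded_at": datetime(2024, 1, 3)},
]


def to_silver(rows, watermark):
    """Bronze -> Silver: keep only rows newer than the watermark, then validate."""
    cleaned = []
    for row in rows:
        if row["loaded_at"] <= watermark:
            continue  # incremental loading: already processed in an earlier run
        try:
            kg = float(row["kg"])
        except ValueError:
            continue  # data validation: drop rows whose weight is not numeric
        cleaned.append({**row, "kg": kg})
    return cleaned


def to_gold(rows):
    """Silver -> Gold: aggregate to a reporting-friendly shape (total kg per site)."""
    totals = {}
    for row in rows:
        totals[row["site"]] = totals.get(row["site"], 0.0) + row["kg"]
    return totals


# First run: the watermark predates every record, so all rows are considered.
silver = to_silver(bronze, watermark=datetime(2023, 12, 31))
gold = to_gold(silver)

# Next run: advance the watermark, ingest one new record, process only that one.
watermark = max(row["loaded_at"] for row in bronze)
bronze.append({"id": 4, "site": "south", "kg": "3.0", "loaded_at": datetime(2024, 1, 4)})
new_silver = to_silver(bronze, watermark)

print(gold)
print([row["id"] for row in new_silver])
```

The watermark is the design choice doing the incremental work here: each run records the latest `loaded_at` it has seen, so the next run can skip everything at or before that timestamp instead of reprocessing the whole Bronze layer.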

🤝 Contributions

Contributions, suggestions, and improvements are always welcome.

If you find these projects helpful:

  • ⭐ Star the repositories
  • 🍴 Fork a project
  • 🐛 Report issues
  • 💡 Suggest improvements

📄 License

Each project repository contains its own license information.


🚀 Build with Data

Modern Data Engineering • Real Projects • Open Source

If you find these projects useful, consider starring the repositories.

Made with ❤️ for the Data Engineering community.