This project implements a data backbone for a simulation of a city, a continuously updated virtual replica of real-world urban systems. It ingests, processes, and transforms mobility, energy, environmental, and citizen interaction data, enabling city planners to simulate and predict scenarios such as traffic rerouting, energy load balancing, or pollution control.
The system integrates multiple types of data (all synthetic):
- Smart meters: Energy usage.
- Air quality sensors: CO₂, PM2.5, NO₂.
- Noise sensors.
- Smart parking sensors.
- Bike-share trips.
- EV charging station logs.
- Ride-hailing platform data.
- Building energy efficiency reports.
- Water usage logs.
- Weather forecasts.
- Tool: Airflow → BigQuery
- Details: All raw data lands in BigQuery for centralized storage.
- Tools: dbt + BigQuery
- Layers:
- Staging: Standardized schemas for energy, mobility, and environment datasets.
- Tool: Airflow
- Function: Schedules dbt transformations according to source data frequency.
- Enable city planners to simulate urban scenarios in near real-time.
- Provide actionable insights into traffic, energy, and environmental patterns.
- Support sustainability initiatives through measurable KPIs.
- Facilitate predictive analytics for urban management.
-
Create a virtual environment
python -m venv venv source venv/bin/activate # macOS/Linux venv\Scripts\activate # Windows
-
Install dependencies
pip install -r requirements.txt
-
Run Airflow
airflow standalone
-
Access the Airflow UI at
http://localhost:8080to trigger DAGs.