Skip to content
View wiktorbielski's full-sized avatar

Block or report wiktorbielski

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
wiktorbielski/README.md

👋 Hi, I'm Wiktor Bielski

⚙️ Data and Analytics Engineer

Welcome to my GitHub! I'm Wiktor, a data and analytics engineer, designing and delivering scalable analytical solutions that enable data-driven decision making across the organization. My work spans ETL pipeline development, distributed data processing in PySpark and Azure Databricks, interactive dashboarding, and end-to-end workflow automation — with a focus on integrating data from diverse sources and turning it into trusted, decision-ready outputs.


🧰 Core Skills & Tools

  • Data Engineering: Building and maintaining data pipelines and ETL processes in PySpark and Azure Databricks, integrating data from warehouses, relational and columnar databases, and Azure Data Lake
  • Programming & Scripting: Python (pandas, NumPy, matplotlib), SQL, Spark SQL — writing custom scripts to enhance analytical depth, automation, and processing performance
  • Analytics & Visualization: Creating interactive dashboards and analytical reports in Power BI and Power Query to transform data into actionable insights
  • Workflow Automation: Implementing end-to-end automation in Power Automate to streamline analytical processes and increase productivity
  • Collaboration: Experienced in agile, cross-functional environments — bridging engineering and business stakeholder needs

⚙️ Technologies I Work With

PySparkApache SparkAzure DatabricksAzure Data LakeBigQueryKafkaDockerPythonpandasNumPySQLBashGitPower BIDAXPower QueryPower AutomateLooker StudioVisual Studio CodeBPMN

Pinned Loading

  1. real-time-polish-weather-kafka-pipeline real-time-polish-weather-kafka-pipeline Public

    Real-time Polish weather monitoring pipeline using Apache Kafka (KRaft), Python ETL, BigQuery storage & Looker Studio viz. Demonstrates streaming ingestion, batch loading & serverless analytics.

    Python

  2. precious-metals-rag-airflow-pipeline precious-metals-rag-airflow-pipeline Public

    An end-to-end data engineering and AI pipeline that orchestrates real-time precious metals market analysis. The system ingests spot prices via Apache Kafka, stores time-series data in Google BigQue…

    Python

  3. databricks-batch-pipeline-atp_tennis-matches databricks-batch-pipeline-atp_tennis-matches Public

    An automated Bronze → Silver → Gold ETL pipeline for ATP tennis data. Built with Databricks, PySpark, and Delta Lake. Features data quality checks, partitioning, and analytics-ready Fact/Dimension …

    Jupyter Notebook

  4. bigquery-medallion-dwh-project bigquery-medallion-dwh-project Public

    Building a modern data warehouse with Big Query, including ETL process, data modeling and analytics.