Skip to content
View elfahad98's full-sized avatar

Highlights

  • Pro

Block or report elfahad98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
elfahad98/README.md

El-Fahad COMBO — Data Scientist · ML Engineer · MLOps

MSc Applied Mathematics & Statistics — Data Science · Université de Caen Normandie 🇫🇷

I build end-to-end ML systems — from raw data ingestion to production deployment and monitoring.
Clean pipelines. Reproducible results. Real business impact.

🎯 Open to : Data Science · Machine Learning · ML Engineering · MLOps · Data Engineering


🏅 Certification


🧠 About

  • 🤖 Machine Learning & NLP — classification, fraud detection, imbalanced learning, log analysis, clustering
  • 📈 Time series — forecasting, feature engineering, model comparison, experiment tracking with MLflow
  • ⚙️ ML Engineering / MLOps — model deployment, confidence scoring, production monitoring, alerting workflows
  • 🔌 Data Engineering — ETL/ELT pipelines, BigQuery, REST API integration, cloud automation
  • ☁️ Cloud — GCP for data pipelines and serverless deployment; Azure ML, Azure Databricks and Azure AI Foundry

🧰 Tech Stack


⚡ What I Build

data collection & API integration  →  cleaning & preprocessing  →  feature engineering
→  ML model training & tuning  →  evaluation & tracking (MLflow)
→  API / app integration  →  cloud deployment (Cloud Run)
→  monitoring, alerting & dashboards

Not just notebooks — complete, deployable systems.


🔥 Featured Projects

🏦 Fraud Detection — IEEE-CIS (Kaggle)

Tech: PySpark · Spark MLlib · Python · SQL · Superset

  • Distributed pipeline: join, clean and prepare identity + transaction data at scale
  • Feature engineering, supervised modeling, imbalanced learning
  • Evaluation: ROC-AUC, PR-AUC, recall, F1 + dashboard for fraud pattern analysis

➡️ github.com/elfahad98/ieee-fraud-pyspark


🔐 Account Takeover Detection

Tech: Python · scikit-learn · XGBoost · MLP · pandas · NumPy

  • Behavioral and time-based feature engineering (frequency, device signals, login patterns)
  • Benchmarking: Logistic Regression vs XGBoost vs MLP
  • Imbalanced learning, threshold tuning, recall-first optimization

➡️ github.com/elfahad98/ato-fraud-detection-mlp


📈 Time Series Forecasting

Tech: Python · pandas · scikit-learn · statsmodels · MLflow

  • Preprocessing pipeline, lag and rolling-window features, leakage-free splits
  • Model comparison: Ridge vs SARIMA/SARIMAX vs baselines
  • Experiment tracking with MLflow, evaluation on validation and test sets

➡️ github.com/elfahad98/water-temp-forecast


📫 Contact

Pinned Loading

  1. ieee-fraud-pyspark ieee-fraud-pyspark Public

    Détection de fraude bancaire avec PySpark — pipeline ML distribué

    Jupyter Notebook 4

  2. ato-fraud-detection-mlp ato-fraud-detection-mlp Public

    Analyse et modélisation d’un système de détection de fraude (Account Takeover) pour repérer les connexions suspectes et comportements anormaux.

    Jupyter Notebook 4

  3. base-donnees-foot base-donnees-foot Public

    Application web en PHP + PostgreSQL pour gérer une base de joueurs de football.

    HTML 1

  4. game-trio game-trio Public

    Application Python (Tkinter) regroupant trois jeux interactifs : Snake, Turmites et Conway’s Game of Life.

    Python 1

  5. etl-datawarehouse etl-datawarehouse Public

    Mini Data Warehouse alimenté via des pipelines ETL Apache Hop (dimensions + fait). Modélisation étoile, SQL, et gestion de la qualité des données.

    1

  6. water-temp-forecast water-temp-forecast Public

    Multi-horizon water temperature forecasting on 7 monitoring stations using time-series modeling, temporal backtesting, and MLflow.

    Jupyter Notebook