⛏️ Mining Quality Intelligence — Predictive Quality Control System

A full-stack machine learning system that predicts silica concentrate quality failures in an iron ore flotation plant — hours before they happen.

🔗 Demo Link

👉 View Live Streamlit Dashboard

📋 Table of Contents

Business Understanding
Screenshots
Technologies
Setup
Approach
Status
Credits

💼 Business Understanding

In iron ore flotation processing, silica is an unwanted impurity. When silica concentration exceeds 4% in the final concentrate, product quality fails industry standards, customers impose penalty fees or reject shipments, and revenue is lost while reprocessing costs increase.

By the time quality degradation is detected through lab analysis, it is already too late to intervene. This project solves that problem.

A binary classification model trained on 737,453 hourly sensor readings predicts quality failures before they occur — giving operators time to adjust process parameters and prevent off-spec production.

Operators receive a three-tier alert system:

Alert	Meaning
🟢 GREEN	Normal operation — no action needed
🟡 AMBER	Early warning — monitor closely
🔴 RED	Intervention required + specific recommended actions

📸 Screenshots

🛠️ Technologies

Languages & Environment

Data & EDA

Machine Learning

Deployment

Category	Tools
Data & EDA	Python, Pandas, NumPy, Matplotlib, Seaborn
Machine Learning	XGBoost, TensorFlow/Keras, Scikit-learn
Class Imbalance	Imbalanced-learn (SMOTE)
Deployment	Streamlit Cloud, Plotly, Joblib

⚙️ Setup

# Clone the repository
git clone https://github.com/your-username/mining-quality-intelligence.git
cd mining-quality-intelligence

# Install dependencies
pip install -r requirements.txt

# Run the Streamlit dashboard locally
streamlit run app.py

Dataset: Quality Prediction in a Mining Process — available on Kaggle.

🔍 Approach

Phase 1 — Data Loading

The dataset comprises 737,453 rows × 24 sensor columns sourced from a real iron ore flotation plant, with zero missing values confirmed across all features.

Phase 2 — Exploratory Data Analysis

Twelve new derived features were engineered, followed by univariate, correlation, and time series analysis. Quality thresholds were defined as: Premium < 2%, Good < 3%, Acceptable < 4%, Poor ≥ 4%. Analysis revealed weekly operational cycles and shift-change patterns in the data.

Phase 3 — Machine Learning

Data was preprocessed using StandardScaler with SMOTE to address class imbalance, on an 80/20 stratified split. Two models were trained and compared:

Model 1 — XGBoost: 300 estimators, max_depth=6, learning rate=0.05
Model 2 — Neural Network: Architecture 128→64→32→1 with BatchNorm and Dropout layers

The primary evaluation metric was F1-Score, with threshold tuning applied to maximise net financial benefit for the business.

Phase 4 — Deployment & Monitoring

A real-time MiningQualityScorer pipeline class was built and deployed via a Streamlit dashboard with four interactive pages:

Page	Description
🏠 Live Scoring	Enter sensor readings → instant alert + recommended actions
📈 Historical Trends	168-hour probability trend + alert distribution
🔍 Feature Inspector	Feature vs failure probability analysis
📊 Drift Monitor	PSI heatmap — flags when model needs retraining

PSI drift monitoring with automated retraining triggers ensures the model remains accurate as plant conditions evolve over time.

📌 Status

The full pipeline is complete and deployed. Future iterations may include:

Multi-plant generalisation across different flotation configurations
Integration with SCADA/DCS systems for direct operator alerts
Expanded drift monitoring with automated model versioning

🙏 Credits

Developed by Lindiwe Songelwa — Data Scientist | Developer | Insight Creator

Platform	Link
💼 LinkedIn	Lindiwe S.
🌐 Portfolio	Creative Portfolio
🏅 Credly	Lindiwe Songelwa – Badges
🚀 Live App	Streamlit Dashboard
📧 Email	sl.songelwa@hotmail.co.za

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
Mining Quality Screenshots		Mining Quality Screenshots
Mining-Quality-Dashboard		Mining-Quality-Dashboard
data		data
model		model
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
scored_history.csv		scored_history.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

⛏️ Mining Quality Intelligence — Predictive Quality Control System

🔗 Demo Link

📋 Table of Contents

💼 Business Understanding

📸 Screenshots

🛠️ Technologies

⚙️ Setup

🔍 Approach

Phase 1 — Data Loading

Phase 2 — Exploratory Data Analysis

Phase 3 — Machine Learning

Phase 4 — Deployment & Monitoring

📌 Status

🙏 Credits

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

⛏️ Mining Quality Intelligence — Predictive Quality Control System

🔗 Demo Link

📋 Table of Contents

💼 Business Understanding

📸 Screenshots

🛠️ Technologies

⚙️ Setup

🔍 Approach

Phase 1 — Data Loading

Phase 2 — Exploratory Data Analysis

Phase 3 — Machine Learning

Phase 4 — Deployment & Monitoring

📌 Status

🙏 Credits

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages