🎓 Student Dropout Prediction & Analysis Dashboard

An interactive web application built with Streamlit to predict student dropout, analyze contributing factors, and provide actionable insights using machine learning and model explainability techniques.

🌟 Key Features

This dashboard provides a complete, end-to-end workflow for student dropout analysis:

📊 Data Overview

Get a quick summary of the dataset, including data quality checks, student demographics, academic performance, and key risk factors.

📈 Exploratory Data Analysis (EDA)

Interactively explore feature distributions, correlations, and their relationship with student outcomes (Dropout, Graduate, Enrolled).
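
As a rough illustration of the kind of summary this view produces, the sketch below computes outcome shares, a dropout rate per group, and a correlation matrix with pandas. The column names (`scholarship_holder`, `admission_grade`, `units_approved`) and the eight rows are made up for the example; only the `Target` column and its three values come from this README.

```python
import pandas as pd

# Tiny hand-written frame; every column except Target is a hypothetical name.
df = pd.DataFrame({
    "scholarship_holder": [0, 0, 0, 0, 1, 1, 1, 1],
    "admission_grade": [110, 120, 130, 150, 160, 155, 115, 140],
    "units_approved": [1, 2, 4, 6, 7, 6, 2, 5],
    "Target": ["Dropout", "Dropout", "Enrolled", "Graduate",
               "Graduate", "Graduate", "Dropout", "Enrolled"],
})

# Share of each outcome in the whole dataset.
outcome_share = df["Target"].value_counts(normalize=True)

# Dropout rate within each group of a categorical feature.
dropout_rate = df["Target"].eq("Dropout").groupby(df["scholarship_holder"]).mean()

# Pairwise Pearson correlation between numeric features.
corr = df[["admission_grade", "units_approved"]].corr()

print(outcome_share)
print(dropout_rate)
print(corr)
```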

🤖 Model Training

Train a Random Forest Classifier with a single click to predict student outcomes.
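
For readers curious what the training step might look like in code, here is a minimal sketch with scikit-learn. It uses synthetic data from `make_classification` as a stand-in for the student dataset, and the split ratio and hyperparameters are assumptions, not necessarily what the app uses.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the student dataset (assumption): 3 outcome classes.
X, y_idx = make_classification(
    n_samples=600, n_features=8, n_informative=5, n_redundant=0,
    n_classes=3, random_state=42,
)
labels = np.array(["Dropout", "Enrolled", "Graduate"])
y = labels[y_idx]

# Stratify so each outcome keeps its share in both splits.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

# Hyperparameters are illustrative, not taken from the app.
model = RandomForestClassifier(n_estimators=200, random_state=42)
model.fit(X_train, y_train)
print(model.classes_)
```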

✅ Model Evaluation

Assess model performance using accuracy, classification reports, and an interactive confusion matrix.
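
The three metrics named above map directly onto functions in `sklearn.metrics`. The sketch below shows one way to compute them on a held-out split; the data is synthetic and the class order (0 = Dropout, 1 = Enrolled, 2 = Graduate) is an assumption made for the example.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
from sklearn.model_selection import train_test_split

# Synthetic 3-class data standing in for the student dataset (assumption).
X, y = make_classification(
    n_samples=600, n_features=8, n_informative=5, n_redundant=0,
    n_classes=3, random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)
model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)
y_pred = model.predict(X_test)

class_ids = [0, 1, 2]
class_names = ["Dropout", "Enrolled", "Graduate"]  # assumed label order

accuracy = accuracy_score(y_test, y_pred)
# Rows are true classes, columns are predicted classes.
cm = confusion_matrix(y_test, y_pred, labels=class_ids)
report = classification_report(
    y_test, y_pred, labels=class_ids, target_names=class_names
)

print(f"Accuracy: {accuracy:.3f}")
print(cm)
print(report)
```

The diagonal of the confusion matrix counts correct predictions, so its sum divided by the total equals the accuracy.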

🧠 Model Explainability (XAI)

Global Explanations: Understand the most important features driving predictions across the entire dataset using SHAP, Permutation Importance, and built-in feature importance.
Local Explanations: Dive deep into why the model made a specific prediction for an individual student using SHAP Waterfall Plots and LIME.
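
Of the global methods listed, permutation importance is the simplest to reproduce with scikit-learn alone. The sketch below illustrates that one technique only (not SHAP or LIME) on synthetic data; the feature names are placeholders.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

# Synthetic stand-in data (assumption). With shuffle=False the first three
# columns are the informative ones and the last three are pure noise.
X, y = make_classification(
    n_samples=600, n_features=6, n_informative=3, n_redundant=0,
    n_classes=3, shuffle=False, random_state=0,
)
feature_names = [f"feature_{i}" for i in range(X.shape[1])]  # placeholder names

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)
model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)

# Shuffle each column of the held-out set in turn and record the score drop.
result = permutation_importance(model, X_test, y_test, n_repeats=5, random_state=0)

# Sort features from most to least important.
ranking = result.importances_mean.argsort()[::-1]
for i in ranking:
    mean = result.importances_mean[i]
    std = result.importances_std[i]
    print(f"{feature_names[i]}: {mean:.3f} +/- {std:.3f}")
```

Computing the importances on held-out data rather than the training set keeps them from rewarding features the model merely memorized.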

🔮 Individual Prediction Tool

Input a student's data using interactive sliders and dropdowns
Receive an instant prediction of the student's likely outcome (Dropout, Graduate, or Enrolled)
Get a detailed explanation of the factors that influenced the prediction, along with actionable recommendations

🔧 Interactive Feature Analysis

Explore how changing a single feature's value impacts the model's prediction probabilities
Use the interactive feature explorer to view detailed statistics and dropout rates for any column
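
One plausible way to implement the single-feature "what if" view is to copy a student's row several times, overwrite one column with a grid of values, and compare the predicted probabilities. This is a sketch on synthetic data; the grid size and the choice of feature are arbitrary.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Synthetic 3-class data standing in for the student dataset (assumption).
X, y = make_classification(
    n_samples=500, n_features=6, n_informative=4, n_redundant=0,
    n_classes=3, random_state=1,
)
model = RandomForestClassifier(n_estimators=100, random_state=1).fit(X, y)

student = X[0].copy()   # the profile being explored
feature_idx = 0         # column to vary (arbitrary choice)

# Five evenly spaced values across the observed range of that feature.
grid = np.linspace(X[:, feature_idx].min(), X[:, feature_idx].max(), 5)

# Repeat the student row and overwrite one column with each grid value.
variants = np.tile(student, (len(grid), 1))
variants[:, feature_idx] = grid

# One row of class probabilities per grid value.
probs = model.predict_proba(variants)
for value, row in zip(grid, probs):
    print(f"{value:8.2f}", dict(zip(model.classes_.tolist(), row.round(2).tolist())))
```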

📂 Custom Data Upload

Upload your own student dataset in CSV format to use the dashboard's full capabilities.

📸 Screenshots

Data Overview & EDA	Model Explainability (SHAP)	Individual Prediction with Explanation

Replace the image links above with actual screenshots of your running application.

🛠️ Technology Stack

Framework: Streamlit
Data Manipulation: Pandas, NumPy
Machine Learning: Scikit-learn
Data Visualization: Matplotlib, Seaborn, Plotly
Model Explainability: SHAP, LIME

🚀 Getting Started

Follow these instructions to set up and run the project locally.

Prerequisites

Python 3.9 or higher
pip package manager

Installation

Clone the repository:

git clone https://github.com/your-username/student-dropout-prediction.git
cd student-dropout-prediction

Create and activate a virtual environment (recommended):

On Windows:

python -m venv .venv
.\.venv\Scripts\activate

On macOS/Linux:

python3 -m venv .venv
source .venv/bin/activate

Install the required dependencies:
```
pip install -r requirements.txt
```

Running the Application

Run the Streamlit app:
```
streamlit run main.py
```
(main.py is the entry script in this repository; replace it with the actual name of your Python script if it's different.)
Open your web browser and navigate to http://localhost:8501. The application should now be running.

📋 How to Use the Dashboard

The dashboard is organized into four main modules accessible from the sidebar navigation:

1. Data Overview

Start here to get a high-level understanding of your dataset.

2. Exploratory Data Analysis

Dive deeper into the data. Use the interactive charts to uncover trends and relationships between different student attributes and their final outcomes.

3. Model Training & Evaluation

Click the "Start Training" button to build the prediction model
Once trained, view the model's performance metrics and feature importance charts
Explore the "Model Explainability" tab to understand how the model works on a global and local level

4. Dropout Prediction

Navigate to this section to use the interactive prediction tool
Adjust the sliders and inputs to match a student's profile
Click "Predict with Explanation" to see the predicted outcome and the key factors that led to that decision
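
Behind a button like this, the prediction step could reduce to building a one-row DataFrame from the form inputs and asking the model for class probabilities. The sketch below assumes three hypothetical input columns and a toy labelling rule invented purely to have something to train on; it does not reproduce the app's explanation or recommendation logic.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
n = 300

# Hypothetical feature columns; the real dataset's columns may differ.
train = pd.DataFrame({
    "age_at_enrollment": rng.integers(17, 45, n),
    "admission_grade": rng.uniform(95, 190, n),
    "units_approved": rng.integers(0, 9, n),
})

# Toy labelling rule, invented for this sketch only.
units = train["units_approved"].to_numpy()
target = np.where(units <= 2, "Dropout", np.where(units >= 6, "Graduate", "Enrolled"))

model = RandomForestClassifier(n_estimators=100, random_state=0).fit(train, target)

# Values a user might enter through the sliders and inputs.
student = pd.DataFrame([{
    "age_at_enrollment": 19,
    "admission_grade": 130.0,
    "units_approved": 1,
}])

prediction = model.predict(student)[0]
probabilities = pd.Series(model.predict_proba(student)[0], index=model.classes_)

print("Predicted outcome:", prediction)
print(probabilities.sort_values(ascending=False))
```

Keeping the input as a DataFrame with the same column names used in training avoids silently mixing up feature order.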

💾 Data

The application comes pre-loaded with a sample dataset (student_dropout_data.csv) from the UCI Machine Learning Repository. This dataset contains various demographic, socio-economic, and academic features for students.

You can also upload your own CSV file using the file uploader in the sidebar. Ensure your dataset has a Target column with values like 'Dropout', 'Graduate', and 'Enrolled' for full functionality.
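
Before uploading, it can help to check a file against the requirement above. The helper below is a hypothetical pre-flight check, not part of the app: `validate_student_csv` is a name invented here, and treating the three outcome labels as the only allowed values is a stricter assumption than the "values like" wording implies. It also assumes a comma-separated file.

```python
import io

import pandas as pd

# Outcome labels named in this README; treated as the full allowed set (assumption).
EXPECTED_OUTCOMES = {"Dropout", "Graduate", "Enrolled"}


def validate_student_csv(source):
    """Load a CSV and check that it has a usable Target column."""
    df = pd.read_csv(source)
    if "Target" not in df.columns:
        raise ValueError("Missing required 'Target' column")
    unexpected = set(df["Target"].dropna().unique()) - EXPECTED_OUTCOMES
    if unexpected:
        raise ValueError(f"Unexpected Target values: {sorted(unexpected)}")
    return df


# In-memory sample; a file path or an uploaded file object works the same way.
sample = io.StringIO(
    "age,grade,Target\n"
    "19,130.5,Dropout\n"
    "22,150.0,Graduate\n"
)
df = validate_student_csv(sample)
print(df["Target"].value_counts())
```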

🤝 Contributing

Contributions are welcome! If you have suggestions for improvements or find any issues, please feel free to:

Fork the repository
Create a new feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add some AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

📜 License

This project is licensed under the MIT License - see the LICENSE file for details.

📦 Requirements

The requirements.txt file in the repository lists the following packages (create it with this content if it is missing):

streamlit
pandas
numpy
matplotlib
seaborn
scikit-learn
shap
lime
plotly

🙏 Acknowledgments

UCI Machine Learning Repository for the student dataset
Streamlit community for the excellent framework
SHAP and LIME libraries for model explainability
