Credit Card Fraud Detection using Machine Learning

📌 Overview

This project focuses on detecting fraudulent credit card transactions in a highly imbalanced dataset using machine learning techniques. The goal is not just high accuracy, but effective fraud detection while minimizing false positives in a real-world alert system.

📂 Dataset

Source: Kaggle – Credit Card Fraud Detection Dataset
Transactions: 284,807
Fraud Rate: ~0.17%
Features: PCA-transformed features (V1–V28) + Amount
Target: Class (1 = Fraud, 0 = Legitimate)

The dataset is not included in this repository due to size and licensing constraints.
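Once the file is downloaded, a quick check of the class balance confirms the figures above. The sketch below is a minimal example, not the notebook's code: the helper name `class_summary` and the local path `creditcard.csv` are assumptions, and the toy frame only stands in for the real data.

```python
import pandas as pd


def class_summary(df, target="Class"):
    """Return the row count, fraud count, and fraud rate for a labelled frame."""
    counts = df[target].value_counts()
    return {
        "n_rows": int(len(df)),
        "n_fraud": int(counts.get(1, 0)),
        "fraud_rate": float(df[target].mean()),
    }


# Toy stand-in with the same column layout (NOT real transactions).
toy = pd.DataFrame(
    {
        "V1": [0.1, -1.2, 0.4, 2.3],
        "Amount": [12.5, 3.0, 99.9, 1.0],
        "Class": [0, 0, 0, 1],
    }
)
summary = class_summary(toy)
print(summary)

# With the real file (the path is an assumption about where it was saved):
# df = pd.read_csv("creditcard.csv")
# print(class_summary(df))
```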

⚙️ Tech Stack

Python
NumPy, Pandas
Scikit-learn
XGBoost
Matplotlib, Seaborn

🔄 Project Workflow

Data loading and inspection
Train–test split with stratification
Handling class imbalance using SMOTE
Baseline modeling with Random Forest
Main modeling using XGBoost
Model evaluation using ROC-AUC
Precision–Recall analysis
Threshold tuning to reduce false positives
Feature importance analysis
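The first steps of this workflow can be sketched end to end on synthetic data. This is an illustration under stated assumptions, not the notebook's implementation: the data comes from `make_classification` rather than the Kaggle file, `smote_like_oversample` is a simplified hand-written version of the SMOTE idea (a library implementation would normally be used instead), and only the Random Forest baseline is shown. Oversampling is applied to the training split only, so the test set keeps its real class balance.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import average_precision_score, roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import NearestNeighbors


def smote_like_oversample(X, y, k=5, seed=0):
    """Simplified SMOTE: add points interpolated between minority neighbours."""
    rng = np.random.default_rng(seed)
    X_min = X[y == 1]
    n_new = int((y == 0).sum() - (y == 1).sum())
    if n_new <= 0 or len(X_min) < 2:
        return X, y
    k = min(k, len(X_min) - 1)
    # Ask for k + 1 neighbours because each point is its own nearest neighbour.
    nn = NearestNeighbors(n_neighbors=k + 1).fit(X_min)
    neighbours = nn.kneighbors(X_min, return_distance=False)[:, 1:]
    base = rng.integers(0, len(X_min), size=n_new)
    picked = neighbours[base, rng.integers(0, k, size=n_new)]
    gap = rng.random((n_new, 1))  # position along the segment base -> neighbour
    X_new = X_min[base] + gap * (X_min[picked] - X_min[base])
    y_new = np.ones(n_new, dtype=y.dtype)
    return np.vstack([X, X_new]), np.concatenate([y, y_new])


# Synthetic, imbalanced stand-in data (NOT the Kaggle dataset).
X, y = make_classification(
    n_samples=4000, n_features=10, weights=[0.98], random_state=0
)

# Stratified split keeps the fraud rate similar in both parts.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Oversample the training split only; the test split stays untouched.
X_res, y_res = smote_like_oversample(X_train, y_train)

# Baseline model; the main XGBoost model is not shown here.
model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_res, y_res)

scores = model.predict_proba(X_test)[:, 1]
roc_auc = roc_auc_score(y_test, scores)
pr_auc = average_precision_score(y_test, scores)
print(f"ROC-AUC: {roc_auc:.3f}  average precision: {pr_auc:.3f}")
```

The scores printed here describe the synthetic data only and say nothing about the project's reported results. Swapping in the main model should be a small change, since `xgboost.XGBClassifier` follows the same `fit` / `predict_proba` interface.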

📊 Results

ROC-AUC: ~0.95
Key Outcome: ~32% reduction in false positives via threshold tuning
Focus: Alert optimization rather than raw accuracy
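To make the threshold-tuning idea concrete, here is a minimal sketch of one possible rule: raise the decision threshold as far as possible while keeping recall at or above a chosen floor. The rule, the helper names, and the toy scores are assumptions for illustration; the notebook's actual tuning procedure is not described in this README, and the toy numbers are unrelated to the ~32% figure above.

```python
import numpy as np


def confusion_counts(y_true, scores, threshold):
    """Return (true positives, false positives, false negatives)."""
    pred = scores >= threshold
    tp = int(np.sum(pred & (y_true == 1)))
    fp = int(np.sum(pred & (y_true == 0)))
    fn = int(np.sum(~pred & (y_true == 1)))
    return tp, fp, fn


def tune_threshold(y_true, scores, min_recall):
    """Return the largest observed score that still gives recall >= min_recall."""
    n_pos = int(np.sum(y_true == 1))
    if n_pos == 0:
        raise ValueError("need at least one positive example")
    best = 0.0
    for t in np.unique(scores):
        tp, _, _ = confusion_counts(y_true, scores, t)
        if tp / n_pos >= min_recall:
            best = max(best, float(t))
    return best


# Toy labels and model scores (hypothetical, for illustration only).
y_true = np.array([0, 0, 0, 0, 0, 0, 0, 1, 1, 1])
scores = np.array([0.05, 0.10, 0.20, 0.30, 0.55, 0.60, 0.65, 0.62, 0.80, 0.90])

default_counts = confusion_counts(y_true, scores, 0.5)
tuned = tune_threshold(y_true, scores, min_recall=1.0)
tuned_counts = confusion_counts(y_true, scores, tuned)

fp_reduction = (default_counts[1] - tuned_counts[1]) / default_counts[1]
print(f"threshold 0.50 -> {tuned:.2f}, false positives "
      f"{default_counts[1]} -> {tuned_counts[1]} ({fp_reduction:.0%} fewer)")
```

Because false positives can only fall as the threshold rises, the largest threshold that satisfies the recall floor also gives the fewest false alerts under that constraint. The recall floor itself has to come from the business side.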

💡 Key Learnings

Accuracy is misleading for highly imbalanced datasets
Threshold selection is a business decision, not a model default
In this project, XGBoost scaled better than Random Forest on the large, imbalanced dataset
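The first point can be checked with simple arithmetic. Using the approximate figures from the Dataset section (so the fraud count below is an estimate, not the exact count in the file), a model that labels every transaction as legitimate still scores very high accuracy while catching no fraud at all:

```python
n_transactions = 284_807
fraud_rate = 0.0017  # approximate rate from the Dataset section

# Estimated number of fraud cases implied by the approximate rate.
n_fraud = round(n_transactions * fraud_rate)

# "Always legitimate" classifier: every fraud case is missed.
accuracy = (n_transactions - n_fraud) / n_transactions
recall = 0 / n_fraud

print(f"accuracy: {accuracy:.2%}, recall: {recall:.0%}")
# accuracy: 99.83%, recall: 0%
```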

⚠️ Limitations

No temporal validation (the train–test split is stratified, not time-ordered)
PCA features limit interpretability
SMOTE generates synthetic fraud samples, which may bias the model toward patterns that do not occur in real transactions

🚀 Future Improvements

Cost-sensitive learning
Time-based validation
Concept drift handling
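Two of these ideas can be sketched briefly. The block below is a rough illustration, not planned project code: `time_based_split` trains on earlier transactions and tests on later ones, and `alert_cost` scores predictions with unequal error costs. Both helper names and the cost values (1 per false alert, 50 per missed fraud) are hypothetical.

```python
import numpy as np


def time_based_split(timestamps, test_fraction=0.2):
    """Return (train_idx, test_idx) with all test rows later than train rows."""
    order = np.argsort(timestamps, kind="stable")
    cut = int(round(len(order) * (1 - test_fraction)))
    return order[:cut], order[cut:]


def alert_cost(y_true, y_pred, cost_fp=1.0, cost_fn=50.0):
    """Total cost of errors; the default costs are placeholders, not real figures."""
    fp = np.sum((y_pred == 1) & (y_true == 0))
    fn = np.sum((y_pred == 0) & (y_true == 1))
    return float(fp * cost_fp + fn * cost_fn)


# Toy timestamps in arbitrary order (hypothetical values).
timestamps = np.array([50, 10, 40, 20, 30])
train_idx, test_idx = time_based_split(timestamps, test_fraction=0.2)
print("train:", train_idx, "test:", test_idx)

# One false alert and one missed fraud under the placeholder costs.
y_true = np.array([0, 0, 1, 1])
y_pred = np.array([1, 0, 0, 1])
cost = alert_cost(y_true, y_pred)
print("cost:", cost)
```

A cost function like this could also replace the recall floor when choosing a threshold, by picking the threshold with the lowest total cost on a validation set.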
