Customer Personality Segmentation

Problem statement

In this data science project, you will build a machine learning system which will be able predict the personality of the customer using machine learning algorithms. This project will be very usefull for malls, various stores and companies which are product based. Based on customer's personal details and purchase details, we can cluster them and we can predict the customer's cluster number using classification techniques.

Solution Proposed

Now the question is how to dynamically predict the cluster of the customer ?. One of the approaches which we can use of machine learning approach, where we can cluster the customer based on the details we have and predict the cluster type based on the domain knowledge and leverage previous customer data to predict the cluster.

Dataset used

Dataset Link

Tech Stack Used

Python
FastAPI
Machine learning algorithms
Docker
MongoDB

Infrastructure required

AWS S3
Azure
Github Actions

How to run

Before you run this project make sure you have MongoDB Atlas account and you have the shipping dataset into it.

Step 1. Cloning the repository.


git clone https://github.com/Machine-Learning-01/Customer_segmentation.git

Step 2. Create a conda environment.


conda create --prefix venv python=3.7 -y


conda activate venv/

Step 3. Install the requirements


pip install -r requirements.txt

Step 4. Export the environment variable

export AWS_ACCESS_KEY_ID=<AWS_ACCESS_KEY_ID>


export AWS_SECRET_ACCESS_KEY=<AWS_SECRET_ACCESS_KEY>


export AWS_DEFAULT_REGION=<AWS_DEFAULT_REGION>


export MONGODB_URL= <MONGODB_URL>

Step 5. Run the application server


python app.py

Step 6. Train application

http://localhost:5000/train

Step 7. Prediction application

http://localhost:5000/predict

Run locally

Check if the Dockerfile is available in the project directory
Build the Docker image


docker build --build-arg AWS_ACCESS_KEY_ID=<AWS_ACCESS_KEY_ID> --build-arg AWS_SECRET_ACCESS_KEY=<AWS_SECRET_ACCESS_KEY> --build-arg AWS_DEFAULT_REGION=<AWS_DEFAULT_REGION> --build-arg MONGODB_URL=<MONGODB_URL> .

Run the Docker image


docker run -d -p 5000:5000 <IMAGE_NAME>

Project Architecture -

Data Collection Architecture -

Deployment Architecture -

Models Used

From these above models after hyperparameter optimization we selected these two models which were K-Means for clustering and Logistic Regression for classification and used the following in Pipeline.

GridSearchCV is used for Hyperparameter Optimization in the pipeline.

`src` is the main package folder which contains

Components : Contains all components of Machine Learning Project

Data Ingestion
Data Validation
Data Transformation
Data Clustering
Model Trainer
Model Evaluation
Model Pusher

Custom Logger and Exceptions are used in the Project for better debugging purposes.

Conclusion

This Project can be used in real-life by Users.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.github/workflows		.github/workflows
assignment		assignment
config		config
docs		docs
flowchart		flowchart
notebooks		notebooks
scripts		scripts
src		src
static/css		static/css
templates		templates
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Customer Personality Segmentation

Problem statement

Solution Proposed

Tech Stack Used

Infrastructure required

How to run

Run locally

Project Architecture -

Data Collection Architecture -

Deployment Architecture -

Models Used

`src` is the main package folder which contains

Conclusion

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Customer Personality Segmentation

Problem statement

Solution Proposed

Tech Stack Used

Infrastructure required

How to run

Run locally

Project Architecture -

Data Collection Architecture -

Deployment Architecture -

Models Used

src is the main package folder which contains

Conclusion

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`src` is the main package folder which contains

Packages