Machine learning pipelines for training, evaluating, and serializing predictive models for diabetic complications.
Repository · Report Bug · Research Paper
dabetai is a comprehensive preventive ecosystem for diabetes that predicts complications like retinopathy, nephropathy, neuropathy, and diabetic foot before they become irreversible.
This repository contains the AI Models — complete pipelines for training, evaluating, and serializing machine learning models focused on predicting:
- Diabetic retinopathy
- Diabetic nephropathy
- Diabetic neuropathy
- Diabetic foot
The models are based on clinical and biometric data from the IOBP2 study and are optimized with advanced techniques such as class balancing, hyperparameter tuning, and cross-validation.
| Component | Repository | Stack |
|---|---|---|
| Mobile App | dabetai-org/mobile-app | React Native 0.79, Expo 53, Tailwind CSS |
| Web Portal | dabetai-org/web-app | Angular 19, Tailwind CSS |
| Core API | dabetai-org/api | NestJS 11, PostgreSQL, Prisma |
| AI Inference API | dabetai-org/ai-api | FastAPI, Python 3.11, MongoDB |
| AI Models (this) | dabetai-org/ai-models | Python, scikit-learn, XGBoost, PyTorch |
| Landing | dabetai-org/landing | Astro, Tailwind CSS |
- Modular Dataset Preparation — Automated per-complication data pipelines
- Multi-Algorithm Experimentation — Logistic Regression, Random Forest, LightGBM, XGBoost, SVM, AdaBoost
- Hyperparameter Optimization — Grid Search for optimal model configuration
- Model Serialization — Export trained models for production deployment
- Automatic Reporting — ROC curves, confusion matrices, feature importance visualizations
- Python 3.11+
- pip
git clone https://github.com/dabetai-org/ai-models.git
cd ai-models
pip install -r requirements.txtai-models/
├── scripts/
│ ├── 01_prepare_datasets.py
│ ├── 02_run_experiments.py
│ └── 03_finalize_model.py
├── data/
│ ├── raw/
│ └── processed/
├── models/
├── reports/
│ └── figures/
└── requirements.txt
python scripts/01_prepare_datasets.pypython scripts/02_run_experiments.pypython scripts/03_finalize_model.pyData is based on the IOBP2 (In Control) study. Place files in data/raw/datatables/. See CITATION.md for attribution and responsible use.
Please read CONTRIBUTING.md for branch naming, commit conventions, and PR workflow.
This project is licensed under the GNU General Public License v3.0 — see the LICENSE file for details.
Authors:
- Cardenas Cabal Fermín
- Ortiz Pérez Alejandro — alex03ortizperez@gmail.com
- Serrano Puertos Jorge Christian — christian.serrano.puertos@gmail.com
Advisors:
- Guarneros Nolasco Luis Rolando
- Cruz Ramos Nancy Aracely
Academic Support:
- Universidad Tecnológica del Centro de Veracruz