Repository files navigation Lasso Regression with UV + Notebooks
Install dependencies and create the environment:
Open Jupyter in the project environment:
notebooks/preprocess_train.ipynb
notebooks/predict_evaluate.ipynb
CV diagnostics (Lasso alpha error path + learning curve) to check overfitting risk.
Regression plots (actual vs predicted, residuals vs predicted).
Feature importance plot from Lasso coefficients.
Optional thresholded classification metrics derived from regression outputs.
Calibration-style threshold sweep for threshold selection.
Main configuration lives in configs/project.yaml.
Machine-generated outputs are written to artifacts/*.json and artifacts/*.csv.
artifacts/model.joblib — trained pipeline
artifacts/splits.json — train/test split indices
artifacts/training_summary.json — selected alpha and CV summary
artifacts/predictions.csv — test predictions
artifacts/metrics.json — RMSE/MAE/R² on test set
artifacts/classification_metrics.json — thresholded classification metrics
Latest experiment results
Regression metrics (test set)
Metric
Value
RMSE
52.9196
MAE
42.7930
R²
0.4714
Thresholded classification metrics (test set)
Threshold used: 140.5
Metric
Value
Accuracy
0.7753
Precision
0.7083
Recall
0.8500
F1
0.7727
ROC-AUC
0.8423
Overfitting and CV checks
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
You can’t perform that action at this time.