Churn Prediction Model

In this project, I used LightGBM (Light Gradient Boosting Machine) to predict customer churn. LightGBM was chosen because it trains efficiently on large datasets and handles missing values natively, which makes it a good fit for churn prediction.

Task Description

Type of Classification Task

This project focuses on a binary classification task, where the objective is to predict whether a client has churned (left the service) or not. The two possible outcomes are:

Not churned clients (class 0)
Churned clients (class 1)

Distribution of classes:

Churned clients: 6-7%
Not churned clients: 93-94%
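
The shares above can be checked against the class counts in the training classification report in the Results section (140413 not churned, 9585 churned). The snippet below is a minimal sketch rather than code from the repository; `class_shares` is a hypothetical helper introduced only for illustration. It also prints the negative-to-positive ratio, which is the kind of value commonly passed as `scale_pos_weight` when the classes are reweighted by hand instead of through `is_unbalance`.

```python
def class_shares(n_negative: int, n_positive: int) -> tuple[float, float, float]:
    """Return (negative share, positive share, negative-to-positive ratio)."""
    total = n_negative + n_positive
    return n_negative / total, n_positive / total, n_negative / n_positive


# Support counts taken from the training classification report.
neg_share, pos_share, neg_pos_ratio = class_shares(140413, 9585)

print(f"not churned: {neg_share:.1%}, churned: {pos_share:.1%}")
print(f"negative-to-positive ratio: {neg_pos_ratio:.1f}")
```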

Navigation

pipeline.ipynb - The main notebook that encompasses the entire training cycle, from data preparation to results analysis.
transform - Directory containing all scripts for data preparation.
research - Directory containing all notebooks and scripts used for experiments, exploratory data analysis (EDA), hyperparameter tuning, etc.
models - Directory containing stored models.
data - Directory containing the initial raw data. (HIDDEN)
cache - Directory containing cached transformed data for faster access and reuse. (HIDDEN)

Model Parameters:

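{
    "random_state": 42,
    "seed": 42,  # alias of random_state in LightGBM; either one alone is enough
    "objective": "binary",
    "metric": "auc",
    "verbosity": -1,
    "boosting_type": "gbdt",
    "feature_pre_filter": False,
    "n_jobs": -1,
    "lambda_l1": 8,
    "lambda_l2": 5,
    "learning_rate": 0.018,
    "num_leaves": 14,
    "feature_fraction": 0.6803603979260223,
    "bagging_fraction": 0.6735621254996546,  # takes effect only when bagging_freq > 0 (not set here)
    "max_depth": 11,
    "min_child_samples": 30,
    "n_estimators": 350,
    "drop_rate": 0.2,  # used only by boosting_type "dart"; ignored with "gbdt"
    "is_unbalance": True,  # reweights the classes to offset the low churn rate
}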

Results

Train metrics

AUC: 0.912
Classification Report:
              precision    recall  f1-score   support

         0.0       0.99      0.84      0.91    140413
         1.0       0.26      0.83      0.40      9585

    accuracy                           0.84    149998
   macro avg       0.62      0.84      0.65    149998
weighted avg       0.94      0.84      0.87    149998

Test metrics

AUC: 0.898
Classification Report:
              precision    recall  f1-score   support

         0.0       0.98      0.84      0.91    140597
         1.0       0.25      0.81      0.38      9403

    accuracy                           0.84    150000
   macro avg       0.62      0.82      0.64    150000
weighted avg       0.94      0.84      0.87    150000
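
The test report can be sanity-checked by rebuilding an approximate confusion matrix from the per-class recall and support, then recomputing precision and accuracy. This is a worked check rather than code from the repository: `approx_confusion` is a hypothetical helper, and because the reported recalls are rounded to two decimals the counts are only approximate.

```python
def approx_confusion(recall_neg: float, support_neg: int,
                     recall_pos: float, support_pos: int) -> tuple[int, int, int, int]:
    """Return approximate (tn, fp, fn, tp) from per-class recall and support."""
    tn = round(recall_neg * support_neg)  # negatives correctly kept as class 0
    tp = round(recall_pos * support_pos)  # churners correctly flagged as class 1
    fp = support_neg - tn
    fn = support_pos - tp
    return tn, fp, fn, tp


# Recall and support values from the test classification report.
tn, fp, fn, tp = approx_confusion(0.84, 140597, 0.81, 9403)

precision_pos = tp / (tp + fp)
accuracy = (tp + tn) / (tn + fp + fn + tp)

print(f"approximate confusion matrix: tn={tn}, fp={fp}, fn={fn}, tp={tp}")
print(f"class 1 precision: {precision_pos:.2f}, accuracy: {accuracy:.2f}")
```

Read this way, the reported precision of 0.25 means roughly three of every four clients flagged as churners did not churn, while the recall of 0.81 means about four in five actual churners were caught.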
