PREDICTING CHURN


Chaitanya Sagar
Sinna Muthiah
Jagadish Singh Naik
Chaitanya Sagar
Sinna Muthiah
Jagadish Singh Naik
Customers
2x
Year 3
• 2% improvement in Churn can lead to 10% savings in costs
• Cost of acquisition of customers is 6-7 times higher than that of retaining customers
Source Matt Dancho
http://bit.ly/2uO723P
Logistic Regression Random Forest LightGBM
XGBoost
CatBoost
ANNs
Scikit Learn
Numpy
Pandas
Matplotlib/Seaborn
Keras
A Cloud-based music streaming service provider, with a ‘freemium’ pricing strategy
10 Million+ Users, 50 Million Songs, 1600 Artists
▪ Personalized Music Recommendation
▪ Ability to listen with friends and chat
▪ Streaming Live interviews and concerts
▪ Lyric Screening
Data about 1 Mn. customers (approx. 32 GB)
It spans across 12 years & has 30 attributes in total New registrations by time (sampled data):
Customer ID, Joining Date, Age, Gender, City etc.
Is Transaction Auto Renewable, Is Transaction Cancelled, Validity of Subscription, Subscription Price, Actual
Amount Paid, Discounts etc.
Time spent, No. of unique songs heard, No. of songs heard, %length of songs heard,, Length of Playlist etc.
Why is ‘feature engineering’ important?
Methodology: (Motivated by insights from EDA)
• Contrasting abilities of ‘Transactional’ & ‘User-Log’ data patterns to capture churn -> We’ll use separate feature sets & ML models
• Dynamic customer behavior with time -> We’ll use temporal features
• Helps us capture customer behavior patterns with time
• Amplifies the distinction among churn & non churn customers
Reasons to choose Ensemble approach?
• To bring together all the distinct patterns to predict churn
• Data Imbalance
Feature Name
Is Cancel ratio month 5- week 8
Is Cancel ratio week 8 - week 4
Is Cancel ratio week 4 - week 1
Is Cancel ratio month 1
Is cancel ration entire history
Auto renew ratio m5 - w8
Auto renew ratio w8 – w4
Auto renew ratio w4 – w1
Auto renew ratio w1
Auto renew ratio m1
Auto renew ratio entire history
Pay ratio
Length of first transaction
whether latest transaction canceled
User’s payment status in last month
Does last transaction has auto renew
Payment method of last transaction
Length of last transaction
Min days bw trans & cutoff dates m5-w8
Min days bw trans & cutoff dates w8-w4
Min days bw trans & cutoff dates w4-w1
Insight: A user is more likely to churn if she has more cancelled transactions.
Action: We can guide users to suitable, appropriate subscription plans during onboarding to reduce cancelations
Insight: A user is less likely to churn is she has more transaction frequency or opted for Auto-renew Option
Action: Encourage users to select auto renew option with some plan benefit and improve transactional experience by making transaction process hassle free.