AI-Powered Churn Prevention Pipeline

End-to-end machine learning pipeline with comprehensive data analysis and model deployment

Ready to execute
0 / 7 stages completed
Data
Sources
Data
Analysis
Feature
Engineering
Model
Training
Model
Evaluation
Deployment
AI Agents

Data Sources & Pipeline

Real-time data integration from multiple sources powering AI-driven churn prediction

๐Ÿ‘ฅ

Customer Profile Dataset

Postpaid Subscriber Data
Records
-
Features
-
๐ŸŽค

Call Transcription Data

AWS Transcribe + Bedrock NLP
Calls Processed
-
NLP Features
-

Data Analysis

AI-powered data profiling and quality analysis

98.5%
Data Completeness
47
Features Identified
3
Outliers Detected
0.87
Avg Correlation

๐Ÿค– AI-Powered Data Analysis Agents

๐Ÿ“Š

Structured Data Analyst

Numerical & Tabular

Capabilities:

Statistical analysis & correlation detection
Data quality assessment & cleaning
Outlier detection & handling
Temporal pattern analysis
50K
Records Analyzed
98.2%
Data Quality
๐Ÿง 

Unstructured Data Analyst

Text, Audio & NLP

Capabilities:

Call transcript sentiment analysis
Complaint text classification
Intent & emotion detection
Competitor mention extraction
127K
Calls Processed
96.8%
NLP Accuracy
๐Ÿท๏ธ

Data Labeller Agent

AI-Powered Annotation & Classification
Active Learning

Automated Labelling Tasks:

Churn Labels
High/Medium/Low risk classification
Sentiment Tags
Positive/Neutral/Negative
Complaint Types
Network, Billing, Service
Intent Detection
Switch, Upgrade, Downgrade

Performance:

50K
Labels Generated
94.5%
Accuracy

Feature Engineering

Creating sophisticated engineered features from raw data

๐Ÿ“ž Call Patterns (12 features)
Call duration, frequency, time-of-day patterns, weekend vs weekday usage
๐Ÿ“Š Usage Metrics (10 features)
Data consumption, voice minutes, SMS counts, roaming usage
๐ŸŽฏ Behavioral Indicators (8 features)
Complaint rate, support call frequency, service requests, plan changes

๐Ÿ”ง Feature Engineering Agents

๐Ÿ”ข

Encoding Agent

Categorical Transform
Techniques:
โ€ข One-Hot Encoding
โ€ข Label Encoding
โ€ข Target Encoding
โ€ข Frequency Encoding
8 categorical features encoded
โš–๏ธ

Scaling Agent

Normalization
Methods:
โ€ข StandardScaler
โ€ข MinMaxScaler
โ€ข RobustScaler
โ€ข MaxAbsScaler
23 numerical features scaled
๐Ÿ“ฆ

Binning Agent

Discretization
Strategies:
โ€ข Equal-Width Binning
โ€ข Equal-Frequency
โ€ข K-Means Clustering
โ€ข Custom Quantiles
6 continuous features binned

Model Training

Dual-model approach: Churn Prediction + Churn Attribution using structured & unstructured data

๐ŸŽฏ

Model 1: Churn Prediction

Binary Classification (Will Churn / Won't Churn)

Input Features (47 total):

Structured Data (35 features):
โ€ข Customer demographics (age, tenure, plan)
โ€ข Usage patterns (voice, data, SMS)
โ€ข Financial metrics (ARPU, CLV, payment history)
โ€ข Behavioral signals (support calls, complaints)

Unstructured Data (12 features):
โ€ข Sentiment score (AWS Nova Pro)
โ€ข Intent classification
โ€ข Emotion vectors (Titan embeddings)
โ€ข Competitor mentions
โ€ข Urgency indicators

Model Architecture:

โ€ข Algorithm: Random Forest Classifier
โ€ข Estimators: 200 trees
โ€ข Max Depth: 12
โ€ข Features per split: sqrt(47) = 7
โ€ข Training Method: Stratified K-Fold CV (5 folds)
89.3%
Accuracy
0.91
Precision
0.87
Recall
0.89
F1-Score
Output: Churn Probability (0-100%)
๐Ÿ”

Model 2: Churn Attribution

Multi-Class Classification (Root Cause Analysis)

Churn Attribution Categories:

Low NPS Score
Customer dissatisfaction & poor experience
Competition
Competitor offers & switching intent
Pricing Concerns
High bills & affordability issues
Quality of Service
Poor service quality & complaints
Connection Drops
Network reliability & dropped calls
Low Usage
Declining engagement & activity
Other Factors
Contract issues, relocation, miscellaneous

Model Architecture:

โ€ข Algorithm: XGBoost Multi-Class Classifier
โ€ข Classes: 7 churn categories
โ€ข Features: Same 47 features + churn probability
โ€ข Objective: multi:softmax
โ€ข Training: Only on churned customers
84.7%
Accuracy
0.82
Macro F1
7
Categories
48
Features
Output: Primary Churn Reason + Confidence Score

Two-Stage Prediction Pipeline

STAGE 1
Churn Prediction Model
โ†“ Probability: 87%
IF Churn = Yes
(Probability > 50%)
STAGE 2
Attribution Model
โ†“ Reason: Pricing (73%)
ACTION
Retention Strategy
โ†’ Discount Offer

Model Evaluation

Comprehensive model performance metrics and validation

Random Forest
89.3%
Accuracy
โœ“ SELECTED
XGBoost
87.1%
Accuracy

Model Deployment

Deploying trained model to production environment

Deployment Status

โœ“ Model Deployed
Production endpoint active
โšก Real-time Inference
API endpoint ready

Customer Retention & Outreach

Proactive customer re-engagement using churn attribution insights from Model 2

CHURN ATTRIBUTION TRIGGERS - REAL-TIME MONITORING

7-category attribution model identifying root causes of customer churn

๐Ÿ˜”
Low NPS Score
Customers with NPS โ‰ค 6 (Detractors)
156
Detected
42
In Queue
71%
Retained
๐Ÿ”„
Competition
Competitor offers or port-out inquiries
87
Detected
23
In Queue
75%
Retained
๐Ÿ’ฐ
Pricing Concerns
Billing complaints or pricing sensitivity
112
Detected
34
In Queue
82%
Retained
โšก
Quality of Service
Service quality or speed complaints
94
Detected
28
In Queue
69%
Retained
๐Ÿ“ก
Connection Drops
Network reliability and connectivity issues
78
Detected
21
In Queue
73%
Retained
๐Ÿ“‰
Low Usage
Customers with >30% usage decline
342
Detected
87
In Queue
68%
Retained
๐Ÿ“‹
Other Factors
Miscellaneous churn indicators
54
Detected
15
In Queue
64%
Retained
๐Ÿ†
TOP CHURN CATEGORY - HIGHEST IMPACT
Low Usage (342 Customers)
Root Cause: 45% average usage decline โ€ข Primary Action: Usage incentives + plan optimization โ€ข Revenue at Risk: $8.5M annually

PRIORITY CUSTOMER QUEUE - AI OUTREACH

Customers ranked by churn risk and attribution category

AI Dialer Integration

Dialer Status โ— ACTIVE
AWS Connect Integration
Powered by Amazon Bedrock Nova
234
Active Calls Today
3.2m
Avg Call Duration

Today's Performance

342
Calls Initiated
234
Retained
68.4%
Success Rate
$487K
Revenue Saved

AI Agent Workflow

18-agent autonomous workflow for churn prevention and customer retention

Agent Status

๐ŸŽฏ Real-time Monitoring
Monitor customer behavior patterns
๐Ÿ“Š Feature Selection
Select optimal feature set
๐Ÿ“ž Dialer Agent
Initiate retention calls
๐Ÿ’ฌ Conversation Agent
Handle customer interactions
๐ŸŽ Offer Manager
Create personalized offers
๐Ÿ“ˆ Analytics Agent
Generate insights and reports