Load UCI Bank Marketing dataset, analyse structure, visualise target distribution, and perform correlation analysis
Encode categorical variables using label encoding. No standardisation needed for tree-based models
Train and evaluate Decision Tree, Random Forest, AdaBoost, Gradient Boosting, and XGBoost models
Optimise Decision Tree using GridSearch with pre-pruning and post-pruning techniques
Use SHAP values to interpret model predictions and understand feature importance
Understanding which features drive model predictions and their directional impact
Technical Achievements: Successfully implemented comprehensive machine learning pipeline with proper handling of class imbalance, feature encoding, and model interpretation. SHAP analysis revealed actionable insights about customer behaviour patterns.
Business Value: The model provides clear direction for marketing strategy optimisation, with call duration and economic indicators as primary conversion drivers. Implementation of these insights could significantly improve campaign effectiveness and ROI.
Next Steps: Deploy the Gradient Boosting model in production environment, implement A/B testing framework for validation, and establish monitoring systems for model drift detection and performance tracking.