Wellness-centre NLP Analytics Dashboard

Wellness-centre NLP Analytics

A comprehensive analysis of customer feedback to extract actionable insights through advanced Natural Language Processing techniques

27,586

Total Reviews Analysed

5,824

Negative Reviews

600+

Wellness Centres

10

Topic Clusters

Project Overview

This project analyses customer reviews from Google and Trustpilot for Wellness-centre Group to identify key drivers of customer satisfaction and dissatisfaction. By leveraging advanced Natural Language Processing (NLP) techniques, we extract actionable insights to enhance the customer experience across 600+ wellness centres.

  • Data Sources 12 months of reviews from Google and Trustpilot platforms
  • Analysis Methods BERTopic, Emotion Analysis, Large Language Models (Falcon-7B), Gensim LDA
  • Primary Goal Identify key areas for improvement to enhance customer experience

Topic Distribution in Negative Reviews

0 50 100 150 200 Cleanliness Access Equipment Staff Classes Maintenance Crowding Amenities 191 122 112 109 106 97 79 74 Top Topics by Document Count

Emotion Analysis

Emotions Negative Reviews Anger (42%) Sadness (33%) Joy (18%) Fear (7%) Emotion Distribution

Top Locations with Negative Reviews

Rank Location Google Reviews Trustpilot Reviews Total Reviews
1 London Stratford 59 22 81
2 London Enfield 25 23 48
3 London Swiss Cottage 22 15 37
4 London Hayes 17 16 33
5 Bradford Thornbury 19 14 33

Topic Clusters

Showers & Cleanliness 191
Day Pass & Access 122
Equipment & Weights 112
Staff & Rudeness 109
Music & Classes 106
Equipment Maintenance 97
Crowding 79

Methodology & Process

1
Data Import & Preprocessing

Imported and cleaned review data from Google and Trustpilot sources, preparing it for NLP analysis.

  • Data Sources: Google_12_months.xlsx and Trustpilot_12_months.xlsx
  • Cleaning Operations: Removed missing values, cleaned text, tokenised reviews
  • Data Volume: After cleaning - Google (11,865 reviews) & Trustpilot (15,721 reviews)
  • Anonymisation: Removed personal identifiers and Wellness-centre specific terms
  • Preprocessing: Converted to lowercase, removed stopwords and numbers
Raw Data Files 27,586 reviews Data Cleaning & Preprocessing Cleaned Dataset Ready for Analysis
2
Initial Data Exploration

Conducted preliminary analysis to understand review distribution and common themes across both platforms.

  • Unique Locations: Google (456) & Trustpilot (376) with 7 common locations in top negative reviews
  • Common Locations: London Stratford, London Enfield, London Swiss Cottage among the most frequently reviewed
  • Word Frequency Analysis: Identified most common terms in both positive and negative reviews
  • Negative Reviews: Google (2,425) & Trustpilot (3,399) negative reviews identified
equipment great staff dirty clean rude classes crowded machines broken weights
3
Topic Modelling with BERTopic

Applied BERTopic to identify key themes and patterns in negative customer reviews, focusing on common locations.

  • Model Configuration: Used KMeans clustering with 10 topics
  • Input: 1,000 negative reviews from locations common to both platforms
  • Visualisation: Created intertopic distance maps and bar charts of most salient terms
  • Results: Identified distinct clusters representing specific customer concerns
Showers 191 Access 122 Equipment 112 Staff 109 Classes 106
4
Location-Specific Analysis

Expanded analysis to top 30 locations to identify geographical patterns and location-specific issues.

  • Top Locations Analysis: Identified London Stratford, Leicester Walnut Street, and London Enfield as having highest negative reviews
  • Cross-Platform Comparison: Examined review patterns across Google and Trustpilot
  • Location-Specific Topics: Applied BERTopic on reviews from top 30 locations
  • Contrasting Results: Identified differences between overall findings and top-location findings
London Stratford London Enfield London Swiss Cottage Bradford
5
Emotion Analysis

Used BERT-based emotion classification to identify emotional patterns in reviews, with particular focus on anger.

  • Model: bhadresh-savani/bert-base-uncased-emotion from Hugging Face
  • Emotions Analysed: Anger, sadness, joy, fear, love, surprise
  • Results: Anger was the dominant emotion in negative reviews (Google: 807, Trustpilot: 1,225)
  • Analysis: Applied BERTopic on anger-dominant reviews to identify key triggers
Emotion Distribution in Negative Reviews Anger (2,032) Sadness (1,599) Joy (1,705) Fear (316) Love (111)
6
Advanced Analysis with Falcon-7B

Leveraged large language model capabilities to extract topics and generate actionable insights from negative reviews.

  • Model: tiiuae/falcon-7b-instruct from Hugging Face
  • Process: Extracted top 3 topics from each negative review
  • Topic Clustering: Applied BERTopic to the extracted topics
  • Action Generation: Generated specific recommendations for each topic cluster
Negative Reviews Falcon-7B LLM Topic Extraction Actionable Insights
7
Validation with Gensim LDA

Applied traditional LDA topic modelling as a validation method to compare with BERTopic findings.

  • Model: Gensim LDA with 10 topics
  • Input: All negative reviews from Google and Trustpilot
  • Visualisation: Created interactive topic visualisations with pyLDAvis
  • Comparison: Confirmed consistency with BERTopic findings, but with less granularity
1 2 3 4 equipment staff showers membership