
Case Study: Automating Likert-Scale Data Transformation & Analysis in a Public Health Dissertation

By Ankit Gupta

1. Project & Data Context

This case study focuses on automating Likert-scale analysis for a Public Health PhD dissertation that surveyed 413 healthcare workers across urban and rural hospitals. The survey assessed their perceptions of standard operating protocols using a 10-item, 5-point Likert scale. The goals were to:

Collapse and aggregate responses into meaningful categories
Generate reproducible dashboards of item and composite scores
Execute inferential tests on the derived categorical outcomes

This case study details the end-to-end software stack and workflow that automated each step—maximizing consistency, auditability, and efficiency.

2. Software Ecosystem for Automating Likert-Scale Analysis

R (Base + tidyverse)
- Data import (read.csv) and reshaping (pivot_longer)
- Custom functions for scale collapsing and composite scoring
- Inferential tests: chisq.test(), t.test(), glm()
Microsoft Excel + VBA
- “Masterchart” creation macro: merges pre-survey demographic files with cleaned Likert scores
- Automated dashboard macro: iterates over item groups to produce summary tables and charts
R Markdown & Git
- Parameterized .Rmd templates for generating HTML/PDF reports with embedded code, tables, and narrative
- Git version control for script integrity and reproducibility

3. Data Ingestion & Masterchart Standardization for Automating Likert-Scale Analysis

Raw Data Import

raw_df <- read.csv("survey_responses.csv")
demo_df <- read.csv("demographics.csv")

Schema Validation

# Custom helper: confirm that the item columns Q1 to Q10 are present
validate_schema(raw_df, expected_cols = paste0("Q", 1:10))
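
The case study does not show the body of validate_schema(). A minimal sketch of what such a helper might look like follows, assuming it only needs to confirm that the expected item columns exist and that every non-missing response lies in the 1-5 range. The `valid_values` argument and the error messages are illustrative and not taken from the original script.

```r
# Hypothetical implementation; the original helper's internals are not shown.
validate_schema <- function(df, expected_cols, valid_values = 1:5) {
  # 1. Every expected column must be present
  missing_cols <- setdiff(expected_cols, names(df))
  if (length(missing_cols) > 0) {
    stop("Missing columns: ", paste(missing_cols, collapse = ", "))
  }
  # 2. Every non-missing response must be a valid scale point
  for (col in expected_cols) {
    bad <- !is.na(df[[col]]) & !(df[[col]] %in% valid_values)
    if (any(bad)) {
      stop("Out-of-range values in ", col, " at rows: ",
           paste(which(bad), collapse = ", "))
    }
  }
  invisible(TRUE)
}

# Usage with a small made-up data frame (two respondents, all answers 3)
toy <- as.data.frame(matrix(3L, nrow = 2, ncol = 10,
                            dimnames = list(NULL, paste0("Q", 1:10))))
validate_schema(toy, expected_cols = paste0("Q", 1:10))
```

Failing loudly with stop() means a malformed export halts the pipeline before any transformation runs, which keeps the audit trail clean.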

Excel Masterchart Macro
- VBA script CreateMasterchart() reads both CSVs, standardizes column names, and outputs masterchart.csv with one row per respondent, demographic fields + raw item scores.
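
For cross-checking the macro output, the same one-row-per-respondent structure can be sketched in R with base merge(). This is an R analogue under stated assumptions, not the VBA macro itself: the join key `respondent_id`, the object names, and the toy values are hypothetical, since the source does not name the key column.

```r
# Toy stand-ins for the two CSV files (values are made up)
items <- data.frame(respondent_id = c(101, 102, 103),
                    Q1 = c(5, 3, 2), Q2 = c(4, 4, 1))
demo <- data.frame(Respondent_ID = c(103, 101, 102),
                   Hospital_Type = c("rural", "urban", "urban"))

# Standardize demographic column names to lower case
names(demo) <- tolower(names(demo))

# One row per respondent: demographic fields + raw item scores
masterchart <- merge(demo, items, by = "respondent_id", all = FALSE)

# Guard against duplicated respondents before exporting
stopifnot(!anyDuplicated(masterchart$respondent_id))
```

With `all = FALSE`, respondents missing from either file are dropped, so comparing nrow(masterchart) against both inputs is a quick way to spot unmatched IDs.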

4. Likert-Scale Transformation Pipeline for Automating Likert-Scale Analysis

All transformations occur in likert_transform.R:

Collapsing 5-Point → 3-Point Scale

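library(dplyr)

# Collapse a 5-point item into 3 categories
collapse_likert <- function(x) {
  case_when(
    x >= 4 ~ 3,  # Agree (original 4-5)
    x == 3 ~ 2,  # Neutral
    x <= 2 ~ 1   # Disagree (original 1-2)
  )
}

# Apply to all ten items; missing responses stay NA
df3 <- raw_df %>% mutate(across(Q1:Q10, collapse_likert))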

Composite Scoring of Related Items

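# Average related (already collapsed) items for each respondent
df3 <- df3 %>%
  rowwise() %>%
  mutate(
    composite_training = mean(c(Q2, Q5), na.rm = TRUE),
    composite_protocol = mean(c(Q1, Q4, Q7), na.rm = TRUE)
  ) %>%
  ungroup()  # drop the row-wise grouping before further steps

# Convert composites back into 3-point categories
df3 <- df3 %>%
  mutate(across(starts_with("composite_"), ~ case_when(
    is.na(.x) ~ NA_real_,  # all items missing: keep as missing, not "Disagree"
    .x > 2.33 ~ 3,
    .x >= 1.67 ~ 2,
    TRUE ~ 1
  )))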
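
Because each composite is a mean of items that already take the values 1, 2, or 3, only a handful of means can occur. The short check below is a sketch that re-applies the cutoffs exactly as written above to every possible three-item mean; the helper name `categorise` is hypothetical. Note that 5/3 (about 1.667) falls just below the 1.67 cutoff and 7/3 (about 2.333) falls just above the 2.33 cutoff.

```r
# Hypothetical helper that mirrors the cutoffs used in the pipeline
categorise <- function(m) ifelse(m > 2.33, 3, ifelse(m >= 1.67, 2, 1))

# All possible means of three items scored 1-3: sums 3..9 divided by 3
possible_means <- (3:9) / 3

categorise(possible_means)
# 1 1 1 2 3 3 3
```

Under these cutoffs a three-item composite is coded Neutral only when its mean is exactly 2, which is worth stating explicitly when the categories are reported.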

Export Cleaned Masterchart

write.csv(df3, "likert_masterchart.csv", row.names = FALSE)

5. Automated Dashboard Generation

Excel VBA Macro:
- Loops over each Q1:Q10 and composite_* column
- Builds a pivot table of frequency counts (1, 2, 3) by hospital_type
- Creates a clustered bar chart for each pivot and labels axes/text
Output:
- 13 dashboards (10 individual items + 3 composites), each on its own worksheet, ready for inclusion in R Markdown reports.
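
The counting step of the macro has a direct R analogue, which can be handy for cross-checking the Excel pivots. The sketch below is not the VBA code. It assumes a data frame shaped like the cleaned masterchart, with a `hospital_type` column and collapsed score columns, and it uses made-up values.

```r
# Toy data shaped like the cleaned masterchart (values are made up)
toy <- data.frame(
  hospital_type = c("urban", "urban", "rural", "rural", "urban"),
  Q1 = c(3, 2, 1, 3, 3),
  composite_protocol = c(3, 3, 2, 1, 2)
)

# Columns to summarise: items named Q<number> plus any composite_* column
score_cols <- grep("^(Q[0-9]+|composite_)", names(toy), value = TRUE)

# One hospital_type x category frequency table per column;
# factor(levels = 1:3) keeps empty categories visible as zero counts
freq_tables <- lapply(toy[score_cols], function(x) {
  table(hospital_type = toy$hospital_type,
        category = factor(x, levels = 1:3))
})

freq_tables$Q1
```

Fixing the factor levels matters for dashboards: without it, a category that nobody chose would silently disappear from the table and the chart.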

6. Inferential Testing Pipeline

In inferential_tests.R, the cleaned masterchart is used to test whether the rate of high agreement differs by hospital type:

library(dplyr)

df3 <- read.csv("likert_masterchart.csv")

# Define "high agreement" as category 3
df3 <- df3 %>% mutate(high_composite_protocol = if_else(composite_protocol == 3, 1, 0))

# Chi-square test of independence: hospital type vs. high agreement
# (chisq.test() applies Yates' continuity correction to 2x2 tables by default)
cs_result <- chisq.test(table(df3$hospital_type, df3$high_composite_protocol))
write.csv(as.data.frame(cs_result[c("statistic", "p.value")]), "chi_square_protocol.csv")

Diagnostic plots (mosaic plots, residual heatmaps) are generated automatically and saved to /results/.
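
The source does not show the plotting code or the assumption check. One way these might look in base R is sketched below on made-up counts: chisq.test() returns the expected counts and Pearson residuals alongside the statistic, and mosaicplot() can shade cells by residual. The counts, the output file name, and the use of a temporary directory are illustrative assumptions.

```r
# Made-up 2x2 table: hospital type by high agreement (0/1)
tab <- matrix(c(60, 45, 40, 55), nrow = 2,
              dimnames = list(hospital_type = c("rural", "urban"),
                              high_agreement = c("0", "1")))

res <- chisq.test(tab)

# Common rule of thumb: all expected cell counts should be at least 5
res$expected
all(res$expected >= 5)

# Pearson residuals: (observed - expected) / sqrt(expected)
round(res$residuals, 2)

# Mosaic plot shaded by residuals, written to a file
out_file <- file.path(tempdir(), "mosaic_protocol.pdf")
pdf(out_file)
mosaicplot(as.table(tab), shade = TRUE,
           main = "High agreement by hospital type")
dev.off()
```

If any expected count falls below 5, fisher.test() on the same table is a common alternative.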

7. Reproducible Reporting & Audit

R Markdown Template:
- Parameters: data_path, output_format
- Sections: Data summary, Dashboard gallery, Inferential results, Executive summary
Git Workflow:
- main branch holds stable release; feature branches (feature/likert-transform, feature/dashboard-macro) undergo pull-request reviews.
Audit Report:
- A final Data Audit PDF includes schema checks, transformation logs, macro version stamp, and test assumptions (e.g., expected cell counts for chi-square).
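
A parameterized template of this kind is typically driven through rmarkdown::render(). The wrapper below is a sketch under assumptions: the template file name `likert_report.Rmd` and the function name `render_report` are hypothetical, and the template is assumed to declare both `data_path` and `output_format` under `params:` in its YAML header, as listed above.

```r
# Hypothetical wrapper; requires the rmarkdown package and pandoc at call time
render_report <- function(data_path,
                          output_format = "html_document",
                          template = "likert_report.Rmd") {
  rmarkdown::render(
    input = template,
    output_format = output_format,
    params = list(data_path = data_path, output_format = output_format),
    envir = new.env()  # fresh environment so runs do not share state
  )
}

# Example calls (not run here):
# render_report("likert_masterchart.csv")
# render_report("likert_masterchart.csv", output_format = "pdf_document")
```

Keeping the call in one small function means the exact command used to produce each report can be committed to Git alongside the template.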

8. Lessons & Best Practices

Centralize Data Prep: Use a “masterchart” as the single data source to avoid divergence.
Automate Repetitive Tasks: VBA for dashboards, R functions for transformations ensure uniform output.
Parameterize Reports: R Markdown templates make it trivial to re-run analyses on updated data.
Version Control Everything: Preserves a complete audit trail of data changes and script edits.

This Software & Workflow Walkthrough illustrates how combining Excel macros with R scripting creates a robust, transparent pipeline for Likert-scale data in a Public Health dissertation—delivering audit-ready dashboards and statistical outputs with minimal manual effort.
