Applying natural language processing and machine learning techniques to patient experience feedback: a systematic review.

Khanbhai, M; Anyadi, P; Symons, J; Flott, K; Darzi, A; Mayer, E

View/Open

Published version (595.9Kb)

ICR Author

Darzi, Ara

Author

Khanbhai, M

Anyadi, P

Symons, J

Flott, K

Darzi, A

Show all

Type

Journal Article

Metadata

Show full item record

Abstract

Objectives Unstructured free-text patient feedback contains rich information, and analysing these data manually would require a lot of personnel resources which are not available in most healthcare organisations.To undertake a systematic review of the literature on the use of natural language processing (NLP) and machine learning (ML) to process and analyse free-text patient experience data.Methods Databases were systematically searched to identify articles published between January 2000 and December 2019 examining NLP to analyse free-text patient feedback. Due to the heterogeneous nature of the studies, a narrative synthesis was deemed most appropriate. Data related to the study purpose, corpus, methodology, performance metrics and indicators of quality were recorded.Results Nineteen articles were included. The majority (80%) of studies applied language analysis techniques on patient feedback from social media sites (unsolicited) followed by structured surveys (solicited). Supervised learning was frequently used (n=9), followed by unsupervised (n=6) and semisupervised (n=3). Comments extracted from social media were analysed using an unsupervised approach, and free-text comments held within structured surveys were analysed using a supervised approach. Reported performance metrics included the precision, recall and F-measure, with support vector machine and Naïve Bayes being the best performing ML classifiers.Conclusion NLP and ML have emerged as an important tool for processing unstructured free text. Both supervised and unsupervised approaches have their role depending on the data source. With the advancement of data analysis tools, these techniques may be useful to healthcare organisations to generate insight from the volumes of unstructured free-text data.

Language

eng

Date accepted

2021-01-12

Citation

BMJ health & care informatics, 2021, 28 (1)

Except where otherwise noted, this item's license is described as https://creativecommons.org/licenses/by-nc/4.0

Publications Repository

Publications Repository