Data supporting: 'Analysing the Sentiment of Air-Traveller: A Comparative Analysis'

Date

2022-08-31 12:55

Advisors

Journal Title

Journal ISSN

Volume Title

Publisher

Cranfield University

Department

Type

Dataset

ISSN

item.page.extent-format

Citation

Homaid, Mohammed Salih A; Bisandu, Desmond Bala; Moulitsas, Irene; Jenkins, Karl (2022). Data supporting: 'Analysing the Sentiment of Air-Traveller: A Comparative Analysis'. Cranfield Online Research Data (CORD). Dataset. https://doi.org/10.17862/cranfield.rd.19375334

Abstract

Airport service qualityis considered to be an indicator of passenger satisfaction. However, assessingthis by conventional methods requires continuous observation and monitoring.Therefore, during the past few years, the use of machine learning techniquesfor this purpose has attracted considerable attention for analysing thesentiment of the air traveller. A sentiment analysis system for textual dataanalytics leverages the natural language processing and machine learningtechniques in order to determine whether a piece of writing is positive, negativeor neutral. Numerous methods exist for estimating sentiments which includelexical-based methodologies and directed artificial intelligence strategies.Despite the wide use and ubiquity of certain strategies, it remains unclearwhich is the best strategy for recognising the intensity of the sentiments of amessage. It is necessary to compare these techniques in order to understandtheir advantages, disadvantages and limitations. In this paper, we compared theValence Aware Dictionary and sentiment Reasoner, a sentiment analysis techniquespecifically attuned and well known for performing good on social media data,with the conventional machine learning techniques of handling the textual databy converting it into numerical form. We used the review data obtained from theSKYTRAX website for each airport. The machine learning algorithms evaluated inthis paper are VADER sentiment and logistic regression. The termfrequency-inverse document frequency is used in order to convert the textualreview datainto the resulting numerical columns. This was formulated as a classificationproblem, whereby the prediction of the algorithm was compared with the actualrecommendation of the passenger in the dataset. The results were analysedaccording to the accuracy, precision, recall and F1-score. From the analysis ofthe results, we observed that logistic regression outperformed the VADERsentiment analysis.

Description

item.page.description-software

item.page.type-software-language

item.page.identifier-giturl

Keywords

Airport service quality, data analytics, machine learning, sentiment analysis, text mining, regression'

Rights

MIT

item.page.relationships

item.page.relation-supplements

Collections