Understanding the relevance of parallelising machine learning algorithms using CUDA for sentiment analysis
Abstract
Sentiment classification is a core task in natural language processing, using machine learning algorithms to identify the sentiment expressed in textual data. Over the years, advances in machine learning, particularly with Naive Bayes (NB) and Support Vector Machines (SVM), have substantially improved sentiment classification. These models benefit from word embedding techniques such as Word2Vec and GloVe, which provide dense vector representations of words that capture their semantic and syntactic relationships. This paper explores the parallelisation of NB and SVM models using CUDA on GPUs to improve computational efficiency and performance. Despite the computational power offered by GPUs, the literature on parallelising machine learning methods, especially for sentiment classification, remains limited. Our work aims to fill this gap by comparing the performance of NB and SVM on CPU and GPU platforms, focusing on execution time and model accuracy. Our experiments demonstrate that NB outperforms SVM in execution time and overall efficiency, particularly when GPU acceleration is used. The NB model consistently achieves higher accuracy, precision, and F1 scores with both Word2Vec and GloVe embeddings. The results demonstrate the importance of GPU acceleration, tuned through the number of threads per block, for large-scale sentiment analysis, and lay the foundation for parallelising sentiment classification tasks.
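As a rough illustration of the kind of kernel launch whose threads-per-block setting is being varied (a minimal sketch, not code from the paper; the kernel name, embedding dimensionality, document count, and the linear scoring function are all assumptions), the following CUDA snippet assigns one thread per document and scores its embedding against a weight vector, as a linear classifier such as an SVM decision function would:

    #include <cstdio>
    #include <cuda_runtime.h>

    #define DIM 100  // assumed embedding dimensionality (e.g. 100-d GloVe vectors)

    // Each thread scores one document embedding with a linear decision
    // function w.x + b; sign(score) would give the predicted sentiment.
    __global__ void score_documents(const float *embeddings, const float *weights,
                                    float bias, float *scores, int n_docs)
    {
        int doc = blockIdx.x * blockDim.x + threadIdx.x;
        if (doc >= n_docs) return;

        float s = bias;
        for (int j = 0; j < DIM; ++j)
            s += embeddings[doc * DIM + j] * weights[j];
        scores[doc] = s;
    }

    int main()
    {
        const int n_docs = 1 << 16;             // illustrative corpus size
        const int threads_per_block = 256;      // the launch parameter being tuned
        const int blocks = (n_docs + threads_per_block - 1) / threads_per_block;

        float *d_emb, *d_w, *d_scores;
        cudaMalloc((void **)&d_emb, n_docs * DIM * sizeof(float));
        cudaMalloc((void **)&d_w, DIM * sizeof(float));
        cudaMalloc((void **)&d_scores, n_docs * sizeof(float));
        // (real embeddings and trained weights would be copied in with cudaMemcpy here)

        score_documents<<<blocks, threads_per_block>>>(d_emb, d_w, 0.0f, d_scores, n_docs);
        cudaDeviceSynchronize();

        cudaFree(d_emb); cudaFree(d_w); cudaFree(d_scores);
        return 0;
    }

Because occupancy and memory-coalescing behaviour depend on the block size, changing threads_per_block in a launch like this is what makes execution time vary with the threads-per-block configuration discussed above.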