Understanding the relevance of parallelising machine learning algorithms using CUDA for sentiment analysis

Date published

2024-10-17

Free to read from

2025-04-14

Supervisor/s

Journal Title

Journal ISSN

Volume Title

Publisher

Association for Computing Machinery (ACM)

Department

Type

Conference paper

ISSN

Format

Citation

Chai DM, Moulitsas I, Bisandu DB. (2024) Understanding the relevance of parallelising machine learning algorithms using CUDA for sentiment analysis. In: Proceedings of the 2024 8th International Conference on Advances in Artificial Intelligence ICAAI 2024, 17 - 19 Oct 2024, London, United Kingdom, pp. 28-38

Abstract

Sentiment classification is essential in natural language processing, leveraging machine learning algorithms to understand the sentiment expressed in textual data. Over the years, advancements in machine learning, particularly with Naive Bayes (NB) and Support Vector Machines (SVM), have tremendously improved sentiment classification. These models benefit from word embedding techniques such as Word2Vec and GloVe, which provide dense vector representations of words, capturing their semantic and syntactic relationships. This paper explores the parallelisation of NB and SVM models using CUDA on GPUs to enhance computational efficiency and performance. Despite the computational power offered by GPUs, the literature on parallelising machine learning methods, especially for sentiment classification, remains limited. Our work aims to fill this gap by comparing the performance of NB and SVM on CPU and GPU platforms, focusing on execution time and model accuracy. Our experiments demonstrate that NB outperforms SVM in execution time and overall efficiency, mainly when using GPU acceleration. The NB model consistently achieves higher accuracy, precision, and F1 scores with Word2Vec and GloVe embeddings. The results show the importance of leveraging GPU acceleration using varying numbers of threads per block for large-scale sentiment analysis and laying the foundation for parallelising sentiment classification tasks.

Description

Software Description

Software Language

Github

Keywords

46 Information and Computing Sciences, 4611 Machine Learning, Bioengineering, Machine Learning and Artificial Intelligence, Networking and Information Technology R&D (NITRD), CUDA, Machine Learning, Sentiment Analysis, Word Embedding

DOI

Rights

Attribution 4.0 International

Relationships

Relationships

Resources

Funder/s

We acknowledge the Petroleum Technology Development Fund (PTDF) Nigeria funding, which sponsors the first author's PhD research, with ID PTDF/ED/OSS/PHD/DMC/1972/22.