Sentiment analysis in Bengali Text using NLP

Sarkar, Ankon; Sourav, Aishwarja Paul; Ahmed, Rezvi

Sentiment analysis in Bengali Text using NLP

Files

18301273, 18301078, 18301226_CSE.pdf (1.08 MB)

Date

2023-01

Publisher

Brac University

Authors

Sarkar, Ankon

Sourav, Aishwarja Paul

Ahmed, Rezvi

Full item page

URI

http://hdl.handle.net/10361/19148

Abstract

Natural Language Processing, a branch of AI, teaches computers to understand speech and text in multiple languages. Machine learning or deep learning techniques can be used to develop rule-based models of human-spoken languages to simulate accurate text-meaning predictions. Although many studies have vastly improved the categorization of text data in languages such as English, Arabic, Chinese, Urdu, Hindi, etc, Bengali text categorization has not progressed much compared to oth ers. This research proposes an approach to analyzing and extracting basic emotions (Happiness, Sadness, Fear, Anger, Disgust Surprise) from Bengali text data. This can be done by gathering real-life data and producing a special rule-based algorithm using supervised machine learning and deep learning techniques. We evaluate the performance of our models using our own dataset BANEmo, consisting of 14999 annotated Bengali text data. To make text data machine-readable, we employed Bag of words, TF-IDF, Glove, and BERT embedding. We measured performance using supervised machine learning models like Naive Bayes and Support Vector Ma chine. Deep learning techniques like LSTM and Transformers (BERT) were also implemented. Our BERT model outperformed others with an overall accuracy of 69.2%.

Keywords

Natural language processing, Sentiment analysis, Bangla text, Machine learning, Deep learning, LSTM, Transformers, BERT

LC Subject Headings

Computational linguistics., Natural language processing (Computer science)

Description

Cataloged from PDF version of thesis.
Includes bibliographical references (pages 34-36).
This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Computer Science, 2023.

Department

Department of Computer Science and Engineering

Type

Thesis

Collections

Thesis (Bachelor of Science in Computer Science)

Sentiment analysis in Bengali Text using NLP

Files

Date

Publisher

Authors

URI

Citation

Abstract

Keywords

LC Subject Headings

Description

Publisher Link

Department

Type

Collections