Back to Search
Book ChapterUnknown

Hate Speech Detection in the Bengali Language: A Dataset and Its Baseline Evaluation

Author Affiliations
Shahjalal University of Science and Technology
Published InAlgorithms for intelligent systems
Year2021
Citations113

Abstract

Social media sites such as YouTube and Facebook have become an integral part of everyone’s life, and in the last few years, hate speech in the social media comment section has increased rapidly. Detection of hate speech on social media Web sites faces a variety of challenges including small imbalanced datasets, the finding of an appropriate model and also the choice of feature analysis method. Furthermore, this problem is more severe for the Bengali speaking community due to the lack of gold standard labeled datasets. This paper presents a new dataset of 30,000 user comments tagged by crowdsourcing and verified by expert. All the user comments collected from YouTube and Facebook comment section and classified into seven categories: sports, entertainment,…
View at Publisher

BORR does not host full-text PDFs. The button above takes you to the original publisher.