Your browser doesn't support javascript.
Denigrate Comment Detection in Low-Resource Hindi Language Using Attention-Based Residual Networks
ACM Transactions on Asian and Low-Resource Language Information Processing ; 21(1), 2022.
Article in English | Scopus | ID: covidwho-1701467
ABSTRACT
Cyberspace has been recognized as a conducive environment for use of various hostile, direct, and indirect behavioural tactics to target individuals or groups. Denigration is one of the most frequently used cyberbullying ploys to actively damage, humiliate, and disparage the online reputation of target by sending, posting, or publishing cruel rumours, gossip, and untrue statements. Previous pertinent studies report detecting profane, vulgar, and offensive words primarily in the English language. This research puts forward a model to detect online denigration bullying in low-resource Hindi language using attention residual networks. The proposed model Hindi Denigrate Comment-Attention Residual Network (HDC-ARN) intends to uncover defamatory posts (denigrate comments) written in Hindi language which stake and vilify a person or an entity in public. Data with 942 denigrate comments and 1499 non-denigrate comments is scraped using certain hashtags from two recent trending events in India Tablighi Jamaat spiked Covid-19 (April 2020, Event 1) and Sushant Singh Rajput Death (June 2020 Event 2). Only text-based features, that is, the actual content of the post, are considered. The pre-Trained word embedding for Hindi language from fastText is used. The model has three ResNet blocks with an attention layer that generates a post vector for a single input, which is passed through a sigmoid activation function to get the final output as either denigrate (positive class) or non-denigrate (negative class). An F-1 score of 0.642 is achieved on the dataset. © 2021 Association for Computing Machinery.
Keywords

Full text: Available Collection: Databases of international organizations Database: Scopus Language: English Journal: ACM Transactions on Asian and Low-Resource Language Information Processing Year: 2022 Document Type: Article

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: Databases of international organizations Database: Scopus Language: English Journal: ACM Transactions on Asian and Low-Resource Language Information Processing Year: 2022 Document Type: Article