CoSaD-Code-Mixed Sentiments Analysis for Dravidian Languages

Balouchzahi, F.; Shashirekha, H. L.; Sidorov, G.

Balouchzahi, F.; Shashirekha, H. L.; Sidorov, G..

Working Notes of FIRE - 13th Forum for Information Retrieval Evaluation, FIRE-WN 2021 ; 3159:887-898, 2021.

Article in English | Scopus | ID: covidwho-1957805

ABSTRACT

ABSTRACT

Analyzing sentiments or opinions in code-mixed languages is gaining importance due to increase in the use of social media and online platforms especially during the Covid-19 pandemic. In a multilingual society like India, code-mixing and script mixing is quite common as people especially the younger generation are quite familiar in using more than one language. In view of this, the current paper describes the models submitted by our team MUCIC for the shared task in’Sentiments Analysis (SA) for Dravidian Languages in Code-Mixed Text’. The objective of this shared task is to develop and evaluate models for code-mixed datasets in three Dravidian languages, namely Kannada, Malayalam, and Tamil mixed with English language resulting in Kannada-English (Ka-En), Malayalam-English (Ma-En), and Tamil-English (Ta-En) language pairs. N-grams of char, char sequences, and syllables features are transformed into feature vectors and are used to train three Machine Learning (ML) classifiers with majority voting. The predictions on the Test set obtained average weighted F1-scores of 0.628, 0.726, and 0.619 securing 2nd, 4th, and 5th ranks for Ka-En, Ma-En, and Ta-En language pairs respectively. © 2021 Copyright for this paper by its authors.

Keywords

Code-Mixing; Dravidian Languages; Machine Learning; n-grams; Sentiments Analysis; Codes (symbols); Computational linguistics; Mixing; Dravidian language; English languages; Language pairs; Machine-learning; Malayalams; Online platforms; Sentiment analysis; Social media platforms

Search on Google

XML

Collection: Databases of international organizations Database: Scopus Language: English Journal: Working Notes of FIRE - 13th Forum for Information Retrieval Evaluation, FIRE-WN 2021 Year: 2021 Document Type: Article

Similar

MEDLINE

LILACS

LIS

Search on Google

XML