Search | VHL Regional Portal

Benchmark dataset of memes with text transcriptions for automatic detection of multi-modal misogynistic content.

Gasparini, Francesca; Rizzi, Giulia; Saibene, Aurora; Fersini, Elisabetta.

Data Brief ; 44: 108526, 2022 Oct.

Article in English | MEDLINE | ID: mdl-36117643

ABSTRACT

In this paper we present a benchmark dataset generated as part of a project for automatic identification of misogyny within online content, which focuses in particular on memes. The benchmark here described is composed of 800 memes collected from the most popular social media platforms, such as Facebook, Twitter, Instagram and Reddit, and consulting websites dedicated to collection and creation of memes. To gather misogynistic memes, specific keywords that refer to misogynistic content have been considered as search criterion, considering different manifestations of hatred against women, such as body shaming, stereotyping, objectification and violence. In parallel, memes with no misogynist content have been manually downloaded from the same web sources. Among all the collected memes, three domain experts have selected a dataset of 800 memes equally balanced between misogynistic and non-misogynistic ones. This dataset has been validated through a crowdsourcing platform, involving 60 subjects for the labelling process, in order to collect three evaluations for each instance. Two further binary labels have been collected from both the experts and the crowdsourcing platform, for memes evaluated as misogynistic, concerning aggressiveness and irony. Finally for each meme, the text has been manually transcribed. The dataset provided is thus composed of the 800 memes, the labels given by the experts and those obtained by the crowdsourcing validation, and the transcribed texts. This data can be used to approach the problem of automatic detection of misogynistic content on the Web relying on both textual and visual cues, facing phenomenons that are growing every day such as cybersexism and technology-facilitated violence.

A p-Median approach for predicting drug response in tumour cells.

Fersini, Elisabetta; Messina, Enza; Archetti, Francesco.

BMC Bioinformatics ; 15: 353, 2014 Oct 29.

Article in English | MEDLINE | ID: mdl-25359173

ABSTRACT

BACKGROUND: The complexity of biological data related to the genetic origins of tumour cells, originates significant challenges to glean valuable knowledge that can be used to predict therapeutic responses. In order to discover a link between gene expression profiles and drug responses, a computational framework based on Consensus p-Median clustering is proposed. The main goal is to simultaneously predict (in silico) anticancer responses by extracting common patterns among tumour cell lines, selecting genes that could potentially explain the therapy outcome and finally learning a probabilistic model able to predict the therapeutic responses. RESULTS: The experimental investigation performed on the NCI60 dataset highlights three main findings: (1) Consensus p-Median is able to create groups of cell lines that are highly correlated both in terms of gene expression and drug response; (2) from a biological point of view, the proposed approach enables the selection of genes that are strongly involved in several cancer processes; (3) the final prediction of drug responses, built upon Consensus p-Median and the selected genes, represents a promising step for predicting potential useful drugs. CONCLUSION: The proposed learning framework represents a promising approach predicting drug response in tumour cells.

Subject(s)

Antineoplastic Agents/pharmacology , Computer Simulation , Models, Biological , Neoplasms/drug therapy , Neoplasms/genetics , Cell Line, Tumor , Cluster Analysis , Gene Expression Profiling , Humans , Models, Statistical

ABSTRACT

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL