Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
Data Brief ; 54: 110440, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38711737

ABSTRACT

The proliferation of online disinformation and fake news, particularly in the context of breaking news events, demands the development of effective detection mechanisms. While textual content remains the predominant medium for disseminating misleading information, the contribution of other modalities is increasingly emerging within online outlets and social media platforms. However, multimodal datasets, which incorporate diverse modalities such as texts and images, are not very common yet, especially in low-resource languages. This study addresses this gap by releasing a dataset tailored for multimodal fake news detection in the Italian language. This dataset was originally employed in a shared task on the Italian language. The dataset is divided into two data subsets, each corresponding to a distinct sub-task. In sub-task 1, the goal is to assess the effectiveness of multimodal fake news detection systems. Sub-task 2 aims to delve into the interplay between text and images, specifically analyzing how these modalities mutually influence the interpretation of content when distinguishing between fake and real news. Both sub-tasks were managed as classification problems. The dataset consists of social media posts and news articles. After collecting it, it was labeled via crowdsourcing. Annotators were provided with external knowledge about the topic of the news to be labeled, enhancing their ability to discriminate between fake and real news. The data subsets for sub-task 1 and sub-task 2 consist of 913 and 1350 items, respectively, encompassing newspaper articles and tweets.

2.
Hum Vaccin Immunother ; 16(5): 1062-1069, 2020 05 03.
Article in English | MEDLINE | ID: mdl-32118519

ABSTRACT

Social media have become a common way for people to express their personal viewpoints, including sentiments about health topics. We present the results of an opinion mining analysis on vaccination performed on Twitter from September 2016 to August 2017 in Italy. Vaccine-related tweets were automatically classified as against, in favor or neutral in respect of the vaccination topic by means of supervised machine-learning techniques. During this period, we found an increasing trend in the number of tweets on this topic. According to the overall analysis by category, 60% of tweets were classified as neutral, 23% against vaccination, and 17% in favor of vaccination. Vaccine-related events appeared able to influence the number and the opinion polarity of tweets. In particular, the approval of the decree introducing mandatory immunization for selected childhood diseases produced a prominent effect in the social discussion in terms of number of tweets. Opinion mining analysis based on Twitter showed to be a potentially useful and timely sentinel system to assess the orientation of public opinion toward vaccination and, in future, it may effectively contribute to the development of appropriate communication and information strategies.


Subject(s)
Social Media , Vaccines , Child , Humans , Italy , Public Opinion , Vaccination
3.
IEEE Trans Cybern ; 48(9): 2656-2669, 2018 Sep.
Article in English | MEDLINE | ID: mdl-28945604

ABSTRACT

Fuzzy associative classification has not been widely analyzed in the literature, although associative classifiers (ACs) have proved to be very effective in different real domain applications. The main reason is that learning fuzzy ACs is a very heavy task, especially when dealing with large datasets. To overcome this drawback, in this paper, we propose an efficient distributed fuzzy associative classification approach based on the MapReduce paradigm. The approach exploits a novel distributed discretizer based on fuzzy entropy for efficiently generating fuzzy partitions of the attributes. Then, a set of candidate fuzzy association rules is generated by employing a distributed fuzzy extension of the well-known FP-Growth algorithm. Finally, this set is pruned by using three purposely adapted types of pruning. We implemented our approach on the popular Hadoop framework. Hadoop allows distributing storage and processing of very large data sets on computer clusters built from commodity hardware. We have performed an extensive experimentation and a detailed analysis of the results using six very large datasets with up to 11 000 000 instances. We have also experimented different types of reasoning methods. Focusing on accuracy, model complexity, computation time, and scalability, we compare the results achieved by our approach with those obtained by two distributed nonfuzzy ACs recently proposed in the literature. We highlight that, although the accuracies result to be comparable, the complexity, evaluated in terms of number of rules, of the classifiers generated by the fuzzy distributed approach is lower than the one of the nonfuzzy classifiers.

4.
IEEE Trans Syst Man Cybern B Cybern ; 34(3): 1360-73, 2004 Jun.
Article in English | MEDLINE | ID: mdl-15484909

ABSTRACT

In this paper, we propose a fuzzy logic-based approach which exploits remotely sensed multispectral measurements of the reflected sunlight to estimate the concentration of optically active constituents of the sea water. The relation between the concentrations of interest and the subsurface reflectances is modeled by a set of fuzzy rules extracted automatically from the data through a two-step procedure. First, a compact initial rule base is generated by projecting onto the input variables the clusters produced by a fuzzy clustering algorithm. Then, a genetic algorithm is applied to optimize the rules. Appropriate constraints maintain the semantic properties of the initial model during the genetic evolution. Results of the application of the fuzzy model obtained from data simulated with an ocean color model over the channels of the Medium Resolution Imaging Spectrometer are shown and discussed.


Subject(s)
Chlorophyll/analysis , Color , Colorimetry/methods , Environmental Monitoring/methods , Fuzzy Logic , Phytoplankton/metabolism , Water/analysis , Algorithms , Artificial Intelligence , Oceans and Seas , Spectrum Analysis/methods , Water Microbiology
5.
IEEE Trans Syst Man Cybern B Cybern ; 34(1): 775-82, 2004 Feb.
Article in English | MEDLINE | ID: mdl-15369122

ABSTRACT

One of the critical aspects of clustering algorithms is the correct identification of the dissimilarity measure used to drive the partitioning of the data set. The dissimilarity measure induces the cluster shape and therefore determines the success of clustering algorithms. As cluster shapes change from a data set to another, dissimilarity measures should be extracted from data. To this aim, we exploit some pairs of points with known dissimilarity value to teach a dissimilarity relation to a feed-forward neural network. Then, we use the neural dissimilarity measure to guide an unsupervised relational clustering algorithm. Experiments on synthetic data sets and on the Iris data set show that the relational clustering algorithm based on the neural dissimilarity outperforms some popular clustering algorithms (with possible partial supervision) based on spatial dissimilarity.

SELECTION OF CITATIONS
SEARCH DETAIL
...