Pesquisa | Portal Regional da BVS

1.

Community detection in hypergraphs via mutual information maximization.

Kritschgau, Jürgen; Kaiser, Daniel; Alvarado Rodriguez, Oliver; Amburg, Ilya; Bolkema, Jessalyn; Grubb, Thomas; Lan, Fangfei; Maleki, Sepideh; Chodrow, Phil; Kay, Bill.

Sci Rep ; 14(1): 6933, 2024 Mar 23.

Artigo em Inglês | MEDLINE | ID: mdl-38521798

RESUMO

The hypergraph community detection problem seeks to identify groups of related vertices in hypergraph data. We propose an information-theoretic hypergraph community detection algorithm which compresses the observed data in terms of community labels and community-edge intersections. This algorithm can also be viewed as maximum-likelihood inference in a degree-corrected microcanonical stochastic blockmodel. We perform the compression/inference step via simulated annealing. Unlike several recent algorithms based on canonical models, our microcanonical algorithm does not require inference of statistical parameters such as vertex degrees or pairwise group connection rates. Through synthetic experiments, we find that our algorithm succeeds down to recently-conjectured thresholds for sparse random hypergraphs. We also find competitive performance in cluster recovery tasks on several hypergraph data sets.

2.

A comprehensive guide to CAN IDS data and introduction of the ROAD dataset.

Verma, Miki E; Bridges, Robert A; Iannacone, Michael D; Hollifield, Samuel C; Moriano, Pablo; Hespeler, Steven C; Kay, Bill; Combs, Frank L.

PLoS One ; 19(1): e0296879, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38252659

RESUMO

Although ubiquitous in modern vehicles, Controller Area Networks (CANs) lack basic security properties and are easily exploitable. A rapidly growing field of CAN security research has emerged that seeks to detect intrusions or anomalies on CANs. Producing vehicular CAN data with a variety of intrusions is a difficult task for most researchers as it requires expensive assets and deep expertise. To illuminate this task, we introduce the first comprehensive guide to the existing open CAN intrusion detection system (IDS) datasets. We categorize attacks on CANs including fabrication (adding frames, e.g., flooding or targeting and ID), suspension (removing an ID's frames), and masquerade attacks (spoofed frames sent in lieu of suspended ones). We provide a quality analysis of each dataset; an enumeration of each datasets' attacks, benefits, and drawbacks; categorization as real vs. simulated CAN data and real vs. simulated attacks; whether the data is raw CAN data or signal-translated; number of vehicles/CANs; quantity in terms of time; and finally a suggested use case of each dataset. State-of-the-art public CAN IDS datasets are limited to real fabrication (simple message injection) attacks and simulated attacks often in synthetic data, lacking fidelity. In general, the physical effects of attacks on the vehicle are not verified in the available datasets. Only one dataset provides signal-translated data but is missing a corresponding "raw" binary version. This issue pigeon-holes CAN IDS research into testing on limited and often inappropriate data (usually with attacks that are too easily detectable to truly test the method). The scarcity of appropriate data has stymied comparability and reproducibility of results for researchers. As our primary contribution, we present the Real ORNL Automotive Dynamometer (ROAD) CAN IDS dataset, consisting of over 3.5 hours of one vehicle's CAN data. ROAD contains ambient data recorded during a diverse set of activities, and attacks of increasing stealth with multiple variants and instances of real (i.e. non-simulated) fuzzing, fabrication, unique advanced attacks, and simulated masquerade attacks. To facilitate a benchmark for CAN IDS methods that require signal-translated inputs, we also provide the signal time series format for many of the CAN captures. Our contributions aim to facilitate appropriate benchmarking and needed comparability in the CAN IDS research field.

Assuntos

Benchmarking , Terapia Implosiva , Animais , Reprodutibilidade dos Testes , Columbidae , Inundações

3.

Correction: Nielson et al. Similarity Downselection: Finding the n Most Dissimilar Molecular Conformers for Reference-Free Metabolomics. Metabolites 2023, 13, 105.

Nielson, Felicity F; Kay, Bill; Young, Stephen J; Colby, Sean M; Renslow, Ryan S; Metz, Thomas O.

Metabolites ; 13(11)2023 Nov 17.

Artigo em Inglês | MEDLINE | ID: mdl-37999262

RESUMO

There were missing figures and associated legends for Figure 3 and Figure 4 as published due to a publication error [...].

4.

Similarity Downselection: Finding the n Most Dissimilar Molecular Conformers for Reference-Free Metabolomics.

Nielson, Felicity F; Kay, Bill; Young, Stephen J; Colby, Sean M; Renslow, Ryan S; Metz, Thomas O.

Metabolites ; 13(1)2023 Jan 09.

Artigo em Inglês | MEDLINE | ID: mdl-36677030

RESUMO

Computational methods for creating in silico libraries of molecular descriptors (e.g., collision cross sections) are becoming increasingly prevalent due to the limited number of authentic reference materials available for traditional library building. These so-called "reference-free metabolomics" methods require sampling sets of molecular conformers in order to produce high accuracy property predictions. Due to the computational cost of the subsequent calculations for each conformer, there is a need to sample the most relevant subset and avoid repeating calculations on conformers that are nearly identical. The goal of this study is to introduce a heuristic method of finding the most dissimilar conformers from a larger population in order to help speed up reference-free calculation methods and maintain a high property prediction accuracy. Finding the set of the n items most dissimilar from each other out of a larger population becomes increasingly difficult and computationally expensive as either n or the population size grows large. Because there exists a pairwise relationship between each item and all other items in the population, finding the set of the n most dissimilar items is different than simply sorting an array of numbers. For instance, if you have a set of the most dissimilar n = 4 items, one or more of the items from n = 4 might not be in the set n = 5. An exact solution would have to search all possible combinations of size n in the population exhaustively. We present an open-source software called similarity downselection (SDS), written in Python and freely available on GitHub. SDS implements a heuristic algorithm for quickly finding the approximate set(s) of the n most dissimilar items. We benchmark SDS against a Monte Carlo method, which attempts to find the exact solution through repeated random sampling. We show that for SDS to find the set of n most dissimilar conformers, our method is not only orders of magnitude faster, but it is also more accurate than running Monte Carlo for 1,000,000 iterations, each searching for set sizes n = 3-7 out of a population of 50,000. We also benchmark SDS against the exact solution for example small populations, showing that SDS produces a solution close to the exact solution in these instances. Using theoretical approaches, we also demonstrate the constraints of the greedy algorithm and its efficacy as a ratio to the exact solution.

5.

Opportunities for neuromorphic computing algorithms and applications.

Schuman, Catherine D; Kulkarni, Shruti R; Parsa, Maryam; Mitchell, J Parker; Date, Prasanna; Kay, Bill.

Nat Comput Sci ; 2(1): 10-19, 2022 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-38177712

RESUMO

Neuromorphic computing technologies will be important for the future of computing, but much of the work in neuromorphic computing has focused on hardware development. Here, we review recent results in neuromorphic computing algorithms and applications. We highlight characteristics of neuromorphic computing technologies that make them attractive for the future of computing and we discuss opportunities for future development of algorithms and applications on these systems.

6.

Publisher Correction: Opportunities for neuromorphic computing algorithms and applications.

Schuman, Catherine D; Kulkarni, Shruti R; Parsa, Maryam; Mitchell, J Parker; Date, Prasanna; Kay, Bill.

Nat Comput Sci ; 2(3): 205, 2022 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-38214649

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA