1.
Biodivers Data J ; (6): e26659, 2018.
Article in English | MEDLINE | ID: mdl-30393454

ABSTRACT

Scholarly publications of biodiversity literature contain a vast amount of information in human-readable format. The detailed morphological descriptions in these publications contain rich information that can be extracted to facilitate analysis and computational biology research. However, the idiosyncrasies of morphological descriptions still pose a number of challenges to machines. In this work, we demonstrate the use of two different approaches to resolve meronym (i.e. part-of) relations between anatomical parts and their anchor organs: a syntactic rule-based approach and an SVM-based (support vector machine) method. Both methods made use of domain ontologies. We compared the two approaches with two other baseline methods, and the evaluation results show that the syntactic methods (92.1% F1 score) outperformed the SVM methods (80.7% F1 score) and that the part-of ontologies were valuable knowledge sources for the task. It is notable that the mistakes made by the two approaches rarely overlapped. Additional tests will be conducted on the development version of the Explorer of Taxon Concepts toolkit before we make the functionality publicly available. Meanwhile, we will further investigate and leverage the complementary nature of the two approaches to further drive down the error rate, since in practical application even a 1% error rate could lead to hundreds of errors.
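
The two quantities this abstract relies on, the F1 score and the absolute number of errors implied by a given error rate, can be illustrated with a minimal sketch. The function names and the corpus size of 50,000 relations are assumptions made purely for illustration; the abstract reports neither precision and recall values nor a corpus size.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def expected_errors(n_relations: int, error_rate: float) -> int:
    """Approximate count of wrong part-of relations at a given error rate."""
    return round(n_relations * error_rate)


# Hypothetical corpus size, not taken from the abstract.
N_RELATIONS = 50_000

if __name__ == "__main__":
    print(f1_score(0.9, 0.9))
    print(expected_errors(N_RELATIONS, 0.01))  # 500
```

At this assumed scale a 1% error rate already means 500 wrong relations, which is the kind of count the abstract has in mind when it says a small rate can lead to hundreds of errors.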

2.
BMC Bioinformatics ; 17(1): 471, 2016 Nov 17.
Article in English | MEDLINE | ID: mdl-27855645

ABSTRACT

BACKGROUND: Taxonomic descriptions are traditionally composed in natural language and published in a format that cannot be directly used by computers. The Exploring Taxon Concepts (ETC) project has been developing a set of web-based software tools that convert morphological descriptions published in telegraphic style to character data that can be reused and repurposed. This paper introduces the first semi-automated pipeline, to our knowledge, that converts morphological descriptions into taxon-character matrices to support systematics and evolutionary biology research. We then demonstrate and evaluate the use of the ETC Input Creation - Text Capture - Matrix Generation pipeline to generate body part measurement matrices from a set of 188 spider morphological descriptions and report the findings. RESULTS: From the given set of spider taxonomic publications, two versions of input (original and normalized) were generated and used by the ETC Text Capture and ETC Matrix Generation tools. The tools produced two corresponding spider body part measurement matrices, and the matrix from the normalized input was found to be much more similar to a gold standard matrix hand-curated by the scientist co-authors. The lower performance with the original input was attributed to special conventions used in the original descriptions (e.g., the omission of measurement units). The results show that simple normalization of the description text greatly increased the quality of the machine-generated matrix and reduced edit effort. The machine-generated matrix also helped identify issues in the gold standard matrix. CONCLUSIONS: ETC Text Capture and ETC Matrix Generation are low-barrier and effective tools for extracting measurement values from spider taxonomic descriptions and are more effective when the descriptions are self-contained. Special conventions that make the description text less self-contained challenge automated extraction of data from biodiversity descriptions and hinder the automated reuse of the published knowledge. The tools will be updated to support new requirements revealed in this case study.
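
As a rough illustration of the kind of "simple normalization" mentioned above, the sketch below appends a default unit to bare numbers in a measurement sentence. It is not the ETC project's normalization procedure, which the abstract does not describe; the function name, the regular expression, and the choice of millimetres as the default unit are all assumptions.

```python
import re

# A number (integer or decimal) that is not part of a longer token
# and is not already followed by a length unit.
_BARE_NUMBER = re.compile(
    r"(?<![\w.])(\d+(?:\.\d+)?)(?!\.?\d)(?!\s*(?:mm|cm|µm|m)\b)"
)


def add_default_unit(text: str, unit: str = "mm") -> str:
    """Append `unit` to every number that lacks one (hypothetical helper)."""
    return _BARE_NUMBER.sub(rf"\1 {unit}", text)


if __name__ == "__main__":
    sentence = "Total length 4.25. Carapace 1.80 long, 1.35 wide."
    print(add_default_unit(sentence))
    # Total length 4.25 mm. Carapace 1.80 mm long, 1.35 mm wide.
```

The sketch treats every bare number as a measurement, so it is only suitable for sentences already known to contain measurements; counts such as leg spine numbers would need to be excluded first.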


Subject(s)
Biological Evolution, Software, Spiders/anatomy & histology, Animals, Humans
3.
Zookeys ; (223): 1-38, 2012.
Article in English | MEDLINE | ID: mdl-23166458

ABSTRACT

The Neotropical evaniid genus Evaniscus Szépligeti currently includes six species. Two new species are described, Evaniscus lansdownei Mullins, sp. n. from Colombia and Brazil and Evaniscus rafaeli Kawada, sp. n. from Brazil. Evaniscus sulcigenis Roman, syn. n., is synonymized under Evaniscus rufithorax Enderlein. An identification key to species of Evaniscus is provided. Thirty-five parsimony informative morphological characters are analyzed for six ingroup and four outgroup taxa. A topology resulting in a monophyletic Evaniscus is presented with Evaniscus tibialis and Evaniscus rafaeli as sister to the remaining Evaniscus species. The Hymenoptera Anatomy Ontology and other relevant biomedical ontologies are employed to create semantic phenotype statements in Entity-Quality (EQ) format for species descriptions. This approach is an early effort to formalize species descriptions and to make descriptive data available to other domains.

4.
Ciênc. agrotec., (Impr.) ; 34(4): 860-869, July-Aug. 2010. tab
Article in Portuguese | LILACS | ID: lil-556973

ABSTRACT

The objective of this work was to identify, through Pearson correlation and path analysis, variables for characterizing dwarfing rootstocks for pear (Pyrus communis L.) cultivation. The experiment used 49 pear plants grown in the beds of the Departamento de Fitotecnia (Department of Plant Science) of FAEM/UFPel. The plants were evaluated during their vegetative growth period, according to parameters described in instructions of the Ministério da Agricultura, Pecuária e Abastecimento. In the Pearson correlation, the variables VP, NRP, HCP and FCNPRCL stood out. In the path analysis, the number of lenticels had the largest positive effect on VP, NRP and FCNPRCL, and shoot branching had a positive effect on HCP. These two variables were considered effective in the rootstock selection process, together with the basic variables.
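
For readers unfamiliar with the two statistics named in this abstract, the following is a minimal sketch of Pearson correlation and of path analysis with two predictors, where the direct effects (path coefficients) are the standardized partial regression coefficients. The data values and variable names are invented for illustration and are not taken from the study, which does not define its abbreviations here.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def path_coefficients(x1, x2, y):
    """Direct effects of x1 and x2 on y (two-predictor path analysis)."""
    r12, r1y, r2y = pearson(x1, x2), pearson(x1, y), pearson(x2, y)
    denom = 1 - r12 ** 2  # zero only if the predictors are perfectly collinear
    p1 = (r1y - r12 * r2y) / denom
    p2 = (r2y - r12 * r1y) / denom
    return p1, p2


if __name__ == "__main__":
    # Hypothetical measurements on six plants.
    lenticels = [12, 15, 9, 20, 17, 11]
    branching = [3, 2, 4, 5, 3, 2]
    vigor = [1.1, 1.4, 0.9, 2.0, 1.6, 1.0]

    p1, p2 = path_coefficients(lenticels, branching, vigor)
    r12 = pearson(lenticels, branching)
    # Each correlation splits into a direct effect plus an indirect
    # effect that passes through the other predictor.
    print("direct effect of lenticels:", p1)
    print("indirect effect via branching:", r12 * p2)
    print("total (Pearson r):", pearson(lenticels, vigor))
```

The decomposition printed at the end (direct plus indirect effect equals the Pearson correlation) is what lets path analysis separate a variable's own influence from influence it shares with correlated variables.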
