Search | VHL Regional Portal

The flora phenotype ontology (FLOPO): tool for integrating morphological traits and phenotypes of vascular plants.

Hoehndorf, Robert; Alshahrani, Mona; Gkoutos, Georgios V; Gosline, George; Groom, Quentin; Hamann, Thomas; Kattge, Jens; de Oliveira, Sylvia Mota; Schmidt, Marco; Sierra, Soraya; Smets, Erik; Vos, Rutger A; Weiland, Claus.

J Biomed Semantics ; 7(1): 65, 2016 11 14.

Article in English | MEDLINE | ID: mdl-27842607

ABSTRACT

BACKGROUND: The systematic analysis of a large number of comparable plant trait data can support investigations into phylogenetics and ecological adaptation, with broad applications in evolutionary biology, agriculture, conservation, and the functioning of ecosystems. Floras, i.e., books collecting the information on all known plant species found within a region, are a potentially rich source of such plant trait data. Floras describe plant traits with a focus on morphology and other traits relevant for species identification in addition to other characteristics of plant species, such as ecological affinities, distribution, economic value, health applications, traditional uses, and so on. However, a key limitation in systematically analyzing information in Floras is the lack of a standardized vocabulary for the described traits as well as the difficulties in extracting structured information from free text. RESULTS: We have developed the Flora Phenotype Ontology (FLOPO), an ontology for describing traits of plant species found in Floras. We used the Plant Ontology (PO) and the Phenotype And Trait Ontology (PATO) to extract entity-quality relationships from digitized taxon descriptions in Floras, and used a formal ontological approach based on phenotype description patterns and automated reasoning to generate the FLOPO. The resulting ontology consists of 25,407 classes and is based on the PO and PATO. The classified ontology closely follows the structure of Plant Ontology in that the primary axis of classification is the observed plant anatomical structure, and more specific traits are then classified based on parthood and subclass relations between anatomical structures as well as subclass relations between phenotypic qualities. CONCLUSIONS: The FLOPO is primarily intended as a framework based on which plant traits can be integrated computationally across all species and higher taxa of flowering plants. Importantly, it is not intended to replace established vocabularies or ontologies, but rather serve as an overarching framework based on which different application- and domain-specific ontologies, thesauri and vocabularies of phenotypes observed in flowering plants can be integrated.

Subject(s)

Biological Ontologies , Phenotype , Plants/anatomy & histology , Plants/genetics

Integrating and visualizing primary data from prospective and legacy taxonomic literature.

Miller, Jeremy A; Agosti, Donat; Penev, Lyubomir; Sautter, Guido; Georgiev, Teodor; Catapano, Terry; Patterson, David; King, David; Pereira, Serrano; Vos, Rutger Aldo; Sierra, Soraya.

Biodivers Data J ; (3): e5063, 2015.

Article in English | MEDLINE | ID: mdl-26023286

ABSTRACT

Specimen data in taxonomic literature are among the highest quality primary biodiversity data. Innovative cybertaxonomic journals are using workflows that maintain data structure and disseminate electronic content to aggregators and other users; such structure is lost in traditional taxonomic publishing. Legacy taxonomic literature is a vast repository of knowledge about biodiversity. Currently, access to that resource is cumbersome, especially for non-specialist data consumers. Markup is a mechanism that makes this content more accessible, and is especially suited to machine analysis. Fine-grained XML (Extensible Markup Language) markup was applied to all (37) open-access articles published in the journal Zootaxa containing treatments on spiders (Order: Araneae). The markup approach was optimized to extract primary specimen data from legacy publications. These data were combined with data from articles containing treatments on spiders published in Biodiversity Data Journal where XML structure is part of the routine publication process. A series of charts was developed to visualize the content of specimen data in XML-tagged taxonomic treatments, either singly or in aggregate. The data can be filtered by several fields (including journal, taxon, institutional collection, collecting country, collector, author, article and treatment) to query particular aspects of the data. We demonstrate here that XML markup using GoldenGATE can address the challenge presented by unstructured legacy data, can extract structured primary biodiversity data which can be aggregated with and jointly queried with data from other Darwin Core-compatible sources, and show how visualization of these data can communicate key information contained in biodiversity literature. We complement recent studies on aspects of biodiversity knowledge using XML structured data to explore 1) the time lag between species discovry and description, and 2) the prevelence of rarity in species descriptions.

Enriched biodiversity data as a resource and service.

Vos, Rutger Aldo; Biserkov, Jordan Valkov; Balech, Bachir; Beard, Niall; Blissett, Matthew; Brenninkmeijer, Christian; van Dooren, Tom; Eades, David; Gosline, George; Groom, Quentin John; Hamann, Thomas D; Hettling, Hannes; Hoehndorf, Robert; Holleman, Ayco; Hovenkamp, Peter; Kelbert, Patricia; King, David; Kirkup, Don; Lammers, Youri; DeMeulemeester, Thibaut; Mietchen, Daniel; Miller, Jeremy A; Mounce, Ross; Nicolson, Nicola; Page, Rod; Pawlik, Aleksandra; Pereira, Serrano; Penev, Lyubomir; Richards, Kevin; Sautter, Guido; Shorthouse, David Peter; Tähtinen, Marko; Weiland, Claus; Williams, Alan R; Sierra, Soraya.

Biodivers Data J ; (2): e1125, 2014.

Article in English | MEDLINE | ID: mdl-25057255

ABSTRACT

BACKGROUND: Recent years have seen a surge in projects that produce large volumes of structured, machine-readable biodiversity data. To make these data amenable to processing by generic, open source "data enrichment" workflows, they are increasingly being represented in a variety of standards-compliant interchange formats. Here, we report on an initiative in which software developers and taxonomists came together to address the challenges and highlight the opportunities in the enrichment of such biodiversity data by engaging in intensive, collaborative software development: The Biodiversity Data Enrichment Hackathon. RESULTS: The hackathon brought together 37 participants (including developers and taxonomists, i.e. scientific professionals that gather, identify, name and classify species) from 10 countries: Belgium, Bulgaria, Canada, Finland, Germany, Italy, the Netherlands, New Zealand, the UK, and the US. The participants brought expertise in processing structured data, text mining, development of ontologies, digital identification keys, geographic information systems, niche modeling, natural language processing, provenance annotation, semantic integration, taxonomic name resolution, web service interfaces, workflow tools and visualisation. Most use cases and exemplar data were provided by taxonomists. One goal of the meeting was to facilitate re-use and enhancement of biodiversity knowledge by a broad range of stakeholders, such as taxonomists, systematists, ecologists, niche modelers, informaticians and ontologists. The suggested use cases resulted in nine breakout groups addressing three main themes: i) mobilising heritage biodiversity knowledge; ii) formalising and linking concepts; and iii) addressing interoperability between service platforms. Another goal was to further foster a community of experts in biodiversity informatics and to build human links between research projects and institutions, in response to recent calls to further such integration in this research domain. CONCLUSIONS: Beyond deriving prototype solutions for each use case, areas of inadequacy were discussed and are being pursued further. It was striking how many possible applications for biodiversity data there were and how quickly solutions could be put together when the normal constraints to collaboration were broken down for a week. Conversely, mobilising biodiversity knowledge from their silos in heritage literature and natural history collections will continue to require formalisation of the concepts (and the links between them) that define the research domain, as well as increased interoperability between the software platforms that operate on these concepts.

Molecular phylogeny of Macaranga, Mallotus, and related genera (Euphorbiaceae s.s.): insights from plastid and nuclear DNA sequence data.

Kulju, Kristo K M; Sierra, Soraya E C; Draisma, Stefano G A; Samuel, Rosabelle; Welzen, Peter C van.

Am J Bot ; 94(10): 1726-43, 2007 Oct.

Article in English | MEDLINE | ID: mdl-21636369

ABSTRACT

Macaranga and Mallotus (Euphorbiaceae s.s.) are two closely related, large paleo(sub)tropical genera. To investigate the phylogenetic relationships between and within them and to determine the position of related genera belonging to the subtribe Rottlerinae, we sequenced one plastid (trnL-F) and three nuclear (ITS, ncpGS, phyC) markers for species representative of these genera. The analyses demonstrated the monophyly of Macaranga and the paraphyly of Mallotus and revealed three highly supported main clades. The genera Cordemoya and Deuteromallotus and the Mallotus sections Hancea and Oliganthae form a basal Cordemoya s.l. clade. The two other clades, the Macaranga clade and the Mallotus s.s. clade (the latter with Coccoceras, Neotrewia, Octospermum, and Trewia), are sister groups. In the Macaranga clade, two basal lineages (comprising mostly sect. Pseudorottlera) and a crown group with three geographically homogenous main clades were identified. The phylogeny of the Mallotus s.s. clade is less clear because of internal conflict in all four data sets. Many of the sections and informal infrageneric groups of Macaranga and Mallotus do not appear to be monophyletic. In both the Macaranga and Mallotus s.s. clades, the African and/or Madagascan taxa are nested in Asian clades, suggesting migrations or dispersals from Asia to Africa and Madagascar.

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL