Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
Add more filters










Database
Language
Publication year range
1.
PLoS Biol ; 22(1): e3002458, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38206957

ABSTRACT

iBiology Courses provide trainees with just-in-time learning resources to become effective researchers. These courses can help scientists build core research skills, plan their research projects and careers, and learn from scientists with diverse backgrounds.

2.
Plant Direct ; 5(4): e00316, 2021 Apr.
Article in English | MEDLINE | ID: mdl-33870032

ABSTRACT

Population growth and climate change will impact food security and potentially exacerbate the environmental toll that agriculture has taken on our planet. These existential concerns demand that a passionate, interdisciplinary, and diverse community of plant science professionals is trained during the 21st century. Furthermore, societal trends that question the importance of science and expert knowledge highlight the need to better communicate the value of rigorous fundamental scientific exploration. Engaging students and the general public in the wonder of plants, and science in general, requires renewed efforts that take advantage of advances in technology and new models of funding and knowledge dissemination. In November 2018, funded by the National Science Foundation through the Arabidopsis Research and Training for the 21st century (ART 21) research coordination network, a symposium and workshop were held that included a diverse panel of students, scientists, educators, and administrators from across the US. The purpose of the workshop was to re-envision how outreach programs are funded, evaluated, acknowledged, and shared within the plant science community. One key objective was to generate a roadmap for future efforts. We hope that this document will serve as such, by providing a comprehensive resource for students and young faculty interested in developing effective outreach programs. We also anticipate that this document will guide the formation of community partnerships to scale up currently successful outreach programs, and lead to the design of future programs that effectively engage with a more diverse student body and citizenry.

4.
CBE Life Sci Educ ; 17(1)2018.
Article in English | MEDLINE | ID: mdl-29449270

ABSTRACT

The Graduate Student Internships for Career Exploration (GSICE) program at the University of California, San Francisco (UCSF), offers structured training and hands-on experience through internships for a broad range of PhD-level careers. The GSICE program model was successfully replicated at the University of California, Davis (UC Davis). Here, we present outcome data for a total of 217 PhD students participating in the UCSF and UC Davis programs from 2010 to 2015 and 2014 to 2015, respectively. The internship programs at the two sites demonstrated comparable participation, internship completion rates, and overall outcomes. Using survey, focus group, and individual interview data, we find that the programs provide students with career development skills, while increasing students' confidence in career exploration and decision making. Internships, in particular, were perceived by students to increase their ability to discern a career area of choice and to increase confidence in pursuing that career. We present data showing that program participation does not change median time to degree and may help some trainees avoid "default postdocs." Our findings suggest important strategies for institutions developing internship programs for PhD students, namely: including a structured training component, allowing postgraduation internships, and providing a central organization point for internship programs.


Subject(s)
Biological Science Disciplines/education , Career Choice , Decision Making , Education, Graduate , Internship and Residency , Students , Cognition , Curriculum , Faculty , Feedback , Humans , Peer Group , Research Personnel , Surveys and Questionnaires , Universities
5.
Nucleic Acids Res ; 42(Database issue): D521-30, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24271399

ABSTRACT

The Structure-Function Linkage Database (SFLD, http://sfld.rbvi.ucsf.edu/) is a manually curated classification resource describing structure-function relationships for functionally diverse enzyme superfamilies. Members of such superfamilies are diverse in their overall reactions yet share a common ancestor and some conserved active site features associated with conserved functional attributes such as a partial reaction. Thus, despite their different functions, members of these superfamilies 'look alike', making them easy to misannotate. To address this complexity and enable rational transfer of functional features to unknowns only for those members for which we have sufficient functional information, we subdivide superfamily members into subgroups using sequence information, and lastly into families, sets of enzymes known to catalyze the same reaction using the same mechanistic strategy. Browsing and searching options in the SFLD provide access to all of these levels. The SFLD offers manually curated as well as automatically classified superfamily sets, both accompanied by search and download options for all hierarchical levels. Additional information includes multiple sequence alignments, tab-separated files of functional and other attributes, and sequence similarity networks. The latter provide a new and intuitively powerful way to visualize functional trends mapped to the context of sequence similarity.


Subject(s)
Databases, Protein , Enzymes/chemistry , Enzymes/classification , Enzymes/metabolism , Internet , Molecular Sequence Annotation , Sequence Alignment , Structure-Activity Relationship
6.
PLoS Comput Biol ; 9(5): e1003063, 2013.
Article in English | MEDLINE | ID: mdl-23737737

ABSTRACT

The ongoing functional annotation of proteins relies upon the work of curators to capture experimental findings from scientific literature and apply them to protein sequence and structure data. However, with the increasing use of high-throughput experimental assays, a small number of experimental studies dominate the functional protein annotations collected in databases. Here, we investigate just how prevalent is the "few articles - many proteins" phenomenon. We examine the experimentally validated annotation of proteins provided by several groups in the GO Consortium, and show that the distribution of proteins per published study is exponential, with 0.14% of articles providing the source of annotations for 25% of the proteins in the UniProt-GOA compilation. Since each of the dominant articles describes the use of an assay that can find only one function or a small group of functions, this leads to substantial biases in what we know about the function of many proteins. Mass-spectrometry, microscopy and RNAi experiments dominate high throughput experiments. Consequently, the functional information derived from these experiments is mostly of the subcellular location of proteins, and of the participation of proteins in embryonic developmental pathways. For some organisms, the information provided by different studies overlap by a large amount. We also show that the information provided by high throughput experiments is less specific than those provided by low throughput experiments. Given the experimental techniques available, certain biases in protein function annotation due to high-throughput experiments are unavoidable. Knowing that these biases exist and understanding their characteristics and extent is important for database curators, developers of function annotation programs, and anyone who uses protein function annotation data to plan experiments.


Subject(s)
Computational Biology/methods , Databases, Protein , Molecular Sequence Annotation/methods , Proteins/classification , Animals , High-Throughput Screening Assays , Humans , Proteins/chemistry , Proteins/metabolism
7.
Nat Methods ; 10(3): 221-7, 2013 Mar.
Article in English | MEDLINE | ID: mdl-23353650

ABSTRACT

Automated annotation of protein function is challenging. As the number of sequenced genomes rapidly grows, the overwhelming majority of protein products can only be annotated computationally. If computational predictions are to be relied upon, it is crucial that the accuracy of these methods be high. Here we report the results from the first large-scale community-based critical assessment of protein function annotation (CAFA) experiment. Fifty-four methods representing the state of the art for protein function prediction were evaluated on a target set of 866 proteins from 11 organisms. Two findings stand out: (i) today's best protein function prediction algorithms substantially outperform widely used first-generation methods, with large gains on all types of targets; and (ii) although the top methods perform well enough to guide experiments, there is considerable need for improvement of currently available tools.


Subject(s)
Computational Biology/methods , Molecular Biology/methods , Molecular Sequence Annotation , Proteins/physiology , Algorithms , Animals , Databases, Protein , Exoribonucleases/classification , Exoribonucleases/genetics , Exoribonucleases/physiology , Forecasting , Humans , Proteins/chemistry , Proteins/classification , Proteins/genetics , Species Specificity
8.
PLoS Comput Biol ; 5(12): e1000605, 2009 Dec.
Article in English | MEDLINE | ID: mdl-20011109

ABSTRACT

Due to the rapid release of new data from genome sequencing projects, the majority of protein sequences in public databases have not been experimentally characterized; rather, sequences are annotated using computational analysis. The level of misannotation and the types of misannotation in large public databases are currently unknown and have not been analyzed in depth. We have investigated the misannotation levels for molecular function in four public protein sequence databases (UniProtKB/Swiss-Prot, GenBank NR, UniProtKB/TrEMBL, and KEGG) for a model set of 37 enzyme families for which extensive experimental information is available. The manually curated database Swiss-Prot shows the lowest annotation error levels (close to 0% for most families); the two other protein sequence databases (GenBank NR and TrEMBL) and the protein sequences in the KEGG pathways database exhibit similar and surprisingly high levels of misannotation that average 5%-63% across the six superfamilies studied. For 10 of the 37 families examined, the level of misannotation in one or more of these databases is >80%. Examination of the NR database over time shows that misannotation has increased from 1993 to 2005. The types of misannotation that were found fall into several categories, most associated with "overprediction" of molecular function. These results suggest that misannotation in enzyme superfamilies containing multiple families that catalyze different reactions is a larger problem than has been recognized. Strategies are suggested for addressing some of the systematic problems contributing to these high levels of misannotation.


Subject(s)
Databases, Protein , Biocatalysis , Database Management Systems
9.
Biochemistry ; 48(7): 1445-53, 2009 Feb 24.
Article in English | MEDLINE | ID: mdl-19220063

ABSTRACT

The mechanistically diverse enolase superfamily is a paradigm for elucidating Nature's strategies for divergent evolution of enzyme function. Each of the different reactions catalyzed by members of the superfamily is initiated by abstraction of the alpha-proton of a carboxylate substrate that is coordinated to an essential Mg(2+). The muconate lactonizing enzyme (MLE) from Pseudomonas putida, a member of a family that catalyzes the syn-cycloisomerization of cis,cis-muconate to (4S)-muconolactone in the beta-ketoadipate pathway, has provided critical insights into the structural bases for evolution of function within the superfamily. A second, divergent family of homologous MLEs that catalyzes anti-cycloisomerization has been identified. Structures of members of both families liganded with the common (4S)-muconolactone product (syn, Pseudomonas fluorescens, gi 70731221 ; anti, Mycobacterium smegmatis, gi 118470554 ) document that the conserved Lys at the end of the second beta-strand in the (beta/alpha)(7)beta-barrel domain serves as the acid catalyst in both reactions. The different stereochemical courses (syn and anti) result from different structural strategies for determining substrate specificity: although the distal carboxylate group of the cis,cis-muconate substrate attacks the same face of the proximal double bond, opposite faces of the resulting enolate anion intermediate are presented to the conserved Lys acid catalyst. The discovery of two families of homologous, but stereochemically distinct, MLEs likely provides an example of "pseudoconvergent" evolution of the same function from different homologous progenitors within the enolase superfamily, in which different spatial arrangements of active site functional groups and substrate specificity determinants support catalysis of the same reaction.


Subject(s)
Evolution, Molecular , Intramolecular Lyases/metabolism , Phosphopyruvate Hydratase/metabolism , Biocatalysis , Cloning, Molecular , Crystallography, X-Ray , Intramolecular Lyases/chemistry , Intramolecular Lyases/genetics , Models, Molecular , Mycobacterium smegmatis/enzymology , Phosphopyruvate Hydratase/chemistry , Phosphopyruvate Hydratase/genetics , Phylogeny , Protein Conformation , Pseudomonas fluorescens/enzymology , Pseudomonas putida/enzymology , Stereoisomerism , Substrate Specificity
SELECTION OF CITATIONS
SEARCH DETAIL
...