Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
J Bioinform Comput Biol ; 6(1): 125-62, 2008 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-18324750

RESUMO

We model the recombination process of fungal systems via chromatid exchange in meiosis, which accounts for any type of bivalent configuration in a genetic interval in any specified order of genetic markers, for both random spore and tetrad data. First, a probability model framework is developed for two genes and then generalized for an arbitrary number of genes. Maximum likelihood estimators (MLEs) for both random and tetrad data are developed. It is shown that the MLE of recombination for tetrad data is uniformly more efficient over that from random spore data by a factor of at least 4 usually. The MLE for the generalized probability framework is computed using the expectation-maximization (EM) algorithm. Pearson's chi-squared statistic is computed as a measure of goodness of fit using a product-multinomial setup. We implement our model with genetic marker data on the whole genome of Neurospora crassa. Simulated annealing is used to search for the best order of genetic markers for each chromosome, and the goodness of fit value is evaluated for model assumptions. Inferred map orders are corroborated by genomic sequence, with the exception of linkage groups I, II, and V.


Assuntos
Mapeamento Cromossômico/métodos , Cromossomos Fúngicos/genética , Marcadores Genéticos/genética , Meiose/genética , Modelos Genéticos , Neurospora crassa/genética , Recombinação Genética/genética , Simulação por Computador , Funções Verossimilhança , Modelos Estatísticos
2.
J Bioinform Comput Biol ; 5(2a): 201-50, 2007 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-17589960

RESUMO

A multi-locus likelihood of a genetic map is computed based on a mathematical model of chromatid exchange in meiosis that accounts for any type of bivalent configuration in a genetic interval in any specified order of genetic markers. The computational problem is to calculate the likelihood (L) and maximize L by choosing an ordering of genetic markers on the map and the recombination distances between markers. This maximum likelihood estimate (MLE) could be found either with a straightforward algorithm or with the proposed recursive linking algorithm that implements the likelihood computation process involving an iterative procedure is called Expectation Maximization (EM). The time complexity of the straightforward algorithm is exponential without bound in the number of genetic markers, and implementation of the model with a straightforward algorithm for more than seven genetic markers is not feasible, thus motivating the critical importance of the proposed recursive linking algorithm. The recursive linking algorithm decomposes the pool of genetic markers into segments and renders the model implementable for hundreds of genetic markers. The recursive algorithm is shown to reduce the order of time complexity from exponential to linear in the number of markers. The improvement in time complexity is shown theoretically by a worst-case analysis of the algorithm and supported by run time results using data on linkage group-II of the fungal genome Neurospora crassa.


Assuntos
Algoritmos , Mapeamento Cromossômico/métodos , Ligação Genética/genética , Marcadores Genéticos/genética , Recombinação Genética/genética , Design de Software , Software , Simulação por Computador , Modelos Genéticos , Modelos Estatísticos , Análise de Sequência de DNA/métodos
3.
Artigo em Inglês | MEDLINE | ID: mdl-17369637

RESUMO

Assuming no interference, a multi-locus genetic likelihood is implemented based on a mathematical model of the recombination process in meiosis that accounts for events up to double crossovers in the genetic interval for any specified order of genetic markers. The mathematical model is realized with a straightforward algorithm that implements the likelihood computation process. The time complexity of the straightforward algorithm is exponential without bound in the number of genetic markers and implementation of the model for more than 7 genetic markers is not feasible, motivating the need for a novel algorithm. A recursive linking algorithm is proposed that decomposes the pool of genetic markers into segments and renders the model implementable for a large number of genetic markers. The recursive algorithm is shown to reduce the order of time complexity from exponential to linear. The improvement in time complexity is shown theoretically by a worst-case analysis of the algorithm and supported by run time results using data on linkage group-I of the fungal genome Neurospora crassa.


Assuntos
Biologia Computacional/métodos , Ligação Genética , Marcadores Genéticos , Algoritmos , Genoma Fúngico , Funções Verossimilhança , Modelos Genéticos , Modelos Estatísticos , Modelos Teóricos , Neurospora crassa/genética , Probabilidade , Recombinação Genética , Fatores de Tempo
4.
Bioinformatics ; 17(3): 205-13, 2001 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-11294786

RESUMO

MOTIVATION: Contig maps are a type of physical map that show the native order of a set of overlapping genomic clones. Overlaps between clones can be detected by finding common sequences using a number of experimental protocols including hybridization of probes. All current mapping algorithms of which we are aware require that hybridizations be scored using a fixed number of discrete values (typically 0/1 or high/medium/low). When hybridization data is captured automatically using digital equipment, this provides the opportunity for hybridization intensities to be used in map construction. More fine-grained distinctions in the levels of hybridization may be exploited by algorithms to generate more accurate physical maps. RESULTS: We describe an approach to creating contig maps that uses measured hybridization intensities instead of data scored with a fixed number of discrete values. We describe and compare four algorithms for creating physical maps with hybridization intensities. Simulations using measured intensities sampled from actual data on Aspergillus nidulans indicate that using hybridization intensities rather than data that is automatically scored with respect to threshold values may yield more accurate physical maps.


Assuntos
Algoritmos , Mapeamento de Sequências Contíguas , Hibridização de Ácido Nucleico , Aspergillus nidulans/genética , Automação , DNA Fúngico/análise , Processamento Eletrônico de Dados , Software
5.
Genetics ; 157(3): 1021-43, 2001 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-11238392

RESUMO

Reconstructing a physical map of a chromosome from a genomic library presents a central computational problem in genetics. Physical map reconstruction in the presence of errors is a problem of high computational complexity that provides the motivation for parallel computing. Parallelization strategies for a maximum-likelihood estimation-based approach to physical map reconstruction are presented. The estimation procedure entails a gradient descent search for determining the optimal spacings between probes for a given probe ordering. The optimal probe ordering is determined using a stochastic optimization algorithm such as simulated annealing or microcanonical annealing. A two-level parallelization strategy is proposed wherein the gradient descent search is parallelized at the lower level and the stochastic optimization algorithm is simultaneously parallelized at the higher level. Implementation and experimental results on a distributed-memory multiprocessor cluster running the parallel virtual machine (PVM) environment are presented using simulated and real hybridization data.


Assuntos
Mapeamento Físico do Cromossomo/métodos , Algoritmos , Simulação por Computador , Genoma Fúngico , Funções Verossimilhança , Modelos Estatísticos , Modelos Teóricos , Neurospora crassa/genética , Hibridização de Ácido Nucleico , Software , Fatores de Tempo
6.
Genetics ; 157(3): 1045-56, 2001 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-11238393

RESUMO

A contig map is a physical map that shows the native order of a library of overlapping genomic clones. One common method for creating such maps involves using hybridization to detect clone overlaps. False- positive and false-negative hybridization errors, the presence of chimeric clones, and gaps in library coverage lead to ambiguity and error in the clone order. Genomes with good genetic maps, such as Neurospora crassa, provide a means for reducing ambiguities and errors when constructing contig maps if clones can be anchored with genetic markers to the genetic map. A software application called ODS2 for creating contig maps based on clone-clone hybridization data is presented. This application is also designed to exploit partial ordering information provided by anchorage of clones to a genetic map. This information, along with clone-clone hybridization data, is used by a clone ordering algorithm and is represented graphically, allowing users to interactively align physical and genetic maps. ODS2 has a graphical user interface and is implemented entirely in Java, so it runs on multiple platforms. Other features include the flexibility of storing data in a local file or relational database and the ability to create full or minimum tiling contig maps.


Assuntos
Mapeamento Cromossômico/métodos , Mapeamento Físico do Cromossomo/métodos , Software , Algoritmos , Clonagem Molecular , Simulação por Computador , Mapeamento de Sequências Contíguas , Cosmídeos , Biblioteca Gênica , Ligação Genética , Neurospora crassa/genética , Hibridização de Ácido Nucleico
7.
Comput Appl Biosci ; 12(4): 269-80, 1996 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-8902353

RESUMO

A suite of parallel algorithms for ordering DNA sequences (termed PARODS) is presented. The algorithms in PARODS are based on an earlier serial algorithm, ODS, which is a physical mapping algorithm based on simulated annealing. Parallel algorithms for simulated annealing based on Markov chain decomposition are proposed and applied to the problem of physical mapping. Perturbation methods and problem-specific annealing heuristics are proposed and described. Implementations of parallel Single Instruction Multiple Data (SIMD) algorithms on a 2048 processor MasPar MP-2 system and implementations of parallel Multiple Instruction Multiple Data (MIMD) algorithms on an 8 processor Intel iPSC/860 system are presented. The convergence, speedup and scalability characteristics of the aforementioned algorithms are analyzed and discussed. The best SIMD algorithm is shown to have a speedup of approximately 1000 on the 2048 processor MasPar MP-2 system, whereas the best MIMD algorithm is shown to have a speedup of approximately 5 on the 8 processor Intel iPSC/860 system.


Assuntos
Algoritmos , DNA/genética , Análise de Sequência de DNA/métodos , Aspergillus nidulans/genética , Simulação por Computador , Computadores , DNA Fúngico/genética , Humanos , Cadeias de Markov , Hibridização de Ácido Nucleico , Análise de Sequência de DNA/estatística & dados numéricos
8.
Pac Symp Biocomput ; : 85-96, 1996.
Artigo em Inglês | MEDLINE | ID: mdl-9390225

RESUMO

Ordering clones from a genomic library into physical maps of whole chromosomes presents a central computational problem in genetics. Chromosome reconstruction via clone ordering is shown to be isomorphic to the NP-complete Optimal Linear Ordering problem. Massively parallel algorithms for simulated annealing based on Markov chain distribution are proposed and applied to this problem. Perturbation methods and problem-specific annealing heuristics are proposed and described. Experimental results on a 2048 processor MasPar MP-2 system are presented. Convergence, speedup and scalability characteristics of the various algorithms are analyzed and discussed.


Assuntos
Algoritmos , Cromossomos/química , Simulação por Computador , DNA/química , Biblioteca Genômica , Software , Clonagem Molecular , Técnicas Genéticas , Cadeias de Markov
9.
J Comput Biol ; 3(4): 503-28, 1996.
Artigo em Inglês | MEDLINE | ID: mdl-9018601

RESUMO

Ordering clones from a genomic library into physical maps of whole chromosomes presents a central computational problem in genetics. Chromosome reconstruction via clone ordering is usually isomorphic to the NP-complete Optimal Linear Arrangement problem. Parallel SIMD and MIMD algorithms for simulated annealing based on Markov chain distribution are proposed and applied to the problem of chromosome reconstruction via clone ordering. Perturbation methods and problem-specific annealing heuristics are proposed and described. The SIMD algorithms are implemented on a 2048 processor MasPar MP-2 system which is an SIMD 2-D toroidal mesh architecture whereas the MIMD algorithms are implemented on an 8 processor Intel iPSC/860 which is an MIMD hypercube architecture. A comparative analysis of the various SIMD and MIMD algorithms is presented in which the convergence, speedup, and scalability characteristics of the various algorithms are analyzed and discussed. On a fine-grained, massively parallel SIMD architecture with a low synchronization overhead such as the MasPar MP-2, a parallel simulated annealing algorithm based on multiple periodically interacting searches performs the best. For a coarse-grained MIMD architecture with high synchronization overhead such as the Intel iPSC/860, a parallel simulated annealing algorithm based on multiple independent searches yields the best results. In either case, distribution of clonal data across multiple processors is shown to exacerbate the tendency of the parallel simulated annealing algorithm to get trapped in a local optimum.


Assuntos
Algoritmos , Mapeamento Cromossômico/métodos , Simulação por Computador , Cadeias de Markov
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...