Search | BVS Regional Portal

Approximate Graph Edit Distance in Quadratic Time.

Riesen, Kaspar; Ferrer, Miquel; Bunke, Horst.

IEEE/ACM Trans Comput Biol Bioinform ; 17(2): 483-494, 2020.

Article in English | MEDLINE | ID: mdl-26390496

ABSTRACT

Graph edit distance is one of the most flexible and general graph matching models available. The major drawback of graph edit distance, however, is its computational complexity that restricts its applicability to graphs of rather small size. Recently, the authors of the present paper introduced a general approximation framework for the graph edit distance problem. The basic idea of this specific algorithm is to first compute an optimal assignment of independent local graph structures (including substitutions, deletions, and insertions of nodes and edges). This optimal assignment is complete and consistent with respect to the involved nodes of both graphs and can thus be used to instantly derive an admissible (yet suboptimal) solution for the original graph edit distance problem in O(n^3) time. For large scale graphs or graph sets, however, the cubic time complexity may still be too high. Therefore, we propose to use suboptimal algorithms with quadratic rather than cubic time for solving the basic assignment problem. In particular, the present paper introduces five different greedy assignment algorithms in the context of graph edit distance approximation. In an experimental evaluation, we show that these methods have great potential for further speeding up the computation of graph edit distance while the approximated distances remain sufficiently accurate for graph based pattern classification.

Subjects

Algorithms; Computational Biology/methods; Pattern Recognition, Automated/methods
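As a rough illustration of the quadratic-time idea in the abstract above, the following sketch performs a simple greedy node assignment between two labeled graphs. It is a simplified stand-in and not necessarily any of the five greedy variants the paper introduces: the function name `greedy_node_assignment`, the unit deletion and insertion costs, and the label-based substitution cost are all assumptions made for this example, and edge costs are left out entirely.

```python
def greedy_node_assignment(labels_a, labels_b, sub_cost, del_cost=1.0, ins_cost=1.0):
    """Greedy node assignment between two node-label lists (hypothetical helper).

    Each node of graph A takes the cheapest still-unused node of graph B,
    or is deleted when no substitution is cheaper than del_cost.
    Leftover nodes of B are inserted. Runs in O(n * m) time.
    Only node costs are counted; edge costs are omitted in this sketch.
    """
    used = [False] * len(labels_b)
    mapping = []          # mapping[i] is an index into B, or None for a deletion
    total = 0.0
    for a in labels_a:
        best_j, best_c = None, del_cost
        for j, b in enumerate(labels_b):
            if not used[j]:
                c = sub_cost(a, b)
                if c < best_c:
                    best_j, best_c = j, c
        if best_j is not None:
            used[best_j] = True
        mapping.append(best_j)
        total += best_c
    # Nodes of B that were never matched must be inserted.
    inserted = [j for j, u in enumerate(used) if not u]
    total += ins_cost * len(inserted)
    return mapping, inserted, total


def label_cost(a, b):
    # Assumed cost model: identical labels are free, different labels cost 2.
    return 0.0 if a == b else 2.0


mapping, inserted, total = greedy_node_assignment(["C", "N", "O"], ["N", "C"], label_cost)
print(mapping, inserted, total)
```

Because each row commits to its locally cheapest choice and never revisits it, the result is an admissible but possibly suboptimal mapping, which is the trade-off the abstract describes against an optimal cubic-time assignment.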

A novel word spotting method based on recurrent neural networks.

Frinken, Volkmar; Fischer, Andreas; Manmatha, R; Bunke, Horst.

IEEE Trans Pattern Anal Mach Intell ; 34(2): 211-24, 2012 Feb.

Article in English | MEDLINE | ID: mdl-21646681

ABSTRACT

Keyword spotting refers to the process of retrieving all instances of a given keyword from a document. In the present paper, a novel keyword spotting method for handwritten documents is described. It is derived from a neural network-based system for unconstrained handwriting recognition. As such it performs template-free spotting, i.e., it is not necessary for a keyword to appear in the training set. The keyword spotting is done using a modification of the CTC Token Passing algorithm in conjunction with a recurrent neural network. We demonstrate that the proposed systems outperform not only a classical dynamic time warping-based approach but also a modern keyword spotting system, based on hidden Markov models. Furthermore, we analyze the performance of the underlying neural networks when using them in a recognition task followed by keyword spotting on the produced transcription. We point out the advantages of keyword spotting when compared to classic text line recognition.

Novel kernels for error-tolerant graph classification.

Neuhaus, Michel; Riesen, Kaspar; Bunke, Horst.

Spat Vis ; 22(5): 425-41, 2009.

Article in English | MEDLINE | ID: mdl-19814905

ABSTRACT

One of the major difficulties in graph classification is the lack of mathematical structure in the space of graphs. The use of kernel machines allows us to overcome this fundamental limitation in an elegant manner by addressing the pattern recognition problem in an implicitly existing feature vector space instead of the original space of graphs. In this paper we propose three novel error-tolerant graph kernels -- a diffusion kernel, a convolution kernel, and a random walk kernel. The kernels are closely related to one of the most flexible graph matching methods, graph edit distance. Consequently, our kernels are applicable to virtually any kind of graph. They also show a high degree of robustness against various types of distortion. In an experimental evaluation involving the classification of line drawings, images, diatoms, fingerprints, and molecules, we demonstrate the superior performance of the proposed kernels in conjunction with support vector machines over a standard nearest-neighbor reference method and several other graph kernels including a standard random walk kernel.

Subjects

Computer Graphics/classification; Distance Perception/physiology; Models, Theoretical; Pattern Recognition, Visual/physiology; Space Perception/physiology; Humans

Graph classification by means of Lipschitz embedding.

Riesen, Kaspar; Bunke, Horst.

IEEE Trans Syst Man Cybern B Cybern ; 39(6): 1472-83, 2009 Dec.

Article in English | MEDLINE | ID: mdl-19447721

ABSTRACT

In pattern recognition and related fields, graph-based representations offer a versatile alternative to the widely used feature vectors. Therefore, an emerging trend of representing objects by graphs can be observed. This trend is intensified by the development of novel approaches in graph-based machine learning, such as graph kernels or graph-embedding techniques. These procedures overcome a major drawback of graphs, which consists of a serious lack of algorithms for classification. This paper is inspired by the idea of representing graphs through dissimilarities and extends our previous work to the more general setting of Lipschitz embeddings. In an experimental evaluation, we empirically confirm that classifiers that rely on the original graph distances can be outperformed by a classification system using the Lipschitz embedded graphs.
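To make the embedding idea in the abstract above concrete, here is a minimal sketch of a Lipschitz-style embedding: an object is mapped to a vector whose i-th coordinate is its distance to the i-th reference set, taken here as the minimum distance to any member of that set. The function name `lipschitz_embed`, the choice of the minimum, and the toy edge-set distance (a crude stand-in for graph edit distance) are assumptions for illustration, not details taken from the paper.

```python
def lipschitz_embed(obj, reference_sets, dist):
    """Map obj to a vector of distances to each reference set.

    Coordinate i is min over members r of reference_sets[i] of dist(obj, r).
    Every reference set is assumed to be non-empty.
    """
    return [min(dist(obj, r) for r in ref_set) for ref_set in reference_sets]


def edge_set_distance(g1, g2):
    # Toy stand-in for graph edit distance: number of edges present in
    # exactly one of the two graphs (size of the symmetric difference).
    return len(g1 ^ g2)


# Graphs are represented as frozensets of edges in this example.
g = frozenset({(1, 2), (2, 3)})
reference_sets = [
    [frozenset({(1, 2)}), frozenset({(1, 2), (2, 3), (3, 4)})],
    [frozenset({(5, 6)})],
]

vector = lipschitz_embed(g, reference_sets, edge_set_distance)
print(vector)
```

Once graphs are turned into fixed-length vectors like this, any standard vector-based classifier can be trained on them, which is the motivation the abstract gives for embedding graphs in the first place.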

A novel connectionist system for unconstrained handwriting recognition.

Graves, Alex; Liwicki, Marcus; Fernández, Santiago; Bertolami, Roman; Bunke, Horst; Schmidhuber, Jürgen.

IEEE Trans Pattern Anal Mach Intell ; 31(5): 855-68, 2009 May.

Article in English | MEDLINE | ID: mdl-19299860

ABSTRACT

Recognizing lines of unconstrained handwritten text is a challenging task. The difficulty of segmenting cursive or overlapping characters, combined with the need to exploit surrounding context, has led to low recognition rates for even the best current recognizers. Most recent progress in the field has been made either through improved preprocessing or through advances in language modeling. Relatively little work has been done on the basic recognition algorithms. Indeed, most systems rely on the same hidden Markov models that have been used for decades in speech and handwriting recognition, despite their well-known shortcomings. This paper proposes an alternative approach based on a novel type of recurrent neural network, specifically designed for sequence labeling tasks where the data is hard to segment and contains long-range bidirectional interdependencies. In experiments on two large unconstrained handwriting databases, our approach achieves word recognition accuracies of 79.7 percent on online data and 74.1 percent on offline data, significantly outperforming a state-of-the-art HMM-based system. In addition, we demonstrate the network's robustness to lexicon size, measure the individual influence of its hidden layers, and analyze its use of context. Last, we provide an in-depth discussion of the differences between the network and HMMs, suggesting reasons for the network's superior performance.

Subjects

Algorithms; Electronic Data Processing/methods; Handwriting; Image Interpretation, Computer-Assisted/methods; Information Storage and Retrieval/methods; Pattern Recognition, Automated/methods; Image Enhancement/methods; Models, Statistical; Reading; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique

Offline grammar-based recognition of handwritten sentences.

Zimmermann, Matthias; Chappelier, Jean-Cédric; Bunke, Horst.

IEEE Trans Pattern Anal Mach Intell ; 28(5): 818-21, 2006 May.

Article in English | MEDLINE | ID: mdl-16640266

ABSTRACT

This paper proposes a sequential coupling of a Hidden Markov Model (HMM) recognizer for offline handwritten English sentences with a probabilistic bottom-up chart parser using Stochastic Context-Free Grammars (SCFG) extracted from a text corpus. Based on extensive experiments, we conclude that syntax analysis helps to improve recognition rates significantly.

Subjects

Algorithms; Artificial Intelligence; Electronic Data Processing/methods; Handwriting; Image Interpretation, Computer-Assisted/methods; Information Storage and Retrieval/methods; Pattern Recognition, Automated/methods; Semantics; Image Enhancement/methods; Online Systems

Guest editors' introduction to the special section on syntactic and structural pattern recognition.

Basu, Mitra; Bunke, Horst; Del Bimbo, Alberto.

IEEE Trans Pattern Anal Mach Intell ; 27(7): 1009-12, 2005 Jul.

Article in English | MEDLINE | ID: mdl-16013749

Subjects

Algorithms; Artificial Intelligence; Cluster Analysis; Computational Biology/methods; Image Interpretation, Computer-Assisted/methods; Natural Language Processing; Pattern Recognition, Automated/methods; Computer Simulation; Information Storage and Retrieval/methods; Models, Statistical; Speech Recognition Software

Self-organizing maps for learning the edit costs in graph matching.

Neuhaus, Michel; Bunke, Horst.

IEEE Trans Syst Man Cybern B Cybern ; 35(3): 503-14, 2005 Jun.

Article in English | MEDLINE | ID: mdl-15971918

ABSTRACT

Although graph matching and graph edit distance computation have become areas of intensive research recently, the automatic inference of the cost of edit operations has remained an open problem. In the present paper, we address the issue of learning graph edit distance cost functions for numerically labeled graphs from a corpus of sample graphs. We propose a system of self-organizing maps (SOMs) that represent the distance measuring spaces of node and edge labels. Our learning process is based on the concept of self-organization. It adapts the edit costs in such a way that the similarity of graphs from the same class is increased, whereas the similarity of graphs from different classes decreases. The learning procedure is demonstrated on two different applications involving line drawing graphs and graphs representing diatoms, respectively.

Subjects

Algorithms; Artificial Intelligence; Computer Graphics; Diatoms/cytology; Image Interpretation, Computer-Assisted/methods; Information Storage and Retrieval/methods; Pattern Recognition, Automated/methods; Cluster Analysis; Computer Simulation; Image Enhancement/methods; Models, Biological; Numerical Analysis, Computer-Assisted; Reproducibility of Results; Sensitivity and Specificity; Signal Processing, Computer-Assisted; Subtraction Technique

Offline recognition of unconstrained handwritten texts using HMMs and statistical language models.

Vinciarelli, Alessandro; Bengio, Samy; Bunke, Horst.

IEEE Trans Pattern Anal Mach Intell ; 26(6): 709-20, 2004 Jun.

Article in English | MEDLINE | ID: mdl-18579932

ABSTRACT

This paper presents a system for the offline recognition of large vocabulary unconstrained handwritten texts. The only assumption made about the data is that it is written in English. This allows the application of Statistical Language Models in order to improve the performance of our system. Several experiments have been performed using both single and multiple writer data. Lexica of variable size (from 10,000 to 50,000 words) have been used. The use of language models is shown to improve the accuracy of the system (when the lexicon contains 50,000 words, the error rate is reduced by approximately 50 percent for single writer data and by approximately 25 percent for multiple writer data). Our approach is described in detail and compared with other methods presented in the literature to deal with the same problem. An experimental setup to correctly deal with unconstrained text recognition is proposed.

Subjects

Artificial Intelligence; Biometry/methods; Electronic Data Processing/methods; Handwriting; Image Interpretation, Computer-Assisted/methods; Information Storage and Retrieval/methods; Pattern Recognition, Automated/methods; Algorithms; Computer Graphics; Documentation; Image Enhancement/methods; Markov Chains; Models, Statistical; Numerical Analysis, Computer-Assisted; Reproducibility of Results; Sensitivity and Specificity; Signal Processing, Computer-Assisted; Subtraction Technique; User-Computer Interface
