1.
IEEE Trans Image Process ; 32: 3552-3566, 2023.
Article in English | MEDLINE | ID: mdl-37342944

ABSTRACT

Because different social media platforms degrade image quality and natural-scene text appears in arbitrary languages, detecting text in social media images and transferring its style is challenging. This paper presents a novel end-to-end model for text detection and text style transfer in social media images. The key idea of the proposed work is to find dominant information, such as fine details in degraded social media images, and then restore the structure of the character information. We therefore first introduce a novel idea of extracting gradients from the frequency domain of the input image to reduce the adverse effects of different social media platforms, which yields text candidate points. The text candidates are then connected into components and used for text detection via a UNet++-like network with an EfficientNet backbone (EffiUNet++). To address style transfer, we devise a generative model comprising a target encoder and style parameter networks (TESP-Net) that generates the target characters by leveraging the recognition results from the first stage. Specifically, a series of residual mappings and a position attention module are devised to improve the shape and structure of the generated characters. The whole model is trained end-to-end to optimize overall performance. Experiments on our social media dataset and on benchmark datasets for natural scene text detection and text style transfer show that the proposed model outperforms existing text detection and style transfer methods in multilingual and cross-language scenarios.
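As a rough illustration of the frequency-domain gradient idea described above, the following NumPy sketch (our own approximation, not the authors' implementation) suppresses low frequencies of the input image, takes the gradient magnitude of the high-pass result, and keeps the strongest responses as text candidate points; the cutoff radius and keep ratio are placeholder values.

```python
import numpy as np

def frequency_domain_gradients(gray, cutoff=10, keep_ratio=0.05):
    """Return a boolean mask of candidate text points for a 2-D grayscale array."""
    # Move to the frequency domain and zero out low frequencies, which carry
    # smooth background and compression artefacts rather than stroke detail.
    spectrum = np.fft.fftshift(np.fft.fft2(gray.astype(np.float64)))
    rows, cols = gray.shape
    y, x = np.ogrid[:rows, :cols]
    dist = np.hypot(y - rows / 2, x - cols / 2)
    spectrum[dist < cutoff] = 0                     # crude high-pass filter
    high_pass = np.real(np.fft.ifft2(np.fft.ifftshift(spectrum)))

    # Gradient magnitude of the high-pass image highlights stroke edges.
    gy, gx = np.gradient(high_pass)
    magnitude = np.hypot(gx, gy)

    # Keep only the strongest responses as candidate text points.
    threshold = np.quantile(magnitude, 1.0 - keep_ratio)
    return magnitude >= threshold
```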


Subject(s)
Social Media, Humans, Language, Benchmarking
2.
Multimed Tools Appl ; : 1-35, 2023 May 10.
Article in English | MEDLINE | ID: mdl-37362739

ABSTRACT

Several successive waves of COVID-19, driven by emerging variants, have caused a massive loss of human life worldwide. Several researchers have proposed neural-network-based drug discovery techniques to fight the pandemic, but neural networks have limitations (exponential time complexity, non-convergence, mode collapse, and diminishing gradients). To overcome these difficulties, this paper proposes a hybrid architecture that helps repurpose the most appropriate medicines for the treatment of COVID-19. A brief investigation of the sequences is made to determine gene density and the noncoding proportion through next-generation sequencing. The paper tracks exceptional locales in the viral DNA sequence as a Drug Target Region (DTR). A variable DNA neighborhood search is then applied to this DTR to obtain a DNA interaction network showing how the genes are correlated. A drug database is obtained based on the ontological properties of the genomes with advanced D3Similarity, so that all chemical components of the drug database are identified. Other methods identified hydroxychloroquine as an effective drug, which was rejected by the WHO. In contrast, the experimental results show that Remdesivir and Dexamethasone are the most effective drugs, with scores of 97.41% and 97.93%, respectively.

3.
Article in English | MEDLINE | ID: mdl-36129871

ABSTRACT

Detecting forged handwriting is important in a wide variety of machine learning applications, and it is challenging when the input images are degraded by noise and blur. This article presents a new model based on conformable moments (CMs) and deep ensemble neural networks (DENNs) for forged handwriting detection in noisy and blurry environments. Since CMs involve fractional calculus, which can model nonlinearities and geometrical moments while preserving spatial relationships between pixels, fine details in images are retained. This motivates us to introduce a DENN classifier, which integrates stenographic kernels and spatial features to classify input images as normal (original, clean images), altered (handwriting changed through copy-paste and insertion operations), noisy (noise added to the original image), blurred (blur added to the original image), altered-noise (noise added to the altered image), or altered-blurred (blur added to the altered image). To evaluate our model, we use a newly introduced dataset comprising handwritten words altered at the character level, as well as several standard datasets, namely ACPR 2019, ICPR 2018-FDC, and the IMEI dataset. The first two of these datasets include handwriting samples altered at the character and word levels, and the third comprises forged International Mobile Equipment Identity (IMEI) numbers. Experimental results demonstrate that the proposed method outperforms existing methods in terms of classification rate.
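The ensemble half of such a classifier can be sketched generically; the snippet below (our own illustration, with hypothetical function names) averages the softmax outputs of several member networks and reports one of the six classes named in the abstract. The conformable-moment feature extraction and the member networks themselves are assumed to exist and are not shown.

```python
import numpy as np

CLASSES = ["normal", "altered", "noisy", "blurred", "altered-noise", "altered-blurred"]

def softmax(scores):
    scores = scores - scores.max(axis=-1, keepdims=True)
    exp = np.exp(scores)
    return exp / exp.sum(axis=-1, keepdims=True)

def ensemble_predict(members, image):
    """members: list of callables mapping an image to raw class scores (logits)."""
    # Average the members' softmax probabilities and pick the top class.
    probs = np.mean([softmax(member(image)) for member in members], axis=0)
    return CLASSES[int(np.argmax(probs))], probs
```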

4.
PLoS One ; 13(7): e0200912, 2018.
Article in English | MEDLINE | ID: mdl-30048486

ABSTRACT

Exact pattern matching algorithms are popular and widely used in applications such as molecular biology, text processing, image processing, web search engines, network intrusion detection systems, and operating systems. These algorithms focus on time efficiency for their target applications but not on memory consumption. In this work, we propose a novel idea to achieve both time efficiency and low memory consumption by splitting the query string when searching a corpus. For a given text, the proposed algorithm splits the query pattern into two equal halves and uses the second (right) half as the query string for searching the corpus. Once a match for the right half is found, the algorithm applies a brute-force procedure to match the remaining (left) half by referring to the location of the right half. Experimental results on different S1 Dataset text databases, namely Arabic, English, Chinese, Italian, and French, show that the proposed algorithm outperforms the existing S1 Algorithm in terms of both time efficiency and memory consumption as the length of the query pattern increases.
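The split-half idea is concrete enough to sketch directly from the description above. The function below (names are ours, not the paper's) finds each occurrence of the right half of the pattern and then verifies the left half by brute-force comparison at the implied offset.

```python
def split_half_search(text, pattern):
    """Return the start indices of all exact occurrences of pattern in text."""
    mid = len(pattern) // 2
    left, right = pattern[:mid], pattern[mid:]
    matches = []
    pos = text.find(right)                 # locate the right half first
    while pos != -1:
        start = pos - mid                  # where the full pattern would begin
        # Brute-force verification of the left half at the implied offset.
        if start >= 0 and text[start:pos] == left:
            matches.append(start)
        pos = text.find(right, pos + 1)
    return matches

# Example: split_half_search("abracadabra", "cadab") returns [4].
```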


Subject(s)
Algorithms, Data Mining/methods, Software
5.
IEEE Trans Image Process ; 25(12): 5622-5634, 2016 12.
Article in English | MEDLINE | ID: mdl-27623587

ABSTRACT

Text recognition in video and natural scene images, which is much more challenging than recognition in plain-background images, has gained significant attention in image processing and many computer vision applications. In this paper, we aim to restore complete character contours in video/scene images from gray values, in contrast to conventional techniques that take edge images or binary information as inputs for text detection and recognition. We explore and utilize the strengths of the zero crossing points given by the Laplacian to identify stroke candidate pixels (SPC). For each SPC pair, we propose new symmetry features based on gradient magnitude and Fourier phase angles to identify probable stroke candidate pairs (PSCP). The same symmetry properties are applied at the PSCP level to choose seed stroke candidate pairs (SSCP). Finally, an iterative algorithm is proposed for the SSCP to restore complete character contours. Experimental results on benchmark databases, namely the ICDAR family of video and natural scene datasets, Street View Data, and the MSRA datasets, show that the proposed technique outperforms existing techniques in terms of both quality measures and recognition rate. We also show that character contour restoration is effective for text detection in video and natural scene images.
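As a minimal sketch of the first step, assuming SciPy's Laplacian filter as a stand-in for the paper's operator, the snippet below flags pixels where the Laplacian changes sign between adjacent pixels as stroke candidate pixels; the symmetry tests on gradient magnitude and Fourier phase that pair and filter these candidates are not reproduced.

```python
import numpy as np
from scipy.ndimage import laplace

def stroke_candidate_pixels(gray):
    """Return a boolean mask of Laplacian zero-crossing pixels."""
    lap = laplace(gray.astype(np.float64))
    zero_cross = np.zeros(lap.shape, dtype=bool)
    # A zero crossing occurs where the Laplacian changes sign between
    # horizontally or vertically adjacent pixels.
    zero_cross[:, :-1] |= (lap[:, :-1] * lap[:, 1:]) < 0
    zero_cross[:-1, :] |= (lap[:-1, :] * lap[1:, :]) < 0
    return zero_cross
```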

6.
IEEE Trans Image Process ; 24(11): 4488-501, 2015 Nov.
Article in English | MEDLINE | ID: mdl-26259083

ABSTRACT

Scene text detection from video as well as natural scene images is challenging due to variations in background, contrast, text type, font type, font size, and so on. Arbitrary orientations of multi-script text add further complexity to the problem. The proposed approach introduces a new idea of convolving the Laplacian with wavelet sub-bands at different levels in the frequency domain to enhance low-resolution text pixels. The results obtained from the different (spectral) sub-bands are then fused to detect candidate text pixels. We explore maximally stable extremal regions along with the stroke width transform to detect candidate text regions. Text alignment is performed based on the distance between nearest-neighbor clusters of candidate text regions. In addition, the approach presents a new symmetry-driven nearest-neighbor approach for restoring full text lines. We conduct experiments on our collected video data as well as several benchmark datasets, such as ICDAR 2011, ICDAR 2013, and MSRA-TD500, to evaluate the proposed method. The proposed approach is compared with state-of-the-art methods to show its superiority over existing methods.
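A one-level sketch of the enhancement step, under our own simplifying assumptions (Haar wavelet, a single decomposition level, a spatial rather than frequency-domain Laplacian, and fusion by simple averaging):

```python
import numpy as np
import pywt
from scipy.ndimage import laplace

def enhance_text_pixels(gray):
    """Fuse Laplacian responses of one-level wavelet sub-bands into a [0, 1] map."""
    # One-level 2-D wavelet decomposition: approximation plus three detail bands.
    cA, (cH, cV, cD) = pywt.dwt2(gray.astype(np.float64), "haar")
    # Apply the Laplacian to every sub-band and fuse the absolute responses.
    responses = [np.abs(laplace(band)) for band in (cA, cH, cV, cD)]
    fused = np.mean(responses, axis=0)
    # Normalise so the map can be thresholded for candidate text pixels.
    return (fused - fused.min()) / (fused.max() - fused.min() + 1e-12)
```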

7.
IEEE Trans Pattern Anal Mach Intell ; 33(2): 412-9, 2011 Feb.
Article in English | MEDLINE | ID: mdl-20733217

ABSTRACT

In this paper, we propose a method based on the Laplacian in the frequency domain for video text detection. Unlike many approaches that assume text is horizontally oriented, our method can handle text of arbitrary orientation. The input image is first filtered with the Fourier-Laplacian. K-means clustering is then used to identify candidate text regions based on the maximum difference. The skeleton of each connected component helps to separate the different text strings from each other. Finally, text string straightness and edge density are used to eliminate false positives. Experimental results show that the proposed method handles both graphics text and scene text in horizontal and non-horizontal orientations.
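A rough sketch of the first two steps, substituting a spatial Laplacian for the paper's frequency-domain Fourier-Laplacian filter and using scikit-learn's K-means with k = 2 to separate high-response pixels (text candidates) from the rest; the skeleton analysis and false-positive elimination are not shown.

```python
import numpy as np
from scipy.ndimage import laplace
from sklearn.cluster import KMeans

def candidate_text_mask(gray):
    """Return a boolean mask of candidate text pixels for a 2-D grayscale array."""
    # Laplacian response: text strokes produce strong positive/negative peaks.
    response = np.abs(laplace(gray.astype(np.float64)))
    values = response.reshape(-1, 1)
    labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(values)
    # The cluster with the higher mean response corresponds to text candidates.
    text_label = int(np.argmax([values[labels == k].mean() for k in (0, 1)]))
    return (labels == text_label).reshape(gray.shape)
```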
