Your browser doesn't support javascript.
Fuzzy Information Discrimination Measures and Their Application to Low Dimensional Embedding Construction in the UMAP Algorithm.
Demidova, Liliya A; Gorchakov, Artyom V.
  • Demidova LA; Institute of Information Technologies, Federal State Budget Educational Institution of Higher Education "MIREA-Russian Technological University", 78, Vernadsky Avenue, 119454 Moscow, Russia.
  • Gorchakov AV; Institute of Information Technologies, Federal State Budget Educational Institution of Higher Education "MIREA-Russian Technological University", 78, Vernadsky Avenue, 119454 Moscow, Russia.
J Imaging ; 8(4)2022 Apr 15.
Article in English | MEDLINE | ID: covidwho-1809975
ABSTRACT
Dimensionality reduction techniques are often used by researchers in order to make high dimensional data easier to interpret visually, as data visualization is only possible in low dimensional spaces. Recent research in nonlinear dimensionality reduction introduced many effective algorithms, including t-distributed stochastic neighbor embedding (t-SNE), uniform manifold approximation and projection (UMAP), dimensionality reduction technique based on triplet constraints (TriMAP), and pairwise controlled manifold approximation (PaCMAP), aimed to preserve both the local and global structure of high dimensional data while reducing the dimensionality. The UMAP algorithm has found its application in bioinformatics, genetics, genomics, and has been widely used to improve the accuracy of other machine learning algorithms. In this research, we compare the performance of different fuzzy information discrimination measures used as loss functions in the UMAP algorithm while constructing low dimensional embeddings. In order to achieve this, we derive the gradients of the considered losses analytically and employ the Adam algorithm during the loss function optimization process. From the conducted experimental studies we conclude that the use of either the logarithmic fuzzy cross entropy loss without reduced repulsion or the symmetric logarithmic fuzzy cross entropy loss with sufficiently large neighbor count leads to better global structure preservation of the original multidimensional data when compared to the loss function used in the original UMAP algorithm implementation.
Keywords

Full text: Available Collection: International databases Database: MEDLINE Type of study: Experimental Studies / Randomized controlled trials Language: English Year: 2022 Document Type: Article Affiliation country: Jimaging8040113

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Type of study: Experimental Studies / Randomized controlled trials Language: English Year: 2022 Document Type: Article Affiliation country: Jimaging8040113