Search | VHL Regional Portal

DermSynth3D: Synthesis of in-the-wild annotated dermatology images.

Sinha, Ashish; Kawahara, Jeremy; Pakzad, Arezou; Abhishek, Kumar; Ruthven, Matthieu; Ghorbel, Enjie; Kacem, Anis; Aouada, Djamila; Hamarneh, Ghassan.

Med Image Anal ; 95: 103145, 2024 Jul.

Article in English | MEDLINE | ID: mdl-38615432

ABSTRACT

In recent years, deep learning (DL) has shown great potential in the field of dermatological image analysis. However, existing datasets in this domain have significant limitations, including a small number of image samples, limited disease conditions, insufficient annotations, and non-standardized image acquisitions. To address these shortcomings, we propose a novel framework called DermSynth3D. DermSynth3D blends skin disease patterns onto 3D textured meshes of human subjects using a differentiable renderer and generates 2D images from various camera viewpoints under chosen lighting conditions in diverse background scenes. Our method adheres to top-down rules that constrain the blending and rendering process to create 2D images with skin conditions that mimic in-the-wild acquisitions, ensuring more meaningful results. The framework generates photo-realistic 2D dermatological images and the corresponding dense annotations for semantic segmentation of the skin, skin conditions, body parts, bounding boxes around lesions, depth maps, and other 3D scene parameters, such as camera position and lighting conditions. DermSynth3D allows for the creation of custom datasets for various dermatology tasks. We demonstrate the effectiveness of data generated using DermSynth3D by training DL models on synthetic data and evaluating them on various dermatology tasks using real 2D dermatological images. We make our code publicly available at https://github.com/sfu-mial/DermSynth3D.

Subject(s)

Skin Diseases , Humans , Skin Diseases/diagnostic imaging , Imaging, Three-Dimensional/methods , Deep Learning , Image Interpretation, Computer-Assisted/methods

Dynamic Facial Expression Generation on Hilbert Hypersphere With Conditional Wasserstein Generative Adversarial Nets.

Otberdout, Naima; Daoudi, Mohamed; Kacem, Anis; Ballihi, Lahoucine; Berretti, Stefano.

IEEE Trans Pattern Anal Mach Intell ; 44(2): 848-863, 2022 02.

Article in English | MEDLINE | ID: mdl-32750786

ABSTRACT

In this work, we propose a novel approach for generating videos of the six basic facial expressions given a neutral face image. We propose to exploit the face geometry by modeling the facial landmarks motion as curves encoded as points on a hypersphere. By proposing a conditional version of manifold-valued Wasserstein generative adversarial network (GAN) for motion generation on the hypersphere, we learn the distribution of facial expression dynamics of different classes, from which we synthesize new facial expression motions. The resulting motions can be transformed to sequences of landmarks and then to images sequences by editing the texture information using another conditional Generative Adversarial Network. To the best of our knowledge, this is the first work that explores manifold-valued representations with GAN to address the problem of dynamic facial expression generation. We evaluate our proposed approach both quantitatively and qualitatively on two public datasets; Oulu-CASIA and MUG Facial Expression. Our experimental results demonstrate the effectiveness of our approach in generating realistic videos with continuous motion, realistic appearance and identity preservation. We also show the efficiency of our framework for dynamic facial expressions generation, dynamic facial expression transfer and data augmentation for training improved emotion recognition models.

Subject(s)

Facial Expression , Neural Networks, Computer , Algorithms , Face , Motion

A Novel Geometric Framework on Gram Matrix Trajectories for Human Behavior Understanding.

Kacem, Anis; Daoudi, Mohamed; Amor, Boulbaba Ben; Berretti, Stefano; Alvarez-Paiva, Juan Carlos.

IEEE Trans Pattern Anal Mach Intell ; 42(1): 1-14, 2020 01.

Article in English | MEDLINE | ID: mdl-30281437

ABSTRACT

In this paper, we propose a novel space-time geometric representation of human landmark configurations and derive tools for comparison and classification. We model the temporal evolution of landmarks as parametrized trajectories on the Riemannian manifold of positive semidefinite matrices of fixed-rank. Our representation has the benefit to bring naturally a second desirable quantity when comparing shapes-the spatial covariance-in addition to the conventional affine-shape representation. We derived then geometric and computational tools for rate-invariant analysis and adaptive re-sampling of trajectories, grounding on the Riemannian geometry of the underlying manifold. Specifically, our approach involves three steps: (1) landmarks are first mapped into the Riemannian manifold of positive semidefinite matrices of fixed-rank to build time-parameterized trajectories; (2) a temporal warping is performed on the trajectories, providing a geometry-aware (dis-)similarity measure between them; (3) finally, a pairwise proximity function SVM is used to classify them, incorporating the (dis-)similarity measure into the kernel function. We show that such representation and metric achieve competitive results in applications as action recognition and emotion recognition from 3D skeletal data, and facial expression recognition from videos. Experiments have been conducted on several publicly available up-to-date benchmarks.

Subject(s)

Image Processing, Computer-Assisted/methods , Movement/physiology , Pattern Recognition, Automated/methods , Anatomic Landmarks/diagnostic imaging , Databases, Factual , Emotions/physiology , Humans , Support Vector Machine , Video Recording

Automatic Analysis of Facial Expressions Based on Deep Covariance Trajectories.

Otberdout, Naima; Kacem, Anis; Daoudi, Mohamed; Ballihi, Lahoucine; Berretti, Stefano.

IEEE Trans Neural Netw Learn Syst ; 31(10): 3892-3905, 2020 10.

Article in English | MEDLINE | ID: mdl-31725395

ABSTRACT

In this article, we propose a new approach for facial expression recognition (FER) using deep covariance descriptors. The solution is based on the idea of encoding local and global deep convolutional neural network (DCNN) features extracted from still images, in compact local and global covariance descriptors. The space geometry of the covariance matrices is that of symmetric positive definite (SPD) matrices. By conducting the classification of static facial expressions using a support vector machine (SVM) with a valid Gaussian kernel on the SPD manifold, we show that deep covariance descriptors are more effective than the standard classification with fully connected layers and softmax. Besides, we propose a completely new and original solution to model the temporal dynamic of facial expressions as deep trajectories on the SPD manifold. As an extension of the classification pipeline of covariance descriptors, we apply SVM with valid positive definite kernels derived from global alignment for deep covariance trajectories classification. By performing extensive experiments on the Oulu-CASIA, CK+, static facial expression in the wild (SFEW), and acted facial expressions in the wild (AFEW) data sets, we show that both the proposed static and dynamic approaches achieve the state-of-the-art performance for FER outperforming many recent approaches.

Detecting Depression Severity by Interpretable Representations of Motion Dynamics.

Kacem, Anis; Hammal, Zakia; Daoudi, Mohamed; Cohn, Jeffrey.

Proc Int Conf Autom Face Gesture Recognit ; 2018: 739-745, 2018 May.

Article in English | MEDLINE | ID: mdl-30271308

ABSTRACT

Recent breakthroughs in deep learning using automated measurement of face and head motion have made possible the first objective measurement of depression severity. While powerful, deep learning approaches lack interpretability. We developed an interpretable method of automatically measuring depression severity that uses barycentric coordinates of facial landmarks and a Lie-algebra based rotation matrix of 3D head motion. Using these representations, kinematic features are extracted, preprocessed, and encoded using Gaussian Mixture Models (GMM) and Fisher vector encoding. A multi-class SVM is used to classify the encoded facial and head movement dynamics into three levels of depression severity. The proposed approach was evaluated in adults with history of chronic depression. The method approached the classification accuracy of state-of-the-art deep learning while enabling clinically and theoretically relevant findings. The velocity and acceleration of facial movement strongly mapped onto depression severity symptoms consistent with clinical data and theory.

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL