1.
Sci Total Environ; 893: 164794, 2023 Oct 01.
Article in English | MEDLINE | ID: mdl-37315611

ABSTRACT

Cities in the developing world are expanding rapidly, and undergoing changes to their roads, buildings, vegetation, and other land use characteristics. Timely data are needed to ensure that urban change enhances health, wellbeing and sustainability. We present and evaluate a novel unsupervised deep clustering method to classify and characterise the complex and multidimensional built and natural environments of cities into interpretable clusters using high-resolution satellite images. We applied our approach to a high-resolution (0.3 m/pixel) satellite image of Accra, Ghana, one of the fastest growing cities in sub-Saharan Africa, and contextualised the results with demographic and environmental data that were not used for clustering. We show that clusters obtained solely from images capture distinct interpretable phenotypes of the urban natural (vegetation and water) and built (building count, size, density, and orientation; length and arrangement of roads) environment, and population, either as a unique defining characteristic (e.g., bodies of water or dense vegetation) or in combination (e.g., buildings surrounded by vegetation or sparsely populated areas intermixed with roads). Clusters that were based on a single defining characteristic were robust to the spatial scale of analysis and the choice of cluster number, whereas those based on a combination of characteristics changed based on scale and number of clusters. The results demonstrate that satellite data and unsupervised deep learning provide a cost-effective, interpretable and scalable approach for real-time tracking of sustainable urban development, especially where traditional environmental and demographic data are limited and infrequent.


Subject(s)
Deep Learning; Environment; Cities; Ghana
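
The abstract above describes pairing a deep feature extractor with an unsupervised grouping step over high-resolution satellite imagery. As a rough illustration of that general recipe (not the authors' specific architecture), the sketch below tiles a large image, embeds each tile with a pretrained CNN, and clusters the embeddings with k-means. The tile size, backbone, cluster count, and file name are all illustrative assumptions.

```python
# Minimal sketch of satellite-tile clustering, NOT the paper's exact method:
# tile a scene, embed each tile with a pretrained CNN, cluster the embeddings.
import numpy as np
import torch
from torchvision import models, transforms
from sklearn.cluster import KMeans
from PIL import Image

TILE = 224        # tile edge in pixels (assumption)
N_CLUSTERS = 10   # cluster count (assumption; the paper varies this)

# Pretrained backbone used as a fixed feature extractor.
backbone = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
backbone.fc = torch.nn.Identity()   # drop the classification head
backbone.eval()

preprocess = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

def tile_image(img: Image.Image, tile: int = TILE):
    """Yield non-overlapping square tiles covering the image."""
    w, h = img.size
    for y in range(0, h - tile + 1, tile):
        for x in range(0, w - tile + 1, tile):
            yield img.crop((x, y, x + tile, y + tile))

@torch.no_grad()
def embed_tiles(img: Image.Image) -> np.ndarray:
    """Stack one CNN embedding per tile into an (n_tiles, 512) array."""
    feats = [backbone(preprocess(t).unsqueeze(0)).squeeze(0).numpy()
             for t in tile_image(img)]
    return np.stack(feats)

# Usage (hypothetical file name):
# img = Image.open("accra_scene.tif").convert("RGB")
# labels = KMeans(n_clusters=N_CLUSTERS, n_init=10).fit_predict(embed_tiles(img))
```

Each tile then carries a cluster label that can be mapped back to its image coordinates and compared against external demographic or environmental data, as the paper does.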
2.
IEEE Trans Pattern Anal Mach Intell; 45(8): 9743-9756, 2023 Aug.
Article in English | MEDLINE | ID: mdl-37028333

ABSTRACT

We present Free-HeadGAN, a person-generic neural talking-head synthesis system. We show that modeling faces with sparse 3D facial landmarks is sufficient for achieving state-of-the-art generative performance, without relying on strong statistical priors of the face, such as 3D Morphable Models. Apart from 3D pose and facial expressions, our method is capable of fully transferring the eye gaze from a driving actor to a source identity. Our complete pipeline consists of three components: a canonical 3D key-point estimator that regresses 3D pose and expression-related deformations, a gaze estimation network, and a generator built upon the architecture of HeadGAN. We further experiment with an extension of our generator that accommodates few-shot learning through an attention mechanism when multiple source images are available. Compared to recent methods for reenactment and motion transfer, our system achieves higher photo-realism combined with superior identity preservation, while offering explicit gaze control.


Subject(s)
Algorithms; Face; Humans; Fixation, Ocular; Learning; Facial Expression
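
To make the three-component pipeline concrete, the sketch below wires together placeholder modules in the same roles: a keypoint estimator, a gaze network, and a generator conditioned on both. Every layer choice, tensor shape, and interface here is an illustrative assumption; the real Free-HeadGAN components are far larger and build on HeadGAN's architecture.

```python
# Structural sketch of a keypoints -> gaze -> generator reenactment pipeline.
# Module bodies and shapes are toy stand-ins, not Free-HeadGAN's actual code.
import torch
import torch.nn as nn

class KeypointEstimator(nn.Module):
    """Regress sparse 3D keypoints (pose/expression carrier) from an image."""
    def __init__(self, n_kp: int = 15):
        super().__init__()
        self.n_kp = n_kp
        self.net = nn.Sequential(
            nn.Conv2d(3, 32, 7, stride=2, padding=3), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(32, n_kp * 3),
        )

    def forward(self, img):                           # img: (B, 3, H, W)
        return self.net(img).view(-1, self.n_kp, 3)   # (B, n_kp, 3)

class GazeEstimator(nn.Module):
    """Predict a gaze direction (yaw, pitch) from a face crop."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 16, 5, stride=4, padding=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(16, 2),
        )

    def forward(self, img):
        return self.net(img)                          # (B, 2)

class Generator(nn.Module):
    """Render the source identity under the driver's keypoints and gaze."""
    def __init__(self, n_kp: int = 15):
        super().__init__()
        self.cond = nn.Linear(n_kp * 3 + 2, 3 * 64 * 64)

    def forward(self, source_img, kp, gaze):
        cond = torch.cat([kp.flatten(1), gaze], dim=1)
        residual = self.cond(cond).view(-1, 3, 64, 64)
        return torch.tanh(source_img + residual)

# Reenactment: keypoints and gaze come from the driver, identity from source.
kp_net, gaze_net, gen = KeypointEstimator(), GazeEstimator(), Generator()
source = torch.randn(1, 3, 64, 64)
driver = torch.randn(1, 3, 64, 64)
out = gen(source, kp_net(driver), gaze_net(driver))   # (1, 3, 64, 64)
```

Keeping gaze as a separate conditioning signal, rather than folding it into the keypoints, is what makes the explicit gaze control described in the abstract possible.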
3.
Neurooncol Adv; 2(1): vdaa110, 2020.
Article in English | MEDLINE | ID: mdl-33196039

ABSTRACT

BACKGROUND: Prognosis and treatment options for gliomas depend on tumor grade. When tissue is available for analysis, grade is established based on histological criteria. However, histopathological diagnosis is not always reliable or straightforward owing to tumor heterogeneity, sampling error, and subjectivity, leading to substantial interobserver variability. METHODS: We trained convolutional neural network models to classify digital whole-slide histopathology images from The Cancer Genome Atlas and evaluated several training choices, including data augmentation, batch size, and model configuration. RESULTS: Data augmentation did not improve model training, whereas a smaller batch size helped prevent overfitting and improved model performance. There was no significant difference in performance between a modular system of 2-class models and a single 3-class model. The best models achieved a mean accuracy of 73% in distinguishing glioblastoma from other grades and 53% in distinguishing WHO grade II from grade III gliomas. A visualization method was developed to convey the model output in a clinically relevant manner by overlaying color-coded predictions on the original whole-slide image. CONCLUSIONS: Our visualization method reflects the clinical decision-making process by highlighting intratumor heterogeneity and may be used in a clinical setting to aid diagnosis. Explainable artificial intelligence techniques may allow further evaluation of the models and highlight areas for improvement, such as biases. Because of intratumor heterogeneity, data annotation for training was imprecise, and performance was therefore lower than expected. The models may be further improved by employing advanced data augmentation strategies and by using more precise, semiautomatic or manually labeled training data.
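
The tile-wise classification and color-coded overlay described above can be sketched as follows. The classifier here is a dummy placeholder standing in for the trained CNN, and the tile size, class colors, blend factor, and file names are all illustrative assumptions.

```python
# Sketch of the overlay idea: classify each tile of a slide image and
# alpha-blend a per-class color over the original region.
import numpy as np
from PIL import Image

TILE = 256    # tile edge in pixels (assumption)
ALPHA = 0.4   # overlay opacity (assumption)
# Hypothetical color coding: grade II, grade III, glioblastoma.
CLASS_COLORS = {0: (0, 128, 255), 1: (255, 200, 0), 2: (255, 0, 0)}

def classify_tile(tile: np.ndarray) -> int:
    """Placeholder for the trained CNN; returns a class index in 0..2."""
    return int(tile.mean()) % 3   # dummy rule so the sketch runs end-to-end

def overlay_predictions(slide: Image.Image) -> Image.Image:
    """Paint each tile's predicted class over the slide as a translucent mask."""
    rgb = np.asarray(slide.convert("RGB"), dtype=np.float32)
    out = rgb.copy()
    h, w = rgb.shape[:2]
    for y in range(0, h - TILE + 1, TILE):
        for x in range(0, w - TILE + 1, TILE):
            tile = rgb[y:y + TILE, x:x + TILE]
            color = np.array(CLASS_COLORS[classify_tile(tile)], dtype=np.float32)
            out[y:y + TILE, x:x + TILE] = (1 - ALPHA) * tile + ALPHA * color
    return Image.fromarray(out.astype(np.uint8))

# Usage (hypothetical file names):
# overlay_predictions(Image.open("slide_region.png")).save("overlay.png")
```

Because each tile keeps its own prediction rather than being pooled into a single slide-level label, the resulting map makes intratumor heterogeneity directly visible, which is the clinical point of the visualization.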
