Search | VHL Regional Portal

1.

Neural Network Emulation of Synthetic Hyperspectral Sentinel-2-like Imagery with Uncertainty.

Morata, Miguel; Siegmann, Bastian; Pérez-Suay, Adrián; García-Soria, José Luis; Rivera-Caicedo, Juan Pablo; Verrelst, Jochem.

IEEE J Sel Top Appl Earth Obs Remote Sens ; 16: 762-772, 2023.

Article in English | MEDLINE | ID: mdl-36644656

ABSTRACT

Hyperspectral satellite imagery provides highly-resolved spectral information for large areas and can provide vital information. However, only a few imaging spectrometer missions are currently in operation. Aiming to generate synthetic satellite-based hyperspectral imagery potentially covering any region, we explored the possibility of applying statistical learning, i.e. emulation. Based on the relationship of a Sentinel-2 (S2) scene and a hyperspectral HyPlant airborne image, this work demonstrates the possibility to emulate a hyperspectral S2-like image. We tested the role of different machine learning regression algorithms (MLRA) and varied the image-extracted training dataset size. We found superior performance of Neural Network (NN) as opposed to the other algorithms when trained with large datasets (up to 100'000 samples). The developed emulator was then applied to the L2A (bottom-of-atmosphere reflectance) S2 subset, and the obtained S2-like hyperspectral reflectance scene was evaluated. The validation of emulated against reference spectra demonstrated the potential of the technique. R² values between 0.75-0.9 and NRMSE between 2-5% across the full 402-2356 nm range were obtained. Moreover, epistemic uncertainty is obtained using the dropout technique, revealing spatial fidelity of the emulated scene. We obtained highest SD values of 0.05 (CV of 8%) in clouds and values below 0.01 (CV of 7%) in vegetation land covers. Finally, the emulator was applied to an entire S2 tile (5490x5490 pixels) to generate a hyperspectral reflectance datacube with the texture of S2 (60Gb, at a speed of 0.14sec/10000pixels). As the emulator can convert any S2 tile into a hyperspectral image, such scenes give a perspective on what future satellite imaging spectroscopy will look like.
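
The dropout-based uncertainty mentioned in this abstract can be illustrated with a minimal sketch. It assumes the common Monte Carlo dropout recipe: dropout stays active at prediction time, the forward pass is repeated, and the spread of the outputs gives the SD and CV. The tiny network, its weights, and the function names below are hypothetical and are not taken from the paper.

```python
import math
import random


def forward_with_dropout(x, W1, W2, rate, rng):
    """One stochastic forward pass of a one-hidden-layer ReLU network (toy example)."""
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in W1]
    keep = 1.0 - rate
    # Inverted dropout: zero each hidden unit with probability `rate`,
    # rescale the survivors so the expected activation is unchanged.
    dropped = [h / keep if rng.random() < keep else 0.0 for h in hidden]
    return sum(w * h for w, h in zip(W2, dropped))


def mc_dropout_stats(x, W1, W2, rate, n_passes, seed=0):
    """Return (mean, SD, CV) of the output over repeated stochastic passes."""
    rng = random.Random(seed)
    outs = [forward_with_dropout(x, W1, W2, rate, rng) for _ in range(n_passes)]
    mean = sum(outs) / n_passes
    var = sum((o - mean) ** 2 for o in outs) / (n_passes - 1)
    sd = math.sqrt(var)
    cv = sd / abs(mean) if mean != 0 else float("inf")
    return mean, sd, cv


# Arbitrary illustrative weights and input (assumptions, not the paper's model).
W1 = [[0.5, -0.2], [0.1, 0.4], [0.3, 0.3]]
W2 = [0.2, 0.3, 0.5]
x = [1.0, 2.0]

mean, sd, cv = mc_dropout_stats(x, W1, W2, rate=0.5, n_passes=200)
print(f"mean={mean:.3f}  SD={sd:.3f}  CV={100 * cv:.1f}%")
```

Applied per pixel and per band, the same statistics would yield uncertainty maps of the kind the abstract describes; with `rate=0.0` every pass is identical and the SD collapses to zero.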

2.

Introducing ARTMO's Machine-Learning Classification Algorithms Toolbox: Application to Plant-Type Detection in a Semi-Steppe Iranian Landscape.

Aghababaei, Masoumeh; Ebrahimi, Ataollah; Naghipour, Ali Asghar; Asadi, Esmaeil; Pérez-Suay, Adrián; Morata, Miguel; Garcia, Jose Luis; Caicedo, Juan Pablo Rivera; Verrelst, Jochem.

Remote Sens (Basel) ; 14(18): 4452, 2022 Sep 06.

Article in English | MEDLINE | ID: mdl-36172268

ABSTRACT

Accurate plant-type (PT) detection forms an important basis for sustainable land management maintaining biodiversity and ecosystem services. In this sense, Sentinel-2 satellite images of the Copernicus program offer spatial, spectral, temporal, and radiometric characteristics with great potential for mapping and monitoring PTs. In addition, the selection of a best-performing algorithm needs to be considered for obtaining PT classification as accurate as possible. To date, no freely downloadable toolbox exists that brings the diversity of the latest supervised machine-learning classification algorithms (MLCAs) together into a single intuitive user-friendly graphical user interface (GUI). To fill this gap and to facilitate and automate the usage of MLCAs, here we present a novel GUI software package that allows systematically training, validating, and applying pixel-based MLCA models to remote sensing imagery. The so-called MLCA toolbox has been integrated within ARTMO's software framework developed in Matlab which implements most of the state-of-the-art methods in the machine learning community. To demonstrate its utility, we chose a heterogeneous case study scene, a landscape in Southwest Iran to map PTs. In this area, four main PTs were identified, consisting of shrub land, grass land, semi-shrub land, and shrub land-grass land vegetation. Having developed 21 MLCAs using the same training and validation datasets led to varying accuracy results. Gaussian process classifier (GPC) was validated as the top-performing classifier, with an overall accuracy (OA) of 90%. GPC follows a Laplace approximation to the Gaussian likelihood under the supervised classification framework, emerging as a very competitive alternative to common MLCAs. Random forests resulted in the second-best performance with an OA of 86%.
Two other types of ensemble-learning algorithms, i.e., tree-ensemble learning (bagging) and decision tree (with error-correcting output codes), yielded an OA of 83% and 82%, respectively. Following, thirteen classifiers reported OA between 70% and 80%, and the remaining four classifiers reported an OA below 70%. We conclude that GPC substantially outperformed all classifiers, and thus, provides enormous potential for the classification of a diversity of land-cover types. In addition, its probabilistic formulation provides valuable band ranking information, as well as associated predictive variance at a pixel level. Nevertheless, as these are supervised (data-driven) classifiers, performances depend on the entered training data, meaning that an assessment of all MLCAs is crucial for any application. Our analysis demonstrated the efficacy of ARTMO's MLCA toolbox for an automated evaluation of the classifiers and subsequent thematic mapping.

3.

Prototyping Crop Traits Retrieval Models for CHIME: Dimensionality Reduction Strategies Applied to PRISMA Data.

Pascual-Venteo, Ana B; Portalés, Enrique; Berger, Katja; Tagliabue, Giulia; Garcia, Jose L; Pérez-Suay, Adrián; Rivera-Caicedo, Juan Pablo; Verrelst, Jochem.

Remote Sens (Basel) ; 14(10): 2448, 2022 May 19.

Article in English | MEDLINE | ID: mdl-36017157

ABSTRACT

In preparation for new-generation imaging spectrometer missions and the accompanying unprecedented inflow of hyperspectral data, optimized models are needed to generate vegetation traits routinely. Hybrid models, combining radiative transfer models with machine learning algorithms, are preferred, however, dealing with spectral collinearity imposes an additional challenge. In this study, we analyzed two spectral dimensionality reduction methods: principal component analysis (PCA) and band ranking (BR), embedded in a hybrid workflow for the retrieval of specific leaf area (SLA), leaf area index (LAI), canopy water content (CWC), canopy chlorophyll content (CCC), the fraction of absorbed photosynthetic active radiation (FAPAR), and fractional vegetation cover (FVC). The SCOPE model was used to simulate training data sets, which were optimized with active learning. Gaussian process regression (GPR) algorithms were trained over the simulations to obtain trait-specific models. The inclusion of PCA and BR with 20 features led to the so-called GPR-20PCA and GPR-20BR models. The 20PCA models encompassed over 99.95% cumulative variance of the full spectral data, while the GPR-20BR models were based on the 20 most sensitive bands. Validation against in situ data obtained moderate to optimal results with normalized root mean squared error (NRMSE) from 13.9% (CWC) to 22.3% (CCC) for GPR-20PCA models, and NRMSE from 19.6% (CWC) to 29.1% (SLA) for GPR-20BR models. Overall, the GPR-20PCA slightly outperformed the GPR-20BR models for all six variables. To demonstrate mapping capabilities, both models were tested on a PRecursore IperSpettrale della Missione Applicativa (PRISMA) scene, spectrally resampled to Copernicus Hyperspectral Imaging Mission for the Environment (CHIME), over an agricultural test site (Jolanda di Savoia, Italy). 
The two strategies obtained plausible spatial patterns, and consistency between the two models was highest for FVC and LAI (R² = 0.91, R² = 0.86) and lowest for SLA mapping (R² = 0.53). From these findings, we recommend implementing GPR-20PCA models as the most efficient strategy for the retrieval of multiple crop traits from hyperspectral data streams. Hence, this workflow will support and facilitate the preparations of traits retrieval models from the next-generation operational CHIME.
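
The validation metrics quoted in this abstract (NRMSE in percent, R²) can be sketched as follows. Two assumptions are made, because the abstract does not state the exact definitions: NRMSE is the RMSE normalized by the range of the observations, and R² is the coefficient of determination rather than a squared correlation. The function names and the sample values are illustrative only.

```python
import math


def nrmse_percent(obs, pred):
    """RMSE divided by the observed range, in percent (assumed normalization)."""
    rmse = math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))
    return 100.0 * rmse / (max(obs) - min(obs))


def r_squared(obs, pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = sum(obs) / len(obs)
    ss_res = sum((o - p) ** 2 for o, p in zip(obs, pred))
    ss_tot = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - ss_res / ss_tot


# Made-up in situ values and model estimates, for illustration only.
observed = [1.0, 2.0, 3.0, 4.0]
estimated = [1.5, 2.5, 3.5, 4.5]
print(nrmse_percent(observed, estimated), r_squared(observed, estimated))
```

Other normalizations (for example by the mean of the observations) are also in use, so the percentages from this sketch are only comparable to published values when the same convention applies.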

4.

Correction: Kernel methods and their derivatives: Concept and perspectives for the earth system sciences.

Johnson, J Emmanuel; Laparra, Valero; Pérez-Suay, Adrián; Mahecha, Miguel D; Camps-Valls, Gustau.

PLoS One ; 16(2): e0246775, 2021.

Article in English | MEDLINE | ID: mdl-33534865

ABSTRACT

[This corrects the article DOI: 10.1371/journal.pone.0235885.].

5.

Kernel methods and their derivatives: Concept and perspectives for the earth system sciences.

Johnson, J Emmanuel; Laparra, Valero; Pérez-Suay, Adrián; Mahecha, Miguel D; Camps-Valls, Gustau.

PLoS One ; 15(10): e0235885, 2020.

Article in English | MEDLINE | ID: mdl-33119617

ABSTRACT

Kernel methods are powerful machine learning techniques which use generic non-linear functions to solve complex tasks. They have a solid mathematical foundation and exhibit excellent performance in practice. However, kernel machines are still considered black-box models as the kernel feature mapping cannot be accessed directly, thus making the kernels difficult to interpret. The aim of this work is to show that it is indeed possible to interpret the functions learned by various kernel methods as they can be intuitive despite their complexity. Specifically, we show that derivatives of these functions have a simple mathematical formulation, are easy to compute, and can be applied to various problems. The model function derivative in kernel machines is proportional to the kernel function derivative, and we provide the explicit analytic form of the first and second derivatives of the most common kernel functions with regard to the inputs as well as generic formulas to compute higher order derivatives. We use them to analyze the most used supervised and unsupervised kernel learning methods: Gaussian Processes for regression, Support Vector Machines for classification, Kernel Entropy Component Analysis for density estimation, and the Hilbert-Schmidt Independence Criterion for estimating the dependency between random variables. For all cases we expressed the derivative of the learned function as a linear combination of the kernel function derivative. Moreover, we provide intuitive explanations through illustrative toy examples and show how these same kernel methods can be applied to applications in the context of spatio-temporal Earth system data cubes. This work reflects on the observation that function derivatives may play a crucial role in kernel methods analysis and understanding.

Subject(s)

Computer Simulation , Earth Sciences , Machine Learning , Support Vector Machine , Entropy , Humans , Normal Distribution
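
The statement in this abstract that the derivative of the learned function is a linear combination of kernel derivatives can be made concrete with a small sketch. It assumes a model of the form f(x) = Σ α_i k(x, x_i) with an RBF kernel k(x, z) = exp(-||x - z||² / (2σ²)), whose input derivative is -(x - z)/σ² · k(x, z). The centers, weights, and function names are hypothetical and chosen only for illustration.

```python
import math


def rbf(x, z, sigma):
    """RBF kernel k(x, z) = exp(-||x - z||^2 / (2 sigma^2))."""
    d2 = sum((a - b) ** 2 for a, b in zip(x, z))
    return math.exp(-d2 / (2.0 * sigma ** 2))


def predict(x, centers, alpha, sigma):
    """Kernel model f(x) = sum_i alpha_i * k(x, x_i)."""
    return sum(a * rbf(x, c, sigma) for a, c in zip(alpha, centers))


def predict_grad(x, centers, alpha, sigma):
    """Analytic gradient of f: sum_i alpha_i * (-(x - x_i) / sigma^2) * k(x, x_i)."""
    grad = [0.0] * len(x)
    for a, c in zip(alpha, centers):
        k = rbf(x, c, sigma)
        for j in range(len(x)):
            grad[j] += a * (-(x[j] - c[j]) / sigma ** 2) * k
    return grad


# Illustrative values only (assumptions, not fitted to any data).
centers = [[0.0, 0.0], [1.0, 0.5]]
alpha = [0.7, -0.3]
sigma = 0.8
x = [0.3, -0.2]

# Compare the analytic gradient with a central finite difference.
h = 1e-6
for j, g in enumerate(predict_grad(x, centers, alpha, sigma)):
    xp = list(x); xp[j] += h
    xm = list(x); xm[j] -= h
    fd = (predict(xp, centers, alpha, sigma) - predict(xm, centers, alpha, sigma)) / (2 * h)
    print(j, g, fd)
```

The weights α would come from whichever kernel method is fitted (for example the dual coefficients of a kernel regression); the gradient formula itself only depends on the kernel.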

6.

Synergistic integration of optical and microwave satellite data for crop yield estimation.

Mateo-Sanchis, Anna; Piles, Maria; Muñoz-Marí, Jordi; Adsuara, Jose E; Pérez-Suay, Adrián; Camps-Valls, Gustau.

Remote Sens Environ ; 234: 111460, 2019 Dec 01.

Article in English | MEDLINE | ID: mdl-31798192

ABSTRACT

Developing accurate models of crop stress, phenology and productivity is of paramount importance, given the increasing need of food. Earth observation (EO) remote sensing data provides a unique source of information to monitor crops in a temporally resolved and spatially explicit way. In this study, we propose the combination of multisensor (optical and microwave) remote sensing data for crop yield estimation and forecasting using two novel approaches. We first propose the lag between Enhanced Vegetation Index (EVI) derived from MODIS and Vegetation Optical Depth (VOD) derived from SMAP as a new joint metric combining the information from the two satellite sensors in a unique feature or descriptor. Our second approach avoids summarizing statistics and uses machine learning to combine full time series of EVI and VOD. This study considers two statistical methods, a regularized linear regression and its nonlinear extension called kernel ridge regression to directly estimate the county-level surveyed total production, as well as individual yields of the major crops grown in the region: corn, soybean and wheat. The study area includes the US Corn Belt, and we use agricultural survey data from the National Agricultural Statistics Service (USDA-NASS) for year 2015 for quantitative assessment. 
Results show that (1) the proposed EVI-VOD lag metric correlates well with crop yield and outperforms common single-sensor metrics for crop yield estimation; (2) the statistical (machine learning) models working directly with the time series largely improve results compared to previously reported estimations; (3) the combined exploitation of information from the optical and microwave data leads to improved predictions over the use of single sensor approaches with coefficient of determination R² ≥ 0.76; (4) when models are used for within-season forecasting with limited time information, crop yield prediction is feasible up to four months before harvest (models reach a plateau in accuracy); and (5) the robustness of the approach is confirmed in a multi-year setting, reaching performance similar to that obtained with single-year data. In conclusion, results confirm the value of using both EVI and VOD at the same time, and the advantage of using automatic machine learning models for crop yield/production estimation.
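
A lag metric between two time series, such as the EVI-VOD lag described in this abstract, can be sketched as below. The abstract does not say how the lag is computed, so this sketch assumes one plausible definition: the time difference between the seasonal maxima of the two series. The function names, the sample values, and the 8-day sampling step are all hypothetical.

```python
def peak_index(series):
    """Index of the first maximum value in the series."""
    return max(range(len(series)), key=lambda i: series[i])


def evi_vod_lag(evi, vod, step_days=1):
    """Lag of the VOD peak relative to the EVI peak, in days (assumed definition)."""
    if len(evi) != len(vod):
        raise ValueError("EVI and VOD series must have the same length")
    return (peak_index(vod) - peak_index(evi)) * step_days


# Made-up seasonal curves sampled every 8 days, for illustration only.
evi = [0.2, 0.4, 0.7, 0.6, 0.3, 0.2]
vod = [0.1, 0.2, 0.3, 0.5, 0.6, 0.4]
print(evi_vod_lag(evi, vod, step_days=8))  # 16
```

A positive value means the VOD peak comes after the EVI peak. Real series would normally be smoothed first, since a single noisy sample can move the maximum.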
