Kim, H., Crow, W., Li, X., Wagner, W., Hahn, S., & Lakshmi, V. (2023). True global error maps for SMAP, SMOS, and ASCAT soil moisture data based on machine learning and triple collocation analysis. Remote Sensing of Environment, 298, Article 113776. https://doi.org/10.34726/4844
Quantifying the accuracy of the satellite-based soil moisture (SM) data is important for a number of key applications, such as: combining satellite-based SM products for long-term SM analyses, assimilating SM data into land surface models, and providing quality flags to mask bad quality SM data.
A range of statistical methods have been proposed to estimate error statistics for large-scale SM datasets including the: instrumental variable (IV) method, triple collocation analysis (TCA), and quadruple collocation analysis (QCDA). While requiring only two input products, the IV method also imposes an additional assumption that one input product possesses serially uncorrelated errors - thus limiting its scope compared to TC. Likewise, QCDA requires four independent SM data products that are difficult to obtain and may not always be available for analysis. Nonetheless, TCA-based methods still cannot provide truly global error maps for satellite SM products due to the limited number of independent SM products and difficulties with baseline TCA assumptions. Moreover, temporal sampling requirements for TCA are often impractical because of low SM retrieval skill in forested and arid areas – as well as in regions prone to radio frequency interference.
Here, we seek to fill significant spatial gaps in TCA results using machine learning (ML) and therefore provide spatially complete error maps for the satellite-based SM data products derived from the Soil Moisture Active Passive (SMAP), Soil Moisture and Ocean Salinity (SMOS), and Advanced Scatterometer (ASCAT) systems. Furthermore, we use SHapley Additive exPlanations (SHAP) values, a model-agnostic technique for interpreting ML models, to examine the impact of various environmental conditions on the quality of satellite-based SM retrievals.
Globally, and across all three products, 72.0% of missing error information in a TCA-based analysis, due to either the lack of valid data or the inability of TCA to provide reliable results, can be reconstructed from the ensemble prediction mean of the ML models. Overall, we found that 22.7% (a.m.) and 34.2% (p.m.) of the Earth'sSM dynamics (between 60°S to 60°N) have not been investigated properly across all three satellite missions.
en
Research Areas:
Environmental Monitoring and Climate Adaptation: 100%