Surface soil moisture (SM) plays a crucial role in hydrological processes and terrestrial ecosystems in desertification areas. Passive microwave remote sensing products such as Soil Moisture Active Passive (SMAP) satellites have been proven to effectively monitor surface soil moisture. However, the low spatial resolution and lack of comprehensive coverage of these products greatly limit their application in desertification areas. To overcome these limitations, we combined various machine learning methods, including multiple linear regression (MLR), support vector regression (SVR), artificial neural network (ANN), random forest (RF), and extreme gradient boosting (XGB), to perform dimensionality reduction on the 36 kilometer SMAP SM product. We also generated higher spatial resolution SM data based on relevant surface variables such as vegetation index and surface temperature. The study selected desertification areas in North China that are sensitive to SM as the research area and produced daily downscaled SM with a resolution of 1 km from 2015 to 2020.
collect time | 2015/01/01 - 2020/12/31 |
---|---|
collect place | North China Desert Region |
data size | 6.4 GiB |
data format | TIFF |
Coordinate system |
(1) This article uses MODIS products MOD09A1, MOD11A1, MOD13A2, MOD15A2H, and MCD43D58 (Table 2). 1 kilometer of LST per day is provided by MOD11A1, and 1 kilometer of 16 day EVI and NDVI is provided by MOD13A2. MOD15A2H provides 8d Leaf Area Index (LAI) with a spatial resolution of 500 m, while MCD43D58 provides daily albedo data with a spatial resolution of 30 arcseconds (∼ 1000 m). Some indices related to soil moisture, including NDWI, NSDSI, and Surface Water Index (LSWI), were produced by MOD09A1.
(2) Terrain factors are closely related to SM, including altitude, slope, and aspect, obtained from the Land Process Distributed Activity Archive Center (LP DAAC)
(3) The 1000 meter resolution soil data used in this study, including the proportions of sand, silt, and clay, was obtained from the China Soil Characteristics Dataset (CSCD) obtained from the National Qinghai Tibet Plateau Data Center
(4) The in-situ SM measurement values were collected from the data provided by the Maqu monitoring network and Babao monitoring network.
(5) The daily precipitation and temperature data come from 131 meteorological stations of the China Meteorological Data Service Center.
(1) Based on selected variable indicators (mainly including terrain data, soil data, and some MODIS products) and machine learning methods, a SMAP SM downscaling framework was constructed using multiple machine learning methods, and machine learning methods widely used to construct SM and its related variable regression models were selected;
(2) Firstly, all data needs to be preprocessed. The daily LST data may be affected by clouds, so we use its quality control (QC) band to perform quality control on MOD11A1 products and select high-quality cloud free pixels. All selected variables, including LST, albedo LAI、NDWI、LSWI、NSDSI、NDVI、EVI、DEM、 Slope, aspect, sand, silt, and clay are all aggregated in GeoTIFF format to a resolution of 1 km. Use nearest neighbor interpolation to further resample these variables to achieve spatial resolution of SMAP SM data (36 km);
(3) Secondly, obtain valid samples and split them;
(4) Thirdly, determine the regression model based on the training and testing sets. Considering that the sample size is crucial for the accuracy of the regression model, we only selected periods with more than 100 samples to construct the model;
(5) Finally, rotate the hyperparameters and select the optimal model.
The results indicate that compared with the SM data observed in situ, its performance is good, with an average unbiased root mean square error value of 0.057 m3m-3. In addition, its time series is consistent with precipitation, and its performance is superior to common grid based SM products. These data can be used to assess soil drought conditions and provide reference for reversing desertification in the study area.
# | title | file size |
---|---|---|
1 | 2015.zip | 905.0 MiB |
2 | 2016.zip | 1.1 GiB |
3 | 2017.zip | 1.1 GiB |
4 | 2018.zip | 1.1 GiB |
5 | 2019.zip | 1.1 GiB |
6 | 2020.zip | 1.1 GiB |
7 | _ncdc_meta_.json | 6.9 KiB |
Surface Soil Moisture (SM) Multiple Linear Regression (MLR) Support Vector Regression (SVR) Artificial Neural Network (ANN) Random Forest (RF) Soil Moisture Active Passive (SMAP) Satellite
©Copyright 2005-. Northwest Institute of Eco-Environment and Resources, CAS.
Donggang West Road 320, Lanzhou, Gansu, China (730000)