Correction of Overestimation in Observed Land Surface Temperatures Based on Machine Learning Models

Fa Liu aKey Laboratory of Water Cycle and Related Land Surface Processes, Institute of Geographic Science and Science and Natural Resources Research, Chinese Academy of Sciences, Beijing, China

Search for other papers by Fa Liu in
Current site
Google Scholar
PubMed
Close
,
Xunming Wang aKey Laboratory of Water Cycle and Related Land Surface Processes, Institute of Geographic Science and Science and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
bCollege of Resources and Environment, University of Chinese Academy of Sciences, Beijing, China

Search for other papers by Xunming Wang in
Current site
Google Scholar
PubMed
Close
,
Fubao Sun aKey Laboratory of Water Cycle and Related Land Surface Processes, Institute of Geographic Science and Science and Natural Resources Research, Chinese Academy of Sciences, Beijing, China
bCollege of Resources and Environment, University of Chinese Academy of Sciences, Beijing, China
cState Key Laboratory of Desert and Oasis Ecology, Xinjiang Institute of Ecology and Geography, Chinese Academy of Sciences, Urumqi, China
dAkesu National Station of Observation and Research for Oasis Agro-Ecosystem, Akesu, China

Search for other papers by Fubao Sun in
Current site
Google Scholar
PubMed
Close
,
Hong Wang aKey Laboratory of Water Cycle and Related Land Surface Processes, Institute of Geographic Science and Science and Natural Resources Research, Chinese Academy of Sciences, Beijing, China

Search for other papers by Hong Wang in
Current site
Google Scholar
PubMed
Close
,
Lifeng Wu eSchool of Hydraulic and Ecological Engineering, Nanchang Institute of Technology, Nanchang, China

Search for other papers by Lifeng Wu in
Current site
Google Scholar
PubMed
Close
,
Xuanze Zhang aKey Laboratory of Water Cycle and Related Land Surface Processes, Institute of Geographic Science and Science and Natural Resources Research, Chinese Academy of Sciences, Beijing, China

Search for other papers by Xuanze Zhang in
Current site
Google Scholar
PubMed
Close
,
Wenbin Liu aKey Laboratory of Water Cycle and Related Land Surface Processes, Institute of Geographic Science and Science and Natural Resources Research, Chinese Academy of Sciences, Beijing, China

Search for other papers by Wenbin Liu in
Current site
Google Scholar
PubMed
Close
, and
Huizheng Che fKey Laboratory of Atmospheric Chemistry (LAC), Institute of Atmospheric Composition and Environment Meteorology, Chinese Academy of Meteorological Sciences, China Meteorological Administration, Beijing, China

Search for other papers by Huizheng Che in
Current site
Google Scholar
PubMed
Close
Restricted access

Abstract

Land surface temperature (LST) is an essential variable for high-temperature prediction, drought monitoring, climate, and ecological environment research. Several recent studies reported that LST observations in China warmed much faster than surface air temperature (SAT), especially after 2002. Here we found that the abrupt change in daily LST was mainly due to the overestimation of LST values from the automatic recording thermometer under snow cover conditions. These inhomogeneity issues in LST data could result in wrong conclusions without appropriate correction. To address these issues, we proposed three machine learning models—multivariate adaptive regression spline (MARS), random forest (RF), and a novel simple tree-based method named extreme gradient boosting (XGBoost)—for accurate prediction of daily LST using conventional meteorological data. Daily air temperature (maximum, minimum, mean), sunshine duration, precipitation, wind speed, relative humidity, daily solar radiation, and diurnal temperature range of 2185 stations over 1971–2002 from four regions of China were used to train and test the models. The results showed that the machine learning models, particularly XGBoost, outperformed other models in estimating daily LST. Based on LST data corrected by the XGBoost model, the dramatic increase in LST disappeared. The long-term trend for the new LST was estimated to be 0.32° ± 0.03°C decade−1 over 1971–2019, which is close to the trend in SAT (0.30° ± 0.03°C decade−1). This study corrected the inhomogeneities of daily LST in China, indicating the strong potential of machine learning models for improving estimation of LST and other surface climatic factors.

© 2022 American Meteorological Society. For information regarding reuse of this content and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

Corresponding authors: X. Wang, xunming@igsnrr.ac.cn; F. Sun, sunfb@igsnrr.ac.cn

Abstract

Land surface temperature (LST) is an essential variable for high-temperature prediction, drought monitoring, climate, and ecological environment research. Several recent studies reported that LST observations in China warmed much faster than surface air temperature (SAT), especially after 2002. Here we found that the abrupt change in daily LST was mainly due to the overestimation of LST values from the automatic recording thermometer under snow cover conditions. These inhomogeneity issues in LST data could result in wrong conclusions without appropriate correction. To address these issues, we proposed three machine learning models—multivariate adaptive regression spline (MARS), random forest (RF), and a novel simple tree-based method named extreme gradient boosting (XGBoost)—for accurate prediction of daily LST using conventional meteorological data. Daily air temperature (maximum, minimum, mean), sunshine duration, precipitation, wind speed, relative humidity, daily solar radiation, and diurnal temperature range of 2185 stations over 1971–2002 from four regions of China were used to train and test the models. The results showed that the machine learning models, particularly XGBoost, outperformed other models in estimating daily LST. Based on LST data corrected by the XGBoost model, the dramatic increase in LST disappeared. The long-term trend for the new LST was estimated to be 0.32° ± 0.03°C decade−1 over 1971–2019, which is close to the trend in SAT (0.30° ± 0.03°C decade−1). This study corrected the inhomogeneities of daily LST in China, indicating the strong potential of machine learning models for improving estimation of LST and other surface climatic factors.

© 2022 American Meteorological Society. For information regarding reuse of this content and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

Corresponding authors: X. Wang, xunming@igsnrr.ac.cn; F. Sun, sunfb@igsnrr.ac.cn

Supplementary Materials

    • Supplemental Materials (PDF 730 KB)
Save
  • Alexandersson, H., 1986: A homogeneity test applied to precipitation data. J. Climatol., 6, 661675, https://doi.org/10.1002/joc.3370060607.

    • Search Google Scholar
    • Export Citation
  • Allen, R. G., L. S. Pereira, D. Raes, and M. Smith, 1998: Crop evapotranspiration—Guidelines for computing crop water requirements. FAO Irrigation and Drainage Paper 56, Food and Agriculture Organization, 300 pp., http://www.fao.org/docrep/x0490e/x0490e00.htm.

  • Arnfield, A. J., 2003: Two decades of urban climate research: A review of turbulence, exchanges of energy and water, and the urban heat island. Int. J. Climatol., 23 (1), 126, https://doi.org/10.1002/joc.859

    • Search Google Scholar
    • Export Citation
  • Bartlett, M. G., D. S. Chapman, and R. N. Harris, 2005: Snow effect on North American ground temperatures, 1950–2002. J. Geophys. Res., 110, F03008, https://doi.org/10.1029/2005JF000293.

    • Search Google Scholar
    • Export Citation
  • Bosilovich, M. G., 2006: A comparison of MODIS land surface temperature with in situ observations. Geophys. Res. Lett., 33, L20112, https://doi.org/10.1029/2006GL027519.

    • Search Google Scholar
    • Export Citation
  • Breiman, L., 2001: Random forests. Mach. Learn., 45, 532, https://doi.org/10.1023/A:1010933404324.

  • Chen, T., and C. Guestrin, 2016: XGBoost: A scalable tree boosting system. Proc. 22nd ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, San Francisco, CA, Association for Computing Machinery, 785794.

  • Citakoglu, H., 2017: Comparison of artificial intelligence techniques for prediction of soil temperatures in Turkey. Theor. Appl. Climatol., 130, 545556, https://doi.org/10.1007/s00704-016-1914-7.

    • Search Google Scholar
    • Export Citation
  • CMA, 2003: Specifications for Surface Meteorological Observation (in Chinese). China Meteorological Press, 151 pp.

  • Cohen, J., J. A. Screen, J. C. Furtado, M. Barlow, D. Whittleston, D. Coumou, and J. Jones, 2014: Recent Arctic amplification and extreme mid-latitude weather. Nat. Geosci., 7, 627637, https://doi.org/10.1038/ngeo2234.

    • Search Google Scholar
    • Export Citation
  • Crino, S., and D. E. Brown, 2007: Global optimization with multivariate adaptive regression splines. IEEE Trans. Syst. Man Cybern., 37, 333340, https://doi.org/10.1109/TSMCB.2006.883430.

    • Search Google Scholar
    • Export Citation
  • Cui, Y., W. Xu, Z. Zhou, C. Zhao, Y. Ding, X. Ao, and X. Zhou, 2020: Bias analysis and correction of ground surface temperature observations across China. J. Meteor. Res., 34, 13241334, https://doi.org/10.1007/s13351-020-0031-9.

    • Search Google Scholar
    • Export Citation
  • DelSole, T., and M. K. Tippett, 2014: Comparing forecast skill. Mon. Wea. Rev., 142, 46584678, https://doi.org/10.1175/MWR-D-14-00045.1.

    • Search Google Scholar
    • Export Citation
  • DelSole, T., and M. K. Tippett, 2016: Forecast comparison based on random walks. Mon. Wea. Rev., 144, 615626, https://doi.org/10.1175/MWR-D-15-0218.1.

    • Search Google Scholar
    • Export Citation
  • Du, J., K. Wang, B. Cui, and S. Jiang, 2020: Correction of inhomogeneities in observed land surface temperatures over China. J. Climate, 33, 88858902, https://doi.org/10.1175/JCLI-D-19-0521.1.

    • Search Google Scholar
    • Export Citation
  • Fan, J., X. Wang, L. Wu, H. Zhou, F. Zhang, X. Yu, X. Lu, and Y. Xiang, 2018: Comparison of Support Vector Machine and Extreme Gradient Boosting for predicting daily global solar radiation using temperature and precipitation in humid subtropical climates: A case study in China. Energy Convers. Manage., 164, 102111, https://doi.org/10.1016/j.enconman.2018.02.087.

    • Search Google Scholar
    • Export Citation
  • Farhangi, F., A. Sadeghi-Niaraki, A. Nahvi, and S. V. Razavi-Termeh, 2020: Spatial modelling of accidents risk caused by driver drowsiness with data mining algorithms. Geocarto Int., https://doi.org/10.1080/10106049.2020.1831626, in press.

    • Search Google Scholar
    • Export Citation
  • Friedman, J. H., 1991: Multivariate adaptive regression splines. Ann. Stat., 19, 167, https://doi.org/10.1214/aos/1176347963.

  • Hauke, J., and T. Kossowski, 2011: Comparison of values of Pearson’s and Spearman’s correlation coefficients on the same sets of data. Quaestiones Geogr.,30, 8793, https://doi.org/10.2478/v10117-011-0021-1.

  • He, C., J. Liu, F. Xu, T. Zhang, S. Chen, Z. Sun, and J. He, 2020: Improving solar radiation estimation in China based on regional optimal combination of meteorological factors with machine learning methods. Energy Convers. Manage., 220, 113111, https://doi.org/10.1016/j.enconman.2020.113111.

    • Search Google Scholar
    • Export Citation
  • Huang, B., C. Liu, V. Banzon, E. Freeman, G. Graham, B. Hankins, and H. M. Zhang, 2021: Improvements of the Daily Optimum Interpolation Sea Surface Temperature (DOISST) version 2.1. J. Climate, 34, 29232939, https://doi.org/10.1175/JCLI-D-20-0166.1.

    • Search Google Scholar
    • Export Citation
  • Huang, X., J. Deng, X. Ma, Y. Wang, Q. Feng, X. Hao, and T. Liang, 2016: Spatiotemporal dynamics of snow cover based on multi-source remote sensing data in China. Cryosphere, 10, 24532463, https://doi.org/10.5194/tc-10-2453-2016.

    • Search Google Scholar
    • Export Citation
  • Jarlan, L., S. Er-Raki, R. Balaghi, A. Amazirh, B. Richard, and S. Khabba, 2021: Cereal yield forecasting with satellite drought-based indices, weather data and regional climate indices using machine learning in Morocco. Remote Sens., 13, 3101, https://doi.org/10.3390/rs13163101.

    • Search Google Scholar
    • Export Citation
  • Jiménez, C., C. Prigent, J. Catherinot, W. Rossow, P. Liang, and J. L. Moncet, 2012: A comparison of ISCCP land surface temperature with other satellite and in situ observations. J. Geophys. Res., 117, D08111 https://doi.org/10.1029/2011JD017058

    • Search Google Scholar
    • Export Citation
  • Jung, C., Y. Lee, Y. Cho, and S. Kim, 2017: A study of spatial soil moisture estimation using a multiple linear regression model and MODIS land surface temperature data corrected by conditional merging. Remote Sens., 9, 870, https://doi.org/10.3390/rs9080870.

    • Search Google Scholar
    • Export Citation
  • Kalma, J. D., T. R. McVicar, and M. F. McCabe, 2008: Estimating land surface evaporation: A review of methods using remotely sensed surface temperature data. Surv. Geophys., 29, 421469, https://doi.org/10.1007/s10712-008-9037-z.

    • Search Google Scholar
    • Export Citation
  • Kaneko, H., and K. Funatsu, 2015: Fast optimization of hyperparameters for support vector regression models with highly predictive ability. Chemom. Intell. Lab. Syst., 142, 6469, https://doi.org/10.1016/j.chemolab.2015.01.001.

    • Search Google Scholar
    • Export Citation
  • Khosravi, K., L. Mao, O. Kisi, Z. M. Yaseen, and S. Shahid, 2018: Quantifying hourly suspended sediment load using data mining models: Case study of a glacierized Andean catchment in Chile. J. Hydrol., 567, 165179, https://doi.org/10.1016/j.jhydrol.2018.10.015.

    • Search Google Scholar
    • Export Citation
  • Lee, Y., and S. Kim, 2016: The modified SEBAL for mapping daily spatial evapotranspiration of South Korea using three flux towers and Terra MODIS data. Remote Sens., 8, 983, https://doi.org/10.3390/rs8120983.

    • Search Google Scholar
    • Export Citation
  • Li, Z.-L., B.-H. Tang, H. Wu, H. Ren, G. Yan, Z. Wan, I. F. Trigo, and J. A. Sobrino, 2013: Satellite-derived land surface temperature: Current status and perspectives. Remote Sens. Environ., 131, 1437, https://doi.org/10.1016/j.rse.2012.12.008.

    • Search Google Scholar
    • Export Citation
  • Liao, Y. M., D. Chen, and Q. F. Liu, 2019: The spatiotemporal characteristics and long-term trends of surface-air temperatures difference in China. Climate Change Res., 15, 374384, https://doi.org/10.12006/j.issn.1673-1719.2018.199.

    • Search Google Scholar
    • Export Citation
  • Liu, F., F. Sun, W. Liu, T. Wang, H. Wang, X. Wang, and W. H. Lim, 2019: On wind speed pattern and energy potential in China. Appl. Energy, 236, 867876, https://doi.org/10.1016/j.apenergy.2018.12.056.

    • Search Google Scholar
    • Export Citation
  • Liu, X., Z. Ren, and Y. Wang, 2008: Differences between automatic-observed and manual-observed surface temperature. J. Appl. Meteor. Sci., 19, 554563, http://112.126.69.253/en/article/id/20080506.

    • Search Google Scholar
    • Export Citation
  • Ma, X., C. Fang, and J. Ji, 2020: Prediction of outdoor air temperature and humidity using XGBoost. IOP Conf. Ser., 427, 012013, https://doi.org/10.1088/1755-1315/427/1/012013.

    • Search Google Scholar
    • Export Citation
  • Mehdizadeh, S., J. Behmanesh, and K. Khalili, 2018: Comprehensive modeling of monthly mean soil temperature using multivariate adaptive regression splines and support vector machine. Theor. Appl. Climatol., 133, 911924, https://doi.org/10.1007/s00704-017-2227-1.

    • Search Google Scholar
    • Export Citation
  • Meng, C. L., Z. L. Li, X. Zhan, J. C. Shi, and C. Y. Liu, 2009: Land surface temperature data assimilation and its impact on evapotranspiration estimates from the Common Land Model. Water Resour. Res., 45, W02421, https://doi.org/10.1029/2008WR006971.

    • Search Google Scholar
    • Export Citation
  • Nanda, A., S. Sen, A. N. Sharma, and K. P. Sudheer, 2020: Soil temperature dynamics at hillslope scale—Field observation and machine learning-based approach. Water, 12, 713, https://doi.org/10.3390/w12030713.

    • Search Google Scholar
    • Export Citation
  • Pan, B., 2018: Application of XGBoost algorithm in hourly PM2.5 concentration prediction. IOP Conf. Ser., 113, 012127, https://doi.org/10.1088/1755-1315/113/1/012127.

    • Search Google Scholar
    • Export Citation
  • Rodriguez-Galiano, V., M. Sanchez-Castillo, M. Chica-Olmo, and M. Chica-Rivas, 2015: Machine learning predictive models for mineral prospectivity: An evaluation of neural networks, random forest, regression trees and support vector machines. Ore Geol. Rev., 71, 804818, https://doi.org/10.1016/j.oregeorev.2015.01.001.

    • Search Google Scholar
    • Export Citation
  • Schwartz, M. D., R. Ahas, and A. Aasa, 2006: Onset of spring starting earlier across the Northern Hemisphere. Glob. Change Biol., 12, 343351, https://doi.org/10.1111/j.1365-2486.2005.01097.x.

    • Search Google Scholar
    • Export Citation
  • Sheridan, R. P., W. M. Wang, A. Liaw, J. Ma, and E. M. Gifford, 2016: Extreme gradient boosting as a method for quantitative structure–activity relationships. J. Chem. Inf. Model., 56, 23532360, https://doi.org/10.1021/acs.jcim.6b00591.

    • Search Google Scholar
    • Export Citation
  • Song, R., S. Chen, B. Deng, and L. Li, 2016: eXtreme gradient boosting for identifying individual users across different digital devices. International Conference on Web-Age Information Management, Springer, 4354.

  • Thiebault, K., and S. Young, 2020: Snow cover change and its relationship with land surface temperature and vegetation in northeastern North America from 2000 to 2017. Int. J. Remote Sens., 41, 84538474, https://doi.org/10.1080/01431161.2020.1779379.

    • Search Google Scholar
    • Export Citation
  • Tomlinson, C. J., L. Chapman, J. E. Thornes, and C. Baker, 2011: Remote sensing land surface temperature for meteorology and climatology: A review. Meteor. Appl., 18, 296306, https://doi.org/10.1002/met.287.

    • Search Google Scholar
    • Export Citation
  • Trewin, B. C., 2010: Exposure, instrumentation and observing practice effects on land temperature measurements. Wiley Interdiscip. Rev.: Climate Change, 1, 490506, https://doi.org/10.1002/wcc.46.

    • Search Google Scholar
    • Export Citation
  • Voogt, J. A., and T. R. Oke, 2003: Thermal remote sensing of urban climates. Remote Sens. Environ., 86, 370384, https://doi.org/10.1016/S0034-4257(03)00079-8.

    • Search Google Scholar
    • Export Citation
  • Walpole, R. E., R. H. Myers, S. L. Myers, and K. Ye, 2012: Probability and Statistics for Engineers and Scientists. 9th ed. Prentice Hall, 783 pp.

  • Wang, D., A. Wang, and X. Kong, 2021: Homogenization of the daily land surface temperature over the mainland of China from 1960 through 2017. Adv. Atmos. Sci., 38, 18111822, https://doi.org/10.1007/s00376-021-1038-6.

    • Search Google Scholar
    • Export Citation
  • Wang, J., Z. Pan, and G. Han, 2016: Variation in ground temperature at a depth of 0 cm and the relationship with air temperature in China from 1961 to 2010. Resour. Sci., 38, 17331741, https://doi.org/10.18402/resci.2016.09.11.

    • Search Google Scholar
    • Export Citation
  • Wang, K., Z. Wan, P. Wang, M. Sparrow, J. Liu, X. Zhou, and S. Haginoya, 2005: Estimation of surface long wave radiation and broadband emissivity using Moderate Resolution Imaging Spectroradiometer (MODIS) land surface temperature/emissivity products. J. Geophys. Res., 110, D11109, https://doi.org/10.1029/2004JD005566.

    • Search Google Scholar
    • Export Citation
  • Wang, Y., Z. Z. Hu, and F. Yan, 2017: Spatiotemporal variations of differences between surface air and ground temperatures in China. J. Geophys. Res. Atmos., 122, 79907999, https://doi.org/10.1002/2016JD026110.

    • Search Google Scholar
    • Export Citation
  • Wu, L., G. Huang, J. Fan, F. Zhang, X. Wang, and W. Zeng, 2019: Potential of kernel-based nonlinear extension of Arps decline model and gradient boosting with categorical features support for predicting daily global solar radiation in humid regions. Energy Convers. Manage., 183, 280295, https://doi.org/10.1016/j.enconman.2018.12.103.

    • Search Google Scholar
    • Export Citation
  • Xing, L., L. Li, J. Gong, C. Ren, J. Liu, and H. Chen, 2018: Daily soil temperatures predictions for various climates in United States using data-driven model. Energy, 160, 430440, https://doi.org/10.1016/j.energy.2018.07.004.

    • Search Google Scholar
    • Export Citation
  • Xu, X., C. Lu, X. Shi, and S. Gao, 2008: World water tower: An atmospheric perspective. Geophys. Res. Lett., 35, L20815, https://doi.org/10.1029/2008GL035867.

    • Search Google Scholar
    • Export Citation
  • Yang, J. W., L. M. Jiang, J. Lemmetyinen, J. M. Pan, K. Luojus, and M. Takala, 2021: Improving snow depth estimation by coupling HUT-optimized effective snow grain size parameters with the random forest approach. Remote Sens. Environ., 264, 112630, https://doi.org/10.1016/j.rse.2021.112630.

    • Search Google Scholar
    • Export Citation
  • Yang, K., and J. Zhang, 2018: Evaluation of reanalysis datasets against observational soil temperature data over China. Climate Dyn., 50, 317337, https://doi.org/10.1007/s00382-017-3610-4.

    • Search Google Scholar
    • Export Citation
  • Yu, S., Z. Chen, B. Yu, L. Wang, B. Wu, J. Wu, and F. Zhao, 2020: Exploring the relationship between 2D/3D landscape pattern and land surface temperature based on explainable eXtreme Gradient Boosting tree: A case study of Shanghai, China. Sci. Total Environ., 725, 138229, https://doi.org/10.1016/j.scitotenv.2020.138229.

    • Search Google Scholar
    • Export Citation
  • Zhao, B., K. Mao, Y. Cai, J. Shi, Z. Li, Z. Qin, and Z. Guo, 2020: A combined Terra and Aqua MODIS land surface temperature and meteorological station data product for China from 2003 to 2017. Earth Syst. Sci. Data, 12, 25552577, https://doi.org/10.5194/essd-12-2555-2020.

    • Search Google Scholar
    • Export Citation
  • Zhao, W., S. B. Duan, A. Li, and G. Yin, 2019: A practical method for reducing terrain effect on land surface temperature using random forest regression. Remote Sens. Environ., 221, 635649, https://doi.org/10.1016/j.rse.2018.12.008.

    • Search Google Scholar
    • Export Citation
  • Zhong, R., T. Zhao, Y. He, and X. Chen, 2019: Hydropower change of the water tower of Asia in 21st century: A case of the Lancang River hydropower base, upper Mekong. Energy, 179, 685696, https://doi.org/10.1016/j.energy.2019.05.059.

    • Search Google Scholar
    • Export Citation
  • Zhou, C., and K. Wang, 2016: Land surface temperature over global deserts: Means, variability, and trends. J. Geophys. Res. Atmos., 121, 14  344–14 357, https://doi.org/10.1002/2016JD025410.

    • Search Google Scholar
    • Export Citation
  • Zhou, C., K. Wang, and Q. Ma, 2017: Evaluation of eight current reanalyses in simulating land surface temperature from 1979 to 2003 in China. J. Climate, 30, 73797398, https://doi.org/10.1175/JCLI-D-16-0903.1.

    • Search Google Scholar
    • Export Citation
All Time Past Year Past 30 Days
Abstract Views 724 0 0
Full Text Views 2644 2040 82
PDF Downloads 880 278 12