TY - JOUR
T1 - A methodology to derive global maps of leaf traits using remote sensing and climate data
AU - Moreno-Martínez, Álvaro
AU - Camps-Valls, Gustau
AU - Kattge, Jens
AU - Robinson, Nathaniel
AU - Reichstein, Markus
AU - van Bodegom, Peter
AU - Kramer, Koen
AU - Cornelissen, J. Hans C.
AU - Reich, Peter
AU - Bahn, Michael
AU - Niinemets, Ülo
AU - Peñuelas, Josep
AU - Craine, Joseph M.
AU - Cerabolini, Bruno E.L.
AU - Minden, Vanessa
AU - Laughlin, Daniel C.
AU - Sack, Lawren
AU - Allred, Brady
AU - Baraloto, Christopher
AU - Byun, Chaeho
AU - Soudzilovskaia, Nadejda A.
AU - Running, Steve W.
N1 - Publisher Copyright:
© 2018 Elsevier Inc.
PY - 2018/12/1
Y1 - 2018/12/1
N2 - This paper introduces a modular processing chain to derive global high-resolution maps of leaf traits. In particular, we present global maps at 500 m resolution of specific leaf area, leaf dry matter content, leaf nitrogen and phosphorus content per dry mass, and leaf nitrogen/phosphorus ratio. The processing chain exploits machine learning techniques along with optical remote sensing data (MODIS/Landsat) and climate data for gap filling and up-scaling of in-situ measured leaf traits. The chain first uses random forests regression with surrogates to fill gaps in the database (> 45% of missing entries) and maximizes the global representativeness of the trait dataset. Plant species are then aggregated to Plant Functional Types (PFTs). Next, the spatial abundance of PFTs at MODIS resolution (500 m) is calculated using Landsat data (30 m). Based on these PFT abundances, representative trait values are calculated for MODIS pixels with nearby trait data. Finally, different regression algorithms are applied to globally predict trait estimates from these MODIS pixels using remote sensing and climate data. The methods were compared in terms of precision, robustness and efficiency. The best model (random forests regression) shows good precision (normalized RMSE≤ 20%) and goodness of fit (averaged Pearson's correlation R = 0.78) in any considered trait. Along with the estimated global maps of leaf traits, we provide associated uncertainty estimates derived from the regression models. The process chain is modular, and can easily accommodate new traits, data streams (traits databases and remote sensing data), and methods. The machine learning techniques applied allow attribution of information gain to data input and thus provide the opportunity to understand trait-environment relationships at the plant and ecosystem scales. The new data products – the gap-filled trait matrix, a global map of PFT abundance per MODIS gridcells and the high-resolution global leaf trait maps – are complementary to existing large-scale observations of the land surface and we therefore anticipate substantial contributions to advances in quantifying, understanding and prediction of the Earth system.
AB - This paper introduces a modular processing chain to derive global high-resolution maps of leaf traits. In particular, we present global maps at 500 m resolution of specific leaf area, leaf dry matter content, leaf nitrogen and phosphorus content per dry mass, and leaf nitrogen/phosphorus ratio. The processing chain exploits machine learning techniques along with optical remote sensing data (MODIS/Landsat) and climate data for gap filling and up-scaling of in-situ measured leaf traits. The chain first uses random forests regression with surrogates to fill gaps in the database (> 45% of missing entries) and maximizes the global representativeness of the trait dataset. Plant species are then aggregated to Plant Functional Types (PFTs). Next, the spatial abundance of PFTs at MODIS resolution (500 m) is calculated using Landsat data (30 m). Based on these PFT abundances, representative trait values are calculated for MODIS pixels with nearby trait data. Finally, different regression algorithms are applied to globally predict trait estimates from these MODIS pixels using remote sensing and climate data. The methods were compared in terms of precision, robustness and efficiency. The best model (random forests regression) shows good precision (normalized RMSE≤ 20%) and goodness of fit (averaged Pearson's correlation R = 0.78) in any considered trait. Along with the estimated global maps of leaf traits, we provide associated uncertainty estimates derived from the regression models. The process chain is modular, and can easily accommodate new traits, data streams (traits databases and remote sensing data), and methods. The machine learning techniques applied allow attribution of information gain to data input and thus provide the opportunity to understand trait-environment relationships at the plant and ecosystem scales. The new data products – the gap-filled trait matrix, a global map of PFT abundance per MODIS gridcells and the high-resolution global leaf trait maps – are complementary to existing large-scale observations of the land surface and we therefore anticipate substantial contributions to advances in quantifying, understanding and prediction of the Earth system.
KW - Climate
KW - Landsat
KW - MODIS
KW - Machine learning
KW - Plant ecology
KW - Plant traits
KW - Random forests
KW - Remote sensing
UR - http://www.scopus.com/inward/record.url?scp=85053850209&partnerID=8YFLogxK
U2 - 10.1016/j.rse.2018.09.006
DO - 10.1016/j.rse.2018.09.006
M3 - Article
AN - SCOPUS:85053850209
SN - 0034-4257
VL - 218
SP - 69
EP - 88
JO - Remote Sensing of Environment
JF - Remote Sensing of Environment
ER -