Published September 28, 2020 | Version v0.3
Dataset Open

Continental Europe Digital Terrain Model at 30 m resolution based on GEDI, ICESat-2, AW3D, GLO-30, EUDEM, MERIT DEM and background layers

  • 1. OpenGeoHub foundation
  • 2. MultiOne

Description

Digital Terrain Model for Continental Europe based on the three publicly available Digital Surface Models and predicted using an Ensemble Machine Learning (EML). EML was trainined using GEDI level 2B points (Level 2A; "elev_lowestmode") and ICESat-2 (ATL08; "h_te_mean"): about 9 million points were overlaid vs MERITDEM, AW3D30, GLO-30, EU DEM, GLAD canopy height, tree cover and surface water cover maps, then an ensemble prediction model (mlr package in R) was fitted using random forest, Cubist and GLM, and used to predict most probable terrain height (bare earth). Input layers used to train the EML include:

Detailed processing steps can be found here. Read more about the processing steps here.

Training data set can be obtained in the file "gedi_elev.lowestmode_2019_eumap.RDS". The initial linear model fitted using the four independent Digital Surface / Digital Terrain models shows:

Residuals:
  Min       1Q   Median       3Q      Max 
-124.627   -1.097    0.973    2.544   59.324 
 
Coefficients:
  Estimate Std. Error t value Pr(>|t|)    
(Intercept)         -1.6220640  0.0032415  -500.4   <2e-16 ***
  eu_dem25m_          -0.1092988  0.0005531  -197.6   <2e-16 ***
  eu_AW3Dv2012_30m_    0.0933774  0.0005957   156.7   <2e-16 ***
  eu_GLO30_30m_        0.2637153  0.0006062   435.1   <2e-16 ***
  eu_MERITv1.0.1_30m_  0.7496494  0.0005009  1496.6   <2e-16 ***
  ---
  Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 7.059 on 9588230 degrees of freedom
(2046196 observations deleted due to missingness)
Multiple R-squared:  0.9996,	Adjusted R-squared:  0.9996 
F-statistic: 5.343e+09 on 4 and 9588230 DF,  p-value: < 2.2e-16

Which show that MERIT DEM (Yamazaki et al., 2019) is the most correlated DEM with GEDI and ICESat-2, most likely because it has been systematically post-processed and majority of canopy problems have been removed. Summary results of the model training (mlr::makeStackedLearner) using all covariates (including canopy height, tree cover, bare earth cover) shows:

Variable: elev_lowestmode.f 
R-square: 1 
Fitted values sd: 333 
RMSE: 6.54 

Ensemble model:
Call:
stats::lm(formula = f, data = d)

Residuals:
     Min       1Q   Median       3Q      Max 
-118.788   -0.871    0.569    1.956  165.119 

Coefficients:
             Estimate Std. Error t value Pr(>|t|)    
(Intercept) -0.198402   0.003045  -65.15   <2e-16 ***
regr.ranger  0.452543   0.001117  405.04   <2e-16 ***
regr.cubist  0.527011   0.001516  347.61   <2e-16 ***
regr.glm     0.020033   0.001217   16.47   <2e-16 ***
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 6.544 on 9588231 degrees of freedom
Multiple R-squared:  0.9996,	Adjusted R-squared:  0.9996 
F-statistic: 8.29e+09 on 3 and 9588231 DF,  p-value: < 2.2e-16

Which indicates that the elevation errors are in average (2/3rd of pixels) between +1-2 m. The variable importance based on Random Forest package ranger shows:

Variable importance:
               variable   importance
4   eu_MERITv1.0.1_30m_ 430641370770
1     eu_AW3Dv2012_30m_ 291483345389
2         eu_GLO30_30m_ 201517488587
3            eu_dem25m_ 132742500162
9 eu_canopy_height_30m_   5148617173
7             bare2010_   2087304901
8        treecover2000_   1761597272
6        treecover2010_    141670217

The output predicted terrain model includes the following two layers:

  • "dtm_elev.lowestmode_gedi.eml_mf": mean estimate of the terrain elevation in dm (decimeters) filtered using Gaussian filter and 2x pixel window in SAGA GIS,
  • "dtm_elev.lowestmode_gedi.eml_md": standard deviation of the independently fitted stacked predictors quantifying the prediction uncertainty in dm (decimeters),

The predicted elevations are based on the GEDI data hence the reference water surface (WGS84 ellipsoid) is about 43 m higher than the sea water surface for a specific EU country. Before modeling, we have corrected the reference elevations to the Earth Gravitational Model 2008 (EGM2008) by using the 5-arcdegree resolution correction surface (Pavlis et al, 2012).

All GeoTIFFs were prepared using Integer format (elevations rounded to 1 m) and have been converted to Cloud Optimized GeoTIFFs using GDAL.

Disclaimer: The output DTM still shows forest canopy (overestimation of the terrain elevation) and has not been hydrologically corrected for spurious sinks and similar. This data set is continuously updated. To report a bug or suggest an improvement, please visit here. To access DTM derivatives at 30-m, 100-m and 250-m please visit here. To register for updates please subscribe to: https://twitter.com/HarmonizerGeo.

Notes

This work has received funding from the European Union's the Innovation and Networks Executive Agency (INEA) under Grant Agreement Connecting Europe Facility (CEF) Telecom project 2018-EU-IA-0095 (https://ec.europa.eu/inea/en/connecting-europe-facility/cef-telecom/2018-eu-ia-0095).

Remote files

Additional details

See also

Created:
March 14, 2025
Modified:
March 14, 2025