s2glc posterwolrcover2017 b1s2glc.cbk.waw.pl/sites/default/files/inline-images/iabg...^^ed/e > î...

1
SENTINEL 2 DATA SENTINEL 2 DATA SENTINEL 2 DATA S2GLC NOMENCLATURE Site examples - challenges SENTINEL 2 DATA SENTINEL 2 DATA SENTINEL 2 DATA Project Consortium (1-4) : 1 Space Research Center of Polish Academy of Sciences (CBK PAN), Earth Observation Group, Poland; 2 IABG (Industrieanlagen Betriebs- gesellschaft mbH), Germany; 3 Friedrich-Schiller-Universität, The Department of Earth Observation at the Institute of Geography, Germany; 4 EOXPLORE UG, Germany Project funded by (5) : 5 ESA/ESRIN, Italy (Project ID: ESA/ESRIN 40000116197/15/I-Sbo) A Sampling Approach for Advanced Multi-Temporal Classification of Sentinel-2 Data Elke Krätzschmar 2 , Peter Hofmann 2 , Stanisław Lewiński 1 , Artur Nowakowski 1 , Marcin Rybicki 1 , Ewa Kukawska 1 , Radosław Malinowski 1 , Michał Krupiński 1 , Conrad Bielski 3 , Denise Dejon 4 , Diego Fernández Prieto 5 Sentinel-2 Global Land Cover (S2GLC) is an ESA SEOM project with the aim to define a scientific roadmap and recommendations for creating a Global Land Cover (GLC) database based on Sentinel-2 data. The database will bring a new quality of information to the Remote Sensing user community by exploiting Sentinnel-2 capabilities. The project began in 2016 comprising four partners: CBK PAN, IABG, EOXPLORE UG and FSU. When fostering methods of satellite image analysis according to the increasing technological capabilities new satellite technology offers, these methods should systematically build on previously achieved efforts. Simultaneously, they should continue recent methods and take advantage of the new technology’s temporal, thematic and spatial capabilities. After an extensive review of currently available Global Landcover (GLC) databases, as an initial step of this project a preliminary hierarchical legend of land cover classes was defined, which intends to benefit from the high temporal, spatial and spectral resolution of Sentinel-2 in combination with Sentinel-1. Subject of this presentation is the preparation of suitable reference data used in the classification and validation process performed within this study and beyond. For this purpose, five test sites were selected globally, enabling us to react sensible towards regional variety. The test sites are of significant extent and heterogeneous in their environmental character resulting in heterogeneous land cover classes. They are located in: Italy, Germany, Namibia, Colombia and China. Quantity and quality of already existing classification results is very heterogeneous. Supplementary regional data sets (for example LUCAS, CORINE) with different accuracy than the envisaged data set but of higher accuracy than existing GLC data sets were considered as additional thematic information. Recent investigations took place on selected subsets. Overall goal is to identify and develop strategies to combine available GLC data and addition- nal thematic layers in order to reduce manual effort. The latter concen- trates on providing significant input for the set-up of an overall roadmap regarding a Global Sampling Database. Investigations consider available global classification and validation results in combination with the coarse analysis of up-to-date Sentinel2 data supporting information extraction suitable for classifications and validation issues. Main aim within the project is to support the multi-temporal classification process, which is subject of a different presentation. Abstract GLC datasets Summary and Outlook GLOBAL, (SUPER)NATIONAL THEMATIC DATABASES, (OF DIFFERENT TYPE) of S2GLC classes for initial classification, specification of mapping details SERVICE REQUIREMENTS, POTENTIAL OF SENTINEL DATA REVIEW AND DEFINITION USABILITY? & COMBINATION Sample ‘equal’ thematic information VALIDATION DATASET 1/3 VALIDATION REPORT(S) SENTINEL CLASSIFICATION TRAINING SAMPLE DATASET 2/3 THEMATIC , TEMPORAL & GEOMETRIC ENHANCEMENT SAMPLE DATASET contains expected classes, reduced according to logic spectral means Expensive taks, that requires suiable automation as much as possible; Status requirement: pixel-correct reduction; SENTINEL 2 DATA SENTINEL 2 DATA (1) Quantity (2) weighted Quantity (3) Quality (other aspects) Manual/ visual scan of ProtoType Sites looking at it entirety, valuing geometric and general geo-spatial aspects Within the review of global datasets, Global Validation Datasets (GLCV) were considered but rejected due to insufficient coverage regarding this project and date. Alternative a wide range of GLC were reviewed obser- ving thematic and geometric suita- bility. A combina- tion GlobCover 2005 and CCI-LC 2010 had been selected, accom- plished by Globe Land30 (except Europe). The shown result values more than GlobCover 2005 and CCI-LC 2010 (color: matching results, based on original nomenclature; black: discrepancies) 64% with comparable classification (not considering water bodies). How- ever, there are uncertainties regarding nomenclature issues. It can be sta- References: Tsendbazar N., Bruin, Fritz, Herold (2015): Spatial Accuracy Assessment and Integration of Global Land Cover Datasets. MDPI Remote Sensing Iss. Tsendbazar N., Bruin, Herold (2015): Assessing global land cover reference datasets for different user communities. ISPRS J. Photogramm. Remote Sens. 2015, 103, 93–114. Esch, T., Marconcini, Felbier, Roth, Heldens, Huber, Schwinger, Taubenböck, Müller, Dech, (2013): Urban Footprint Processor – Fully Automated Processing Chain Generating Settlement Masks from Global Data of the TanDEM-X Mission. IEEE Geoscience and Remote Sensing Letters, Vol. 10, No. 6, November 2013. Pp. 1617-1621. ISSN 1545-598X, DOI 10.1109/LGRS.2013.2272953. tet that GLC show local variation in accu- racy and thematic em- phasis [Tsendazar, 2015]. The Global Urban Footprint (GUF) [DLR 2016; Esch et.al., 2012/13] could com- pensate temporal lim- itations in the refe- rence data regarding artificial structures. Regional datasets Regional datasets are of severe interest, especially within a conceptual phase of a project: they are in general of higher spatial and thematic reso- lution. Test sites under investigation were chosen considering European CORINE, LUCAS, as well as a Colombian CORINE layer as additional ground information. Comparison of information layer illustrates the overall challenge Comparison of potential reference data sets; site: Italy challenge: matching class results represent matching spectral charac- teristics of a class according to mapping rules, which in GLC context is 0.9ha. As a consequence classes are often mapped as mixed whereas terrestrial LUCAS data identifies mixed rather than unique features. Follo- wing this finding LUCAS was reviewed & modified before further analysis. Overall Concept COLOMBIA GERMANY NAMIBIA t1 t2 trees vegetated (strong) soil trees veg(strong) soil veg(rare) evergreen trees managed land Aggregation … was chosen due to previous project activity in South America. On the other hand data availability is a challenging issue on most tropic and sub-topic regions (see table). Best data available is highly disturbed by clouds and scattered shadow. So far it was not possible to distinguish between the different vegetation types as well as to validate reference data (GLC/ available national). Acquisition dates are distributed adverse that low and elevated vegetation types can not be to distinguished appropriately. … is characterized by a high degree of artificially embossed land- scape. Data availability is sufficient, even though influenced by strong atmospheric effects. At present a feature aggregation model is set up on one fifth of the Germany site (referring to three S-2A Reduced GLC classification with high reliability towards Sentinel-2A Class behaviour per scene (selected tiles) scenes/ tile). Focus is to optimize and reduce manual impact towards semi-automatic procedures that can be transferred to larger extent. … shows very low agreement within the GLC comparison. This is also a result of the landcovers sensible behavior towards water, soil and (ex)posi- tion. The S-2A data pair shows how spectral features change after rainfall, hence characteristics of single vegetation classes cannot be described within a seasonal manner. The site of Namibia was investigated by virtual truth (google.earth) and it became clear, that for some vegetation classes (grassland, trees, shrub) within acquisition range no specific spec- tral behavior could be identified. Before rain event and after; comparable shrub land; 5km distance 20160402 20160810 On the other hand co-registration of the African S-2A data sets is exceeding general quality standards of 1 pixel and makes it difficult to model ob- jects on pixel level. Here, general app- roach may need to be reviewed. Generation of training and validation data is an important task and has significant impact on quality of the classification and its evaluation. The project presented here consists of two phases in which (1) selected sub- sites are reviewed in detail. Here experiences are made and strategy gets developed, which will then (2) be transferred onto the entire sites for which increasing the degree of automation will be focus of development. At present the classification process of our partner CBK is supervised, pixel-based and therefore highly dependent on availability of “pure” trai- ning data. However, within the process of increasing efficiency and reducing manual impact within the sampling task it needs to be identified which features and thematic classes do not require individual knowledge. With respect to significant characteristics of the Sentinel-2 data available as well as the anticipated workflow the project had to re-structure issues regarding spatial accuracy, data preprocessing and data handling. Training and Sample point selection so far is done on Sentinel-2A pixel level. A random sampling procedure is settled on top of this. This process needs to be investigated in more detail in order to specify suitable amount of training and sampling points as well as its spatial distribution. Overall goal will contribute to the set-up of a global sampling and validation data concept. Requirements in providing such database will increase significantly, aslo within the context of the growing number of free EO data available. THEMATIC TRANSFORMATION

Upload: others

Post on 24-Feb-2021

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: S2GLC posterWolrCover2017 B1s2glc.cbk.waw.pl/sites/default/files/inline-images/IABG...^^Ed/E > î î d d ^ Ed/E > î d ^ î'> EKD E > dhZ ^ ] Æ u o r Z o o v P ^^ Ed/E

SENTINEL 2 DATASENTINEL 2 DATASENTINEL 2 DATA

S2GLC NOMENCLATURE

Site examples - challenges

SENTINEL 2 DATASENTINEL 2 DATASENTINEL 2 DATA

Project Consortium (1-4): 1 Space Research Center of Polish Academy of Sciences (CBK PAN), Earth Observation Group, Poland; 2 IABG (Industrieanlagen Betriebs-gesellschaft mbH), Germany; 3 Friedrich-Schiller-Universität, The Department of Earth Observation at the Institute of Geography, Germany; 4 EOXPLORE UG, GermanyProject funded by (5): 5 ESA/ESRIN, Italy (Project ID: ESA/ESRIN 40000116197/15/I-Sbo)

A Sampling Approach for Advanced Multi-Temporal Classification of Sentinel-2 DataElke Krätzschmar2, Peter Hofmann2, Stanisław Lewiński1, Artur Nowakowski1, Marcin Rybicki1, Ewa Kukawska1, Radosław Malinowski1, Michał Krupiński1, Conrad Bielski3,

Denise Dejon4, Diego Fernández Prieto5

Sentinel-2 Global Land Cover (S2GLC) is an ESA SEOM project with the aimto define a scientific roadmap and recommendations for creating a GlobalLand Cover (GLC) database based on Sentinel-2 data. The database willbring a new quality of information to the Remote Sensing user communityby exploiting Sentinnel-2 capabilities. The project began in 2016 comprisingfour partners: CBK PAN, IABG, EOXPLORE UG and FSU.When fostering methods of satellite image analysis according to the

increasing technological capabilities new satellite technology offers, thesemethods should systematically build on previously achieved efforts.Simultaneously, they should continue recent methods and take advantageof the new technology’s temporal, thematic and spatial capabilities.

After an extensive review of currently available Global Landcover (GLC)

databases, as an initial step of this project a preliminary hierarchical legendof land cover classes was defined, which intends to benefit from the hightemporal, spatial and spectral resolution of Sentinel-2 in combination withSentinel-1.Subject of this presentation is the preparation of suitable reference data

used in the classification and validation process performed within thisstudy and beyond. For this purpose, five test sites were selected globally,enabling us to react sensible towards regional variety. The test sites are ofsignificant extent and heterogeneous in their environmental characterresulting in heterogeneous land cover classes. They are located in: Italy,Germany, Namibia, Colombia and China.Quantity and quality of already existing classification results is very

heterogeneous. Supplementary regional data sets (for example LUCAS,CORINE) with different accuracy than the envisaged data set but of higher

accuracy than existing GLC data sets were considered as additionalthematic information.Recent investigations took place on selected subsets. Overall goal is to

identify and develop strategies to combine available GLC data and addition-nal thematic layers in order to reduce manual effort. The latter concen-trates on providing significant input for the set-up of an overall roadmapregarding a Global Sampling Database.Investigations consider available global classification and validation results

in combination with the coarse analysis of up-to-date Sentinel2 datasupporting information extraction suitable for classifications and validationissues. Main aim within the project is to support the multi-temporalclassification process, which is subject of a different presentation.

Abstract

GLC datasets

Summary and Outlook

GLOBAL, (SUPER)NATIONALTHEMATIC DATABASES,(OF DIFFERENT TYPE)of S2GLC classes for initial classification,specification of mapping detailsSERVICE REQUIREMENTS, POTENTIAL OF SENTINEL DATA

REVIEW AND DEFINITION

USABILITY? & COMBINATION

Sample

‘equal’ thematic information

VALIDATION DATASET1/3

VALIDATION REPORT(S)SENTINELCLASSIFICATION

TRAINING SAMPLE DATASET 2/3

THEMATIC , TEMPORAL & GEOMETRIC ENHANCEMENT

SAMPLE DATASETcontains expected classes, reduced according to logic spectral meansExpensive taks, that requires suiable automation as much as possible;Status requirement: pixel-correct reduction;

SENTINEL 2 DATASENTINEL 2 DATA

(1) Quantity(2) weighted Quantity(3) Quality (other aspects)Manual/ visual scan ofProtoType Sites looking at it entirety, valuinggeometric and general geo-spatial aspects

Within the review of global datasets, Global Validation Datasets (GLCV)were considered but rejected due to insufficient coverage regarding thisproject and date. Alternative a wide range of GLC were reviewed obser-ving thematic andgeometric suita-bility. A combina-tion GlobCover2005 and CCI-LC2010 had beenselected, accom-plished by GlobeLand30 (exceptEurope).The shown resultvalues more than

GlobCover 2005 and CCI-LC 2010 (color: matching results, based on original nomenclature; black: discrepancies) 64% with comparable classification (not considering water bodies). How-ever, there are uncertainties regarding nomenclature issues. It can be sta-

References:Tsendbazar N., Bruin, Fritz, Herold (2015): Spatial Accuracy Assessment

and Integration of Global Land Cover Datasets. MDPI Remote Sensing Iss.Tsendbazar N., Bruin, Herold (2015): Assessing global land cover

reference datasets for different user communities. ISPRS J. Photogramm.Remote Sens. 2015, 103, 93–114.Esch, T., Marconcini, Felbier, Roth, Heldens, Huber, Schwinger,

Taubenböck, Müller, Dech, (2013): Urban Footprint Processor – FullyAutomated Processing Chain Generating Settlement Masks from GlobalData of the TanDEM-X Mission. IEEE Geoscience and Remote SensingLetters, Vol. 10, No. 6, November 2013. Pp. 1617-1621. ISSN 1545-598X,DOI 10.1109/LGRS.2013.2272953.

tet that GLC showlocal variation in accu-racy and thematic em-phasis [Tsendazar,2015].The Global Urban

Footprint (GUF) [DLR2016; Esch et.al.,2012/13] could com-pensate temporal lim-itations in the refe-rence data regardingartificial structures.

Regional datasetsRegional datasets are of severe interest, especially within a conceptual

phase of a project: they are in general of higher spatial and thematic reso-lution. Test sites under investigation were chosen considering EuropeanCORINE, LUCAS, as well as a Colombian CORINE layer as additional groundinformation. Comparison of information layer illustrates the overallchallenge

Comparison of potential reference data sets; site: Italychallenge: matching class results represent matching spectral charac-teristics of a class according to mapping rules, which in GLC context is0.9ha. As a consequence classes are often mapped as mixed whereasterrestrial LUCAS data identifies mixed rather than unique features. Follo-wing this finding LUCAS was reviewed & modified before further analysis.

Overall Concept

COLOMBIA GERMANY NAMIBIA

t1 t2

trees vegetated (strong) soil trees veg(strong) soilveg(rare) evergreen trees managed land

Aggregation

… was chosen due to previous projectactivity in South America. On the otherhand data availability is a challengingissue on most tropic and sub-topicregions (see table).Best data available is highly disturbed

by clouds and scattered shadow. So farit was not possible to distinguishbetween the different vegetation types

as well as to validate reference data (GLC/ available national). Acquisitiondates are distributed adverse that low and elevated vegetation types cannot be to distinguished appropriately.

… is characterized by a high degree of artificially embossed land-scape. Data availability is sufficient, even though influenced bystrong atmospheric effects. At present a feature aggregation modelis set up on one fifth of the Germany site (referring to three S-2A

Reduced GLC classification with high reliability towards Sentinel-2A

Class behaviourper scene(selected tiles)

scenes/ tile). Focus is to optimize and reduce manual impact towardssemi-automatic procedures that can be transferred to larger extent.

… shows very low agreement within the GLC comparison. This is also aresult of the landcovers sensible behavior towards water, soil and (ex)posi-tion. The S-2A data pair shows how spectral features change after rainfall,hence characteristics of single vegetation classes cannot be describedwithin a seasonal manner. The site of Namibia was investigated by virtualtruth (google.earth) and it became clear, that for some vegetation classes

(grassland, trees, shrub) within acquisition range no specific spec-tral behavior could be identified.

Before rain event and after; comparable shrub land; 5km distance20160402 20160810

On the other handco-registration of theAfrican S-2A data setsis exceeding generalquality standards of1 pixel and makes itdifficult to model ob-jects on pixel level.Here, general app-roach may need to bereviewed.

Generation of training and validation data is an important task and hassignificant impact on quality of the classification and its evaluation. Theproject presented here consists of two phases in which (1) selected sub-sites are reviewed in detail. Here experiences are made and strategy getsdeveloped, which will then (2) be transferred onto the entire sites for whichincreasing the degree of automation will be focus of development.At present the classification process of our partner CBK is supervised,

pixel-based and therefore highly dependent on availability of “pure” trai-ning data. However, within the process of increasing efficiency and reducingmanual impact within the sampling task it needs to be identified which

features and thematic classes do not require individual knowledge.With respect to significant characteristics of the Sentinel-2 data available

as well as the anticipated workflow the project had to re-structure issuesregarding spatial accuracy, data preprocessing and data handling.Training and Sample point selection so far is done on Sentinel-2A pixel

level. A random sampling procedure is settled on top of this. This processneeds to be investigated in more detail in order to specify suitable amountof training and sampling points as well as its spatial distribution.Overall goal will contribute to the set-up of a global sampling and

validation data concept. Requirements in providing such database willincrease significantly, aslo within the context of the growing number offree EO data available.

THEMATIC TRANSFORMATION