31
Amazon Biome ATBD_R Algorithm Theoretical Basis Document & Results Instituto do Homem and Meio Ambiente da Amazônia - Imazon Centro de Geotecnologia do Imazon - CGI February, 2017 1

Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

Embed Size (px)

Citation preview

Page 1: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

 

Amazon Biome

ATBD_R Algorithm Theoretical Basis Document & Results

Instituto do Homem and Meio Ambiente da Amazônia - Imazon

Centro de Geotecnologia do Imazon - CGI  

    

 February, 2017

1

Page 2: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

 

Executive Summary This ATBD_R is a dynamic and evolving document. Each time we launch a new collection, this

document will be revised accordingly.

By reading the document one should be able to understand how the collection of maps was

produced, what datasets and methodological approaches were utilized and the rationale and

theory behind these approaches.

2

Page 3: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

Content 1. Introduction 7

1.1 Study Area 7

1.2. Key Science and Applications 8

2. Overview and Background Information 9

2.1. Context and Key Information 9

2.2. Existent Maps and Mapping Initiatives 10

3. Algorithm Descriptions, Assumptions, and Approaches 12

3.1 ImgTools Algorithms 12

3.2 Collection 1: MapBiomas Amazonia 13

3.2.1 Algorithm 13

3.2.2. Data availability and screening 14

3.2.3. Classification scheme and parameters 17

3.2.4. Data volumes and processing issues 23

3.3 Collection 2 - MapBiomas Amazonia 23

3.3.1. Algorithm 23

3.3.2. Data availability and screening 26

3.3.3. Classification scheme and parameters 27

3.3.4. Data volumes and processing issues 28

4. Validation Strategies 28

5. Map Collections and Analysis 31

6. Practical Considerations 33

7. Concluding Remarks and Perspectves 33

8. References 34

9. Appendices 35

9.1. Supplementary Material 35

9.2. Algorithm Code 35

 

List of Figures

3

Page 4: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

 Figure 1. Location of the Amazon biome in Brazil. 6 Figure 2. Forcing factors are indicated with red ovals; processes addressed in this Review are indicated by green boxes and arrows; and consequences for human society are indicated by blue boxes with rounded corners. 7 Figure 3. Methodology applied in processing Landsat images and implemented in the ImgTools software. 12 Figure 4. Stages for processing of Landsat images in order to generate Collection 1 of the MapBiomas Amazon project. 13 Figure 5. Distribution of mapping charts for MapBiomas Amazonia. 14 Figure 6. Coverage of Landsat scenes to make up the temporal mosaic of the charts processed for the Amazon biome for the period of 2008 to 2015, with up to 30% of cloud cover. 15 Figure 7. Spectral curves of the pure components utilized in the MME applied to Brazilian biomes; (a) for Landsat 5 and 7 in surface reflectance, (b) for Landsat 8 for reflectance at the top of the atmosphere. 16 Figure 8. Empirical decision tree applied to the NDFI, GV and GVs to map the classes of Forest and Non-Forest. Masks for Water and Cloud are applied to make up the final map. The variables (V1, V2, V3, V4) are adjusted for each chart. 18 Figure 9. Frequency distribution of the values assumed by variables v1, v2, v3 and v4. 18 Figure 10. Empirical decision tree applied to Collection 2 for the amazon biome. 22 Figure 11.Coverage of Landsat scenes to make up the temporal mosaic of the charts processed for the Amazon biome for the period of 2000 to 2016, with up to 30% of cloud cover. 26 Figure 12. Spatial distribution of the sampling points for analyzing the accuracy of Collections 1 and 2 of MapBiomas Amazonia. 28 Figure 13. Sampling scheme and interpretation grade of Landsat pixels utilized in visual interpretation for analysis of accuracy. 29 Figure 14. Preliminary result of the accuracy analysis of the Amazon biome for Collection 1. Accuracy by year (Level 3) 30 Figure 15. Data classification panel for Collection 2 MapBiomas Amazonia (A), and a more detailed view of the classification map for 2008 (B). 31

4

Page 5: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

List of Tables

 Table 1. Reference maps utilized in the MapBiomas Amazon project. 10 Table 2. Number of images utilized per year, sensor and maximum percentage of cloud cover accepted for Collection 1. 14 Table 3. Rules for reclassification applied by the temporal filter. 19 Table 4. Temporal filter rules applied to Collection 2 land cover classes. 23 Table 5. Number of images utilized per year, sensor and maximum percentage of cloud cover accepted for Collection 1. 25 Table 6. Classification scheme of Collection 2 for the Amazon Biome. 26

5

Page 6: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

1. Introduction

1.1 Study Area

The Amazon Biome has an extension of approximately 4,196,943 km², representing 49.29% of Brazilian territory. It fully occupies the States of Acre, Amapá, Amazonas, Pará, and Roraima, and sizable portions of Rondônia (99%), Mato Grosso (54%), Maranhão (34%) and Tocantins (9%), according to the IBGE (2004). The largest biodiversity reserve in the world, it is mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species.

The region is marked by an occupation process many times associated with social conflicts, deforestation and degradation of natural resources. Data from the National Institute of Space Research (INPE) reveal that, by 2015, almost 20% of Amazon forests had already been

deforested (PRODES, 2016). Imazon indicates that more than 300 thousand kilometers of clandestine roads have already been opened in the Amazon biome, providing access to illegal logging and mining and the degradation of natural resources. Despite those problems, the Amazon biome still retains 80% of its forest cover and vast expanses of protected areas (PRODES, 2016).

Figure 1. Location of the Amazon biome in Brazil.

6

Page 7: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

1.2. Key Science and Applications

The Amazon biome is one of the most intensely studied biomes in Brazil. Several multidisciplinary projects have been conducted in this region, notably The Large-Scale Biosphere-Atmosphere Experiment in Amazonia (LBA) conducted from 1995-2005 and led by Brazil. This region has also been monitored by INPE since 1988 through the PRODES project. Independent forest monitoring has also being conducted by national and international organizations (e.g., Imazon, University of Maryland GLAD systems, Terra-i and Global Forest Watch). None of these projects focus on annual historical (1985 through the present) mapping of land cover and land use classes with Landsat imagery. MapBiomas Amazonia has the ambitious proposal of doing that. There are several applications that can be developed for the Amazon region with the MapBiomas maps, including: Mapping and quantifying land cover and land use transitions. Quantification of gross forest losses and gains. Monitoring of secondary growth forests. Monitoring of water resources and their interaction with land cover classes. Expansion of infrastructure and urbanization. Regional planning. Management of Protected Areas. Estimation of greenhouse gas emissions.

Scientific applications can also benefit from the MapBiomas results for the Amazon biome. Interactions between global climate, land use, fire, hydrology, ecology and human dimensions, already reported as critical for the future of the Amazon region (Figure 2) (Davidson et al., 2012), can be better understood with long time-series of land cover and land use maps, as provided by MapBiomas Amazonia.

1

7

Page 8: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

Figure 2. Forcing factors are indicated with red ovals; processes addressed in this Review are indicated by green boxes and arrows; and consequences for human society are indicated by blue boxes with rounded corners.

2. Overview and Background Information

In March 2015, at a technical seminar on use of remote sensing and vegetation mapping data, the Project for Annual Mapping of Land Cover and Use in Brazil (MapBiomas) began its research activities. In that seminar, held at the Getúlio Vargas Foundation (FGV) in São Paulo, specialists in the areas of remote sensing of the various Brazilian biomes and on mapping of land use gathered to discuss the possibility of producing annual maps of land cover and use for the Brazilian territory in a rapid and low cost form. The purpose of the Project is to create updated maps and recover information on land use and cover over the last thirty years with Landsat images. Additionally, MapBiomas utilizes computer technology on clouds to process the Landsat images and web platforms for accessing and disseminating data.

Imazon has been present since the beginning of the MapBiomas project and contributed to the technical and scientific development of the project. The algorithms for processing Landsat images for application of a linear pixel mixture model (MLMP) and for calculation of the NDFI (Normalized Difference Fraction Index) (Souza et al., 2003) are examples of contributions by Imazon. Additionally, the Institute has contributed through research, development of the ImgTools software (Souza Jr. & Siqueira, 2013) and mapping of deforestation and forest degradation in the Amazon biome, publication of several papers on this issue and capacitation of several professionals working in this area. That knowledge learned and accumulated by the Institute has been shared with researchers interested in mapping pressures on tropical forests around the world. MapBiomas received that collection of knowledge, the algorithms implemented in ImgTools, and adapted a large part of the methodologies and algorithms for processing the satellite data utilized (and in some cases developed) by Imazon in its research carried out over the last decade. Imazon’s experience in processing of large-scale satellite images contributed to MapBiomas obtaining the first version of its annual forest maps for all Brazilian biomes in only six months.

The methodology utilized by Imazon for processing satellite images and generating data on deforestation and forest degradation gave rise to the ImgTools software (Souza Jr. & Siqueira, 2013). This software is a package of tools and algorithms for processing and classifying satellite images of the Landsat series (5, 7 and 8) and served as the basis for developing processing codes for the MLMP, NDFI, Empirical Decision Tree Builder, and spatial and temporal filters, currently implemented on the MapBiomas Workspace platform. That platform is a web application that uses the Earth Engine (EE) computer cloud to carry out the processing of remote sensing data collections in a parallel and distributed manner. EE was developed by Google in order to in a few seconds process images that a single computer would take hours or even days to do. The combination of the algorithms base from ImgTools and the EE processing power was one of the points that made possible production of land cover and use data for the Amazon biome in this project.

2.1. Context and Key Information

8

Page 9: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

Land use in the Amazon biome is quite heterogenous and complex throughout its 4.1 million square kilometers (MMA, 2013), a reflex of the historical dynamics of occupation of that territory, and understanding that process is essential in studies on biodiversity, climate change and environmental modeling, as well as for conceiving and following up policies directed towards land use (Almeida et al, 2016).

Up to 2014 land cover of altered areas in the Amazon was made up predominantly of pastures, with approximately 479 thousand km² or 63% of the biome, and in smaller proportions by secondary vegetation (22.8%) and areas with annual agriculture (6%) (INPE, 2016). This reflects the current and historical dynamics of occupation of the Amazonian space, which, without planning, generate negative socioeconomic and environmental impacts to this day, such as illegal deforestation and land-grabbing on public lands, all of which ends up impacting ecological services, biodiversity and water and carbon cycles (Fearnside, 2006).

2.2. Existent Maps and Mapping Initiatives

The growing concern with the consequences of climate changes has generated interest in questions related to monitoring of tropical forests and to actions for fighting deforestation in the Amazon. The first initiatives were developed for the purpose of mapping and quantifying the occurrence of deforestation in the region in order to advise the Brazilian government in devising public policies. The National Institute for Space Research (INPE) was a pioneer in monitoring deforestation of primary forests through clear-cutting, with a minimum area mapped of 6.25 ha, beginning in 1988 through the PRODES project, which utilizes Landsat satellite images to generate the official annual deforestation rates for the biome.

The dynamics of deforestation in the region have presented changes over time and the significant increase in rates at the beginning of the years 2000 pointed to the need for obtaining information on deforestation in a more expedited manner in order to enable identification and punishment of the offenders, not allowing the expansion of deforested areas to increase until the closing of the deforestation calendar. In that context, the deforestation alerts systems come into play, with the objective of making available to enforcement agencies the location of occurrences of clearcutting and forest degradation, as well as informing society with respect to the situation of forests in the Amazon.

In 2004 the System for Detecting Deforestation in Real Time (DETER) began operating and generating monthly deforestation alerts. DETER employs daily images from the MODIS sensor of the Terra satellite and is able to detect deforestation events in areas as low as 25 ha. In 2006 Imazon’s Deforestation Alert System (SAD) began its activities with its independent monitoring also using images from the MODIS sensor, but with a distinct method for processing data, which makes it possible to detect alerts of both clearcutting and forest degradation, due to timber harvesting or to forest fires, on areas greater than 10 ha.

Making available information on deforestation in a rapid and precise fashion is linked to speed in processing the data. SAD has carried out a partnership with Google so that the stages of processing and validating the alerts are performed on the Google Earth Engine platform, which makes it possible to access, organize and visualize geospatial data in the cloud. Thus, beginning in 2012, processing of MODIS images, which was previously done on Imazon computers and Imazon, began to be carried out in the Google computer clouds, making it possible to have communication of deforestation alerts to enforcement agencies and society in a shorter period.

9

Page 10: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

The growing access to timely and free of charge geospatial data via computers (clouds) has favored the development of applications on larger scales and with more precise sensors. That is the case, for example, with the Global Land Analysis and Discovery (GLAD), a system that generates deforestation alerts in tropical forests of Brazil, Peru, Republic of the Congo and Indonesia through Landsat satellite images.

The main initiatives for mapping and monitoring of the Amazon biome are presented below in Table 1. These data serve as a reference for mapping of the Amazon biome through the MapBiomas project.  

Table 1. Reference maps utilized in the MapBiomas Amazon project.

Systems for Monitoring and Mapping Acronym Start of

Activities Objective Website

Projeto de Monitoramento do desmatamento na Amazônia Legal por Satélite

PRODES 1988 Generate annual rates of deforestation

http://www.obt.inpe.br/prodes/index.php

Detecção de Desmatamento em Tempo Real

DETER 2004 Monthly deforestation alerts

http://www.obt.inpe.br/deter/

Sistema de Alerta de Desmatamento do Imazon

SAD 2006 Monthly deforestation alerts and forest degradation

http://www.imazongeo.org.br/imazongeo.php

Global Land Analysis and Discovery GLAD 2016 Monthly deforestation alerts

http://glad.umd.edu/

Levantamento de informações do uso e cobertura da terra na Amazônia

TerraClass 2004 Map land use and cover

http://www.inpe.br/cra/projetos_pesquisas/dados_terraclass.php

Mapeamento da Degradação Florestal na Amazônia Brasileira

DEGRAD 2007 Annual mapping of forest degradation

http://www.obt.inpe.br/degrad/

Red Amazónica de Información Socioambiental Georreferenciada

RAISG 2010 Map deforestation and pressure factors on the forest

https://raisg.socioambiental.org/

Global Forest Watching GFW 1997 Map loss and gain of forests

http://www.globalforestwatch.org/

 

3. Algorithm Descriptions, Assumptions, and Approaches

The algorithms, mathematical models for processing of Landsat images, assumptions and a methodology applied in the Amazon MapBiomas project are presented in the sections below. First we present the algorithms for ImgTools, developed by Imazon and which serve as the basis for processing the Landsat images in order to generate Collections 1 and 2 of MapBiomas

10

Page 11: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

Amazonia. The adjustments and improvements applied in those collections are presented in the correspondent sections of those collections.

3.1 ImgTools Algorithms

Collection 1 of the MapBiomas Amazon project, which covered the period of 2008 to 2015, was generated based on methods developed by Imazon (Souza et al., 2005; 2013, Souza and Siqueira 2013; Souza 2012). The processing follows the approach originally implemented in the ImgTools software, which was re-codified in the MapBiomas Workspace platform for processing in Earth Engine. We initially presented the ImgTools approach and the adaptations and improvements developed in MapBiomas Amazonia.

ImgTools was developed to integrate fundamental image processing steps in order to accomplish classification of very large Landsat data sets across space and time. These steps include: (1) Pre-processing; (2) Construction of a spectral library; (3) Spectral Mixture Analysis; (4) Building Empirical Decision Tree; (5) Post-classification. All these steps are explained in detail below (Figure 2). Step 1 – Pre-Processing. The first step in the methodology consisted of producing annual data sets of Landsat images for mapping in a per scene basis. The Landsat data used had already been orthorectified and radiometrically corrected (atmospheric correction and removal of smoke noise using the Carlotto (1999) method (Souza Jr. and Siqueira, 2013). Step 2 – Construction of a spectral library. This consisted of identifying pure spectral components (endmembers) in order to estimate their abundance in each pixel. The endmembers of interest for forest monitoring are: green vegetation, non-photosynthetic vegetation (NPV), soil, cloud and shadow. The process for identifying and creating the spectral library involves identifying the spectral curves that have the potential to be endmembers. Next, a visual inspection of those curves is made for the final selection of the set of those pure components. Detailed information about these procedures are described in Souza et al., (2005). Step 3 – Spectral Mixture Analysis (SMA). The reflectance images processed in Step 1 are combined with the endmembers obtained in Step 2 to estimate the abundance of those pure components in each pixel of the image. To do that, we applied the SMA technique, using the spectral library of those pure components. The fraction images resulting from the SMA were used to calculate the NDFI (Normalized Differencing Fraction Image) (Souza Jr. et al., 2005), and combined in a classifier through a generic empirical decision tree in the next stage. Step 4 – Empirical Decision Tree Classification. We built the decision tree empirically in order to classify the deforested areas, degraded forests, water bodies, shadow and cloud (Figure 3). As an entry for classification we used the fraction images obtained with the MME and the NDFI spectral index NDFI.

11

Page 12: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

Figure 3. Methodology applied in processing Landsat images and implemented in the ImgTools software.

  Step 5 – Post-Classification. Spatial and temporal filters were constructed in order to remove spurious classifications and non-allowed temporal transitions between mapping classes, respectively. The spatial power makes it possible for given classes to be defined together with their least mappable area. That way, isolated pixels classified in a given image can be eliminated and substituted by the dominant class in their neighborhood. The temporal filter is a set of rules for non-allowed transitions that are applied to each image classified in a given year. That way, it is possible to remove clouds and correct non-allowed transitions.

3.2 Collection 1: MapBiomas Amazonia

3.2.1 Algorithm Collection 1 of the MapBiomas Amazon project was generated based on collections of

Landsat images (5, 7 and 8) with standard for correction L1T, available at the Earth Engine 1

platform. Processing of Landsat images was conducted on the Google Earth platform following the stages described in Figure 1. Those stages were implemented in the MapBiomas Workspace Platform (Figure 4), following the processing steps and the algorithms of ImgTools for calculating the SMA, NDFI, empirical decision tree and spatial and temporal filters. The principal difference in applying those algorithms in Collection 1 do MapBiomas is in the mapping area that went from scene to mosaic (see Stage 1 below for more details). [English translation of table below on page 52]

1 Landsat image collection IDs for: i) Landsat 5: ; ii) Landsat 7: …., and Landsat 8.

12

Page 13: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

Figure 4. Stages for processing of Landsat images in order to generate Collection 1 of the MapBiomas Amazon

project.

3.2.2. Data availability and screening

The mapping unit for MapBiomas is the International Map of the World to the Millionth on the scale of 1:250.000, covering an area of 1°30’ of longitude by 1° of lattitude (Figure 5). A total of 294 charts cover the Amazon biome, and 2 to 6 Landsat scenes are necessary to cover the area of each chart, with the great majority having 4 scenes. Table 2 shows the number of images utilized in each year analyzed for the Amazon biome. For each chart, the following parameters of the Landsat images are defined:

i) the year of mapping (from 2008 to 2015 for that phase of the project). ii) the period during the year for selecting images (t0 and t1 on day/month/year) que that enables the maximum spectral contrast between the forest and non-forest classes. In the case of the Amazon biome, we concentrated mainly (>90%) on the period of June 1 to October 31 for selecting the Landsat images.

iii) the maximum cloud cover in the Landsat images that will be utilized in processing, which was 30% for the majority of charts, although in areas with less frequent cloud cover we utilized up to 70% of cloud cover.

iv) the type of sensor (Landsat 5 [L5], Landsat [L7], and Landsat 8 [L8]). 

13

Page 14: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

 Figure 5. Distribution of mapping charts for MapBiomas Amazonia.

Those parameters are obtained from the metadata file of the collections of Landsat

images available on the Earth Engine platform. We utilized collections of ortho-rectified surface reflectance images (L1T) for L5 and L7, with reflectance from the top of the atmosphere for L8.  Table 2. Number of images utilized per year, sensor and maximum percentage of cloud cover accepted for Collection 1.

Year Number of

Landsat images Amount size

(GByte) Biome data cover

(%)

2008 1815 726.0 97

2009 2004 801.6 97

2010 1883 3,766.0 95

2011 1644 657.6 95

2012 1883 3,766.0 95

2013 1870 3,740.0 93

2014 1976 3,952.0 95

2015 2140 4,280.0 97

Total 15215 17,408.0 -

The application of the parameters described above (period and cloud cover) made it

possible to obtain a set of scenes from Landsat images for a given year, in a defined period,

14

Page 15: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

covering the area of the chart selected for processing (Figure 6). That set of Landsat scenes, shown in Figure 6, is processed with the objective of

generating a mosaic [temporal] with the lowest cloud cover for each chart, optimizing the period of greatest spectral contrast of the land cover classes that will be mapped. One may observe that the central-southern portion of the arc of deforestation has the largest number of Landsat scenes with more than 6 scenes for the period of June 1 to October 31. In the northern portion of the Amazon biome, we can observe gaps with scenes having up to 30% of cloud cover. For those areas, we had to expand the cloud percentage (up to 70%) in order to obtain the temporal mosaic.

Figure 6. Coverage of Landsat scenes to make up the temporal mosaic of the charts processed for the Amazon biome for the period of 2008 to 2015, with up to 30% of cloud cover.

The Landsat scenes selected in order to generate the mosaics for the charts were then processed to generate the temporal mosaic. Initially, a buffer zone is applied along the scene of the Landsat image to remove possible edge noises. Landsat scenes that do not have data on all bands are considered to be noise. The next step is to apply an algorithm for detecting and removing clouds, based on spectral mixture modeling (MEM; see Stage 3 for details regarding that method), and images of the thermal band of the electromagnetic spectrum (Band 6 for L5 and L7, and Band 10 for L8; all converted to the Celsius temperature scale). Pixels with a cloud fraction greater than or equal to 10% and temperature lower than or equal to 22 degrees Celsius are detected initially as clouds.

A surrounding region of 10 pixels (300 meters, considering a Landsat pixel Landsat as

15

Page 16: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

30m) is generated along the areas initially detected as having having clouds, utilizing a circular matrix filter (kernel). That process allows detection of areas with shadow generated by the clouds originally detected. In those areas the cloud fraction percentage in the pixel diminishes, and because of that, all the pixels surrounding the clouds, with values greater than or equal to 7% of the cloud fraction are added to the areas initially classified as clouds. That process generates the areas with clouds for all of the Landsat scenes that make up the area of the chart, for the year and period selected. The areas classified as Clouds are ignored in the subsequent processing through generation of a mask applied to those regions. Finally, a median filter is applied to the pixels not covered by clouds in the images, to compose the final image of the chart, known as the temporal mosaic.

3.2.3. Classification scheme and parameters The inputs for classification were generated using the MEM technique, which makes it

possible to estimate composition of the pixel in terms of pure components (spectral signatures). The MEM applied in MapBiomas utilizes a spectral library defined by five pure components: Vegetation, Non-Photosynthetically Active Vegetation, Soils and Clouds (Figure 7). That spectral library is the same one utilized in the ImgTools software, defined by Souza et al. (2005).

(a) (b)

 Figure 7. Spectral curves of the pure components utilized in the MME applied to Brazilian biomes; (a) for Landsat 5 and 7 in surface reflectance, (b) for Landsat 8 for reflectance at the top of the atmosphere.

 The MME assumes that the reflectance of each pixel, in each band of the Landsat image,

results from the linear combination of the product of the percentage of pure component (i.e., proportion in the pixel) by its respective value in the band, based on the equation:

Rb = Fi Ri,b + εb (1)

for

ΣFi = 1 (2)

16

Page 17: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

where Rb is a reflectance in band b, Ri,b is a reflectance of the pure component i, in band b, Fi the fraction (or proportion) of the pure component I, whose sum is 1 (or 100%), and εb is the residual error of each band.

The MME is resolved to estimate the Fi values (i.e., the fraction or proportion of the pure component), thus obtaining Fraction images for Vegetation, Non-Photosynthetically Active Vegetation, Soil and Cloud, which were utilized for calculating the NDFI spectral difference, obtained by the normalized different of the Vegetation fraction image and the sum of the Non-Photosynthetically Active Vegetation and Soil fraction images, according to the equation below:

Where,

and,

The Fraction and NDFI images are utilized as entry parameters for classifying ground cover

(see Stage 4 below, and legend) and to generate masks for clouds in order to generate the temporal mosaic (Stage 2).

The Fraction images obtained with the MME are utilized to generate the ground cover map for classes of Level 1 of the Classification Legend, including: Forest, Non-Forest, Water and Cloud. Classification is conducted according to three steps (Figure 1). First, we apply the cloud mask described above, to map areas covered by clouds that could not be filtered during the process of generating the temporal mosaic for the chart. Next, we apply the mask for water based on the following criteria: pixels with Shadow Fraction greater than 75%, and Vegetation less than 10% and Soil less than 5%, and that are not in areas of shaded relief (we detected shaded relief using the Hill Shadow model and illumination data from each scene available in their metadata). Finally, we apply a decision tree (empirically defined) to map the forest and non-forest areas. The cloud and water masks are combined with the map resulting from the decision tree to generate the final map (Figure 8). The statistical distribution of the parameters is presented below in Figure 9.  

17

Page 18: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

  Figure 8. Empirical decision tree applied to the NDFI, GV and GVs to map the classes of Forest and Non-Forest. Masks for Water and Cloud are applied to make up the final map. The variables (V1, V2, V3, V4) are adjusted for each chart.

 

 Figure 9. Frequency distribution of the values assumed by variables v1, v2, v3 and v4.

Application of the decision tree generates the land cover map with the Forest, Non-Forest, Water and Cloud classes. The maps generated in the Classification and Land Cover Stage are subjected to processes of spatial and temporal filtering. The spatial filter segments and indexes the Forests, Non-Forest and Water classes in contiguous regions that are later on identified and reclassified based on the following criterion: areas smaller than or equal to half a hectare (i.e., approximately 5 pixels) are reclassified based on the majority of the neighboring classes. Thus, for example, a segment of the Non-Forest class with up to 5 pixels is identified, the neighboring pixels are identified, and the segment is reclassified with the predominant value of the neighboring class of pixels (Figure 7). That process is applied for all of the segments of the classes selected for filtering (i.e. Forests, Non-Forest and Water).

The next step is to utilize temporal information to identify transitions of classes that are not allowed between consecutive years. For example, if a pixel is classified as Forest in a given

18

Page 19: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

year ti (with i = 2008, 2009, ..., 2015), and Non-Forest in year ti+1 and in ti+2 , that pixel is reclassified as Forest in year ti. There are several rules for transitions listed in Table 3 that are applied by the temporal filter.

 Table 3. Rules for reclassification applied by the temporal filter.

Rules Input (Year) Output

General ti ti+1 ti+2 ti+5 ti ti+1 ti+2 ti+3

RG-1 NF F F F F F F F

RG-2 NF F NF NF NF NF NF NF

RG-3 NF SI * * NF NF * *

RG-4 NF A NF NF NF NF NF NF

RG-5 F SI * * F F * *

RG-6 F A F F F F F F

RG-7 F NF F F F F F F

RG-8 A F F F F F F F

RG-9 A NF NF NF NF NF NF NF

RG-10 A * A A A A A A

RG-11 A SI * * A A * *

RG-12 * NF SI * * NF NF *

19

Page 20: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

RG-13 * F SI * * F F *

RG-14 * A SI * * A A *

RG-15 * * NF SI * * NF NF

RG-16 * * F SI * * F F

RG-17 * * A SI * * A A

Specific Rules

2012 2013 2014 2015 2012 2013 2014 2015

RE-1 NF NF F NF NF NF NF NF

RE-2 NF NF A NF NF NF NF NF

RE-3 F F NF F F F F F

RE-4 F F A F F F F F

NF = Non-forest; F = Forest; A = Water; SI = No information or Cloud; * = Any class; i = {2008, 2009, 2010, ..., 2015}.

After application of the spatial and temporal filters, the resulting land cover maps were evaluated based on analyses of spatial concordance with maps [reference] of the biomes published by other initiatives (Table 1). The Prodes deforestation maps Prodes were selected for analysis of the spatial agreement. A 5km x 5km grid was created to compare the proportion of forest area classified by the MapBiomas maps and the reference maps. Those cells of the analysis grid are classified according to three levels of agreement: High (>80%), Medium (80-60%) and Low (>60). That procedure was applied to a set of 10 charts and the results point to a high correspondence between those results for the Forest class.

Finally, the maps for areas of agricultural crops, pastures and planted forests, independently generated with automatic classifiers based on statistical decision trees, were integrated with the map of forests produced for Collection 1 of MapBiomas Amazonia. Those maps were combined with the land cover maps of the Amazon biome to generate the final consolidated map of Collection 1.

20

Page 21: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

3.2.4. Data volumes and processing issues The volume of Landsat imagery data processed for Collection 1 of the Amazon Biome

reached 17.408 GigaBytes. These data generated several outputs that comprise XXX TeraBytes, and are available at MapBiomas platform.  

3.3 Collection 2 - MapBiomas Amazonia Collection 2 covered the period from 2000 to 2016 and followed the image process

protocol described for Collection 1. The differences in the image processing approach are highlighted in this section.

3.3.1. Algorithm

The algorithm applied to Collection 2 followed the image processing steps applied to Collection 1. There are changes regarding the decision trees and new rules to temporal filter. A new empirical decision tree was built to deal with the new classification scheme of Collection 2.

Figure 10. Empirical decision tree applied to Collection 2 for the amazon biome.

 

21

Page 22: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

Two empirical decision trees were defined - one that does not include Non-Forest Natural Formation class (Figure 10A) and another that does (Figure 10B). The only structural difference in these two trees is in node 4-1 (NDFI>=5), which is split in the second decision tree into Agriculture or Pasture and Non-Forest Natural Formations classes (Figure 10B). A new set of temporal filters rules was developed to deal with the new land cover classes (Table 4).  

Table 4. Temporal filter rules applied to Collection 2 land cover classes.

Rule Type Kernel Notes t-2 t-1 T t+1 t+2 Result

RG01 RG 3 F-N-F null 3 27 3 null 3

RG02 RG 3 F-AP-F null 3 15 3 null 3

RG03 RG 3 F-FNNF-F null 3 10 3 null 3

RG04 RG 3 AP-F-AP null 15 3 15 null 15

RG05 RG 3 AP-N-AP null 15 0 15 null 15

RG06 RG 3 AP-FNNF-AP null 15 10 15 null 15

RG07 RG 3 AP-A-AP null 15 26 15 null 15

RG08 RG 3 FS-N-FS null 8 0 8 null 8

RG09 RG 3 FNNF-N-FNNF null 10 0 10 null 10

RG10 RG 3 FNNF-A-FNNF null 10 26 10 null 10

RG11 RG 3 ANV-F-ANV null 22 3 22 null 22

RG12 RG 3 ANV-N-ANV null 22 0 22 null 22

RG13 RG 3 A-N-A null 26 27 26 null 26

22

Page 23: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

RG14 RG 3 A-FNNF-A null 26 10 26 null 26

RG15 RG 3 A-AP-A null 26 15 26 null 26

RG16 RG 3 FS-FNNF-FS null 8 10 8 null 8

RG17 RG 3 A-ANV-A null 26 22 26 null 26

RG18 RG 3 ANV-FS-ANV null 22 10 22 null 22

RG19 RG 3 FNNF-FS-FNNF null 10 8 10 null 10

RG20 RG 3 FNNF-FD-FNNF null 10 3 10 null 10

RP01 RP 3 N-A-A null 27 26 26 null 26

RU01 RU 3 A-A-N null 26 26 27 null 26

 

3.3.2. Data availability and screening  

The same mapping unit (i.e., 1 degree by 1.5 degree map sheet area) was used for Collection 2. The main difference was in selection of the temporal window to generate temporal image mosaics. For Collection 2, we used data collected for each map sheet during 2000 through 2016 to define the period for selecting Landsat images to build the temporal mosaic. The optimal period for the Amazon biome was from June 1 to October 31. Therefore, the initial selection of a time period for building image mosaics was automatic. At that point, image analysts inspected the results and made changes to improve the mosaics (i.e., reduce cloud and/or increase surface observation areas within the mosaic) if necessary. The total amount of Landsat scenes processed per year to produce the image mosaics is presented in Table 5. Table 5. Number of images utilized per year, sensor and maximum percentage of cloud cover accepted for Collection 1.

Year Number of

Landsat images

Amount size (GByte)

Biome data cover

(%)

2000 1657 662.8 97 2001 2112 844.8 85

23

Page 24: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

2002 1384 553.6 90 2003 1156 462.4 93 2004 1536 614.4 97 2005 1484 593.6 95 2006 1559 623.6 93 2007 1403 561.2 93 2008 1815 726.0 97 2009 2004 801.6 97 2010 1883 3,766.0 95 2011 1644 657.6 95 2012 1883 3,766.0 95 2013 1870 3,740.0 93 2014 1976 3,952.0 95 2015 2140 4,280.0 97 2016 1680 3,360.0 95

Total 29186 29965.6 -

 

Figure 11.Coverage of Landsat scenes to make up the temporal mosaic of the charts processed for the Amazon biome for the period of 2000 to 2016, with up to 30% of cloud cover.

3.3.3. Classification scheme and parameters The classification scheme for Collection 2 of MapBiomas Amazonia is presented in Table 6.

These classes are a sub-set of the whole MapBiomas classification scheme and will be the main input for classification integration with other classes (which is discussed in the MapBiomas Integration ATBD).

24

Page 25: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

Table 6. Classification scheme of Collection 2 for the Amazon Biome.

Value Color Color code Class

0 #FFFFFF No Information

3 #006400 Dense Forest

6 #76A5AF Inundated Forest

7 #29EEE4 Degraded Forest

8 #77A605 Secondary Forest

10 #FF9966 Nature Non-Forest Formations

14 #FFFFB2 Agriculture and Pasture

22 #EA9999 Non-Vegetated Areas

26 #0000FF Water Surface

27 #D5D5E5 Unobserved

 

3.3.4. Data volumes and processing issues  

The volume of Landsat imagery data processed for Collection 2 of the Amazon Biome reached XXXX TeraBytes. These data generated several outputs that include: Collection 2 Asset for the Amazon Biome, composed of temporal mosaics of Landsat imagery, SMA, NDFI and derived spectral indices. These datasets together comprise XXX TeraBytes, and are available at the MapBiomas platform.

4. Validation Strategies

The methodology for analyzing the accuracy of the annual land cover and use maps of the Amazon was applied with the following objectives: a) define the sample plan; and b) conduct an analysis of the accuracy of Collections 1 and 2 of the annual mapping (in other words, of all of the years of mapping those collections) of land cover in Brazil performed by the project:

The sampling plan is the methodology for calculating the number of samples and for allocating the sampling points necessary for estimating the accuracy of mapping of Brazilian biomes.

The analysis of accuracy is the estimate of the Fraction of the correctly classified number of pixels from mapping, their sampling errors and confidence intervals, besides the estimate of the real area occupied by each class of land cover.  

25

Page 26: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

For the Amazon biome a number of 938 samples per year was defined for the period of 2008 to 2015, which were interpreted by 3 evaluators (Figure 11).  

 Figure 12. Spatial distribution of the sampling points for analyzing the accuracy of Collections 1 and 2 of MapBiomas Amazonia.

Landsat images were utilized for visual interpretation of the sampling points (Figure 11). AccTools software was utilized to collect and visually interpret those points. A grid of 3x3 pixel points was employed, using the location of the sampling as the central pixel of that grid as a reference. Thus, the resulting sample is formed by the central pixel and its neighboring pixels, totaling 9 pixels, in other ways, an area of approximately one hectare (Figure 12).

26

Page 27: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

Figure 13. Sampling scheme and interpretation grade of Landsat pixels utilized in visual interpretation for analysis of accuracy.

The samples from each Collection were interpreted by three independent evaluators. In each pixel, interpretation is made of the fractions occupied by each class of land cover in the pixel. The fraction occupied by classes not identified by the evaluator is included in the “other” class. For the accuracy analysis, the dominant class visually interpreted with the Landsat images will be taken as the reference class of the pixel. The dominant class is the class with the largest proportion in the pixel, automatically calculated by the system.

After data collection using AccTools, the data collected are exported to the Excel format, for analysis of map accuracy. Besides the proportions of the pixel occupied by each class of cover, the following information is extracted: biome, dominant class, page, pixel number in the block, sample number, geographic coordinate of the sample and observations.

The Amazon presented an overall accuracy of 95% for Collection 1, considering all of the pixels analyzed. In evaluating the historical series of 2008 to 2015, the accuracy results presented values greater than 90%, reaching 96% in 2008 (Figure 13). The accuracy analysis in Collection 2 has not yet been concluded.

27

Page 28: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

 Figure 14. Preliminary result of the accuracy analysis of the Amazon biome for Collection 1.

5. Map Collections and Analysis The annual time-series classification dataset produced for MapBiomas Amazonia can be

used in several types of spatial Analyses. For that purpose, only Collection 2 will be used, since it represents a more robust classification dataset as compared to the previous one. An example of the classification panel for Collection 2 is presented in Figure 14. Applications of these datasets have not yet been conducted. A potential list of such applications includes:

1. Quantification of gross annual deforestation. 2. Quantification of annual forest degradation. 3. Annual quantification of secondary growth forests. 4. Quantification of annual forest fragmentation. 5. Quantification of annual net forest balance. 6. Land cover and land use transitions. 7. Spatial analyses of agreement and disagreement among existing forest and land cover

maps. 8. Quantification of the natural dynamics of water surfaces.

Figure 15. Data classification panel for Collection 2 MapBiomas Amazonia (A), and a more detailed view of the classification map for 2008 (B).

28

Page 29: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

6. Practical Considerations This is a live document and we expect to update it as soon as Collection 2 becomes

available. New information about the results will be included.

7. Concluding Remarks and Perspectves The proposal algorithms for pre-processing and classifying Landsat imagery hold promise

for revolutionizing the production of land cover and land use maps at a large scale. Thanks to Google Earth Engine and open source technology it is possible to access and process large scale datasets of satellite imagery such as the one presented in this document. The replication of this type of project is viable for other areas of the planet. The next step of this project is to expand mapping and monitoring of the Pan-Amazon and other tropical forest regions. This will involve producing Collection 3, which will include a large time-series (1985 to the present). Future developments include using the entire spectral-temporal information of Landsat data and integration with other sensors such as Sentinel-2 and AWiFs-Resourcesat.

                

29

Page 30: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

8. References

Almeida et al. 2016. High spatial resolution land use and land cover mapping of the

Brazilian Legal Amazon in 2008 using Landsat-5/TM and Modis data. Acta Amazônica, 46(3),

291-302.

Carlotto, M. J. Reducing the effects of space-varying, wavelength dependent scattering in

multispectral imagery. International Journal of Remote Sensing, v. 20, n. 17, p. 3333-3344, 1999.

Fearnside, P.M. 2006. Desmatamento na Amazônia: dinâmica, impactos e controle. Acta

Amazônica, 36, 395-400.

INPE – Instituto Nacional de Pesquisas Espaciais, 2016. Projeto TerraClass – Dinâmica do

uso and cobertura da terra para o período de 10 anos nas áreas desflorestadas da Amazônia

Legal Brasileira. (http://www.inpe.br/cra/projetos_pesquisas/dados_terraclass.php) Acesso in

03/13/2017.

MMA – Ministério do Meio Ambiente, 2013. Biomas. (www.mma.gov.br/biomas). Access

on 03/13/2017.

Souza Jr., C., Roberts, D. A. and Cochrane, M. A. Combining spectral and spatial

information to map canopy damage from selective logging and forest fires. Remote Sensing of

Environment, v. 98, n 2-3, p. 329-343. 2005a.

Souza Jr., C. and Siqueira, J. V. ImgTools: a software for optical remotely sensed data

analysis. In: XVI Simpósio Brasileiro de Sensoriamento Remoto (SBSR). Foz do Iguaçu-PR. 8p,

2013.

30

Page 31: Amazon Biome ATBD Rde...mainly covered by Dense Tropical Rainforest, characterized by evergreen vegetation and a high density of shrubby and arboreal species. The region is marked

 

9. Appendices

9.1. Supplementary Material     

9.2. Algorithm Code

 

 

 

 

31