Coweeta LTER created incremental land cover classifications along five year intervals for the southern Appalachian Mountains in order to assist researchers efforts to study change in land cover over time. Recently the United States Geologic Survey (USGS) has published land cover for this area, the National Land Cover Database (NLCD), dated 2001. Coweeta adopted USGS methodologies to create land cover datasets for 1986 onward using available Landsat imagery. In the future as imagery becomes available, Coweeta LTER plans to create another classification every 5 years, beginning with 2006, 2011, etc.

View and Download Land Cover

Developing Land Cover Classification
The NLCD 2001 dataset was developed by the Natural Resources Spatial Analysis Laboratory (NARSAL) at the Institute of Ecology, University of Georgia in Athens, Georgia. Coweeta LTER has worked in collaboration with NARSAL to develop the new classifications using methods as close as possible to those used in developing NLCD 2001. This paper is a detailed description of those methods, created as a reference for persons creating future land cover classifications.

Table of Contents

Chapter 1: Description of NLCD Land Cover Classes
Chapter 2: Preparing Imagery
Chapter 3: Digitize Cloud Areas, Create Cloud Masks
Chapter 4: Preparing DEMs
Chapter 5: Preparing Point Data
Chapter 6: Classifying images using NLCD classifier and See5
Chapter 7: Preparing Wetland Classification
Chapter 8: Classifying urban areas using NLCD classifier and Cubist


Chapter 1: Description of NLCD 2001 Land Cover Classes
The 2001 National Land Cover Database is a thematic raster image with each pixel assigned an integer value that represents a particular type of land cover. Each discrete value and its associated land cover is described on the USGS land cover website.

USGS describes the classification as a modification of the Anderson land-use and land-cover classification system. The Anderson system was modified for applicability to Landsat image classification, as many level III classes are best derived from aerial photography, and some level II classes are best consolidated for accuracy.

NLCD Land Cover Class Definitions
Water - All areas of open water or permanent ice/snow cover.
11. Open Water - all areas of open water, generally with less than 25% cover of vegetation/land cover.
12. Perennial Ice/Snow - all areas characterized by year-long surface cover of ice and/or snow.

Developed Areas characterized by a high percentage (30 percent or greater) of constructed materials (e.g. asphalt, concrete, buildings, etc).
21. Low Intensity Residential - Includes areas with a mixture of constructed materials and vegetation. Constructed materials account for 30-80 percent of the cover. Vegetation may account for 20 to 70 percent of the cover. These areas most commonly include single-family housing units. Population densities will be lower than in high intensity residential areas.
22. High Intensity Residential - Includes highly developed areas where people reside in high numbers. Examples include apartment complexes and row houses. Vegetation accounts for less than 20 percent of the cover. Constructed materials account for 80 to100 percent of the cover.
23. Commercial/Industrial/Transportation - Includes infrastructure (e.g. roads, railroads, etc.) and all highly developed areas not classified as High Intensity Residential.

Barren - Areas characterized by bare rock, gravel, sand, silt, clay, or other earthen material, with little or no "green" vegetation present regardless of its inherent ability to support life. Vegetation, if present, is more widely spaced and scrubby than that in the "green" vegetated categories; lichen cover may be extensive.
31. Bare Rock/Sand/Clay - Perennially barren areas of bedrock, desert pavement, scarps, talus, slides, volcanic material, glacial debris, beaches, and other accumulations of earthen material.
32. Quarries/Strip Mines/Gravel Pits - Areas of extractive mining activities with significant surface expression.
33. Transitional - Areas of sparse vegetative cover (less than 25 percent of cover) that are dynamically changing from one land cover to another, often because of land use activities. Examples include forest clearcuts, a transition phase between forest and agricultural land, the temporary clearing of vegetation, and changes due to natural causes (e.g. fire, flood, etc.).

Forested Upland - Areas characterized by tree cover (natural or semi-natural woody vegetation, generally greater than 6 meters tall); tree canopy accounts for 25-100 percent of the cover.
41. Deciduous Forest - Areas dominated by trees where 75 percent or more of the tree species shed foliage simultaneously in response to seasonal change.
42. Evergreen Forest - Areas dominated by trees where 75 percent or more of the tree species `maintain their leaves all year. Canopy is never without green foliage.
43. Mixed Forest - Areas dominated by trees where neither deciduous nor evergreen species represent more than 75 percent of the cover present.

Shrubland - Areas characterized by natural or semi-natural woody vegetation with aerial stems, generally less than 6 meters tall, with individuals or clumps not touching to interlocking. Both evergreen and deciduous species of true shrubs, young trees, and trees or shrubs that are small or stunted because of environmental conditions are included.
51. Shrubland - Areas dominated by shrubs; shrub canopy accounts for 25-100 percent of the cover. Shrub cover is generally greater than 25 percent when tree cover is less than 25 percent. Shrub cover may be less than 25 percent in cases when the cover of other life forms (e.g. herbaceous or tree) is less than 25 percent and shrubs cover exceeds the cover of the other life forms.

Non-Natural Woody - Areas dominated by non-natural woody vegetation; non-natural woody vegetative canopy accounts for 25-100 percent of the cover. The non-natural woody classification is subject to the availability of sufficient ancillary data to differentiate non-natural woody vegetation from natural woody vegetation.
61. Orchards/Vineyards/Other - Orchards, vineyards, and other areas planted or maintained for the production of fruits, nuts, berries, or ornamentals.

Herbaceous Upland - Upland areas characterized by natural or semi-natural herbaceous vegetation; herbaceous vegetation accounts for 75-100 percent of the cover.
71. Grasslands/Herbaceous - Areas dominated by upland grasses and forbs. In rare cases, herbaceous cover is less than 25 percent, but exceeds the combined cover of the woody species present. These areas are not subject to intensive management, but they are often utilized for grazing.

Planted/Cultivated - Areas characterized by herbaceous vegetation that has been planted or is intensively managed for the production of food, feed, or fiber; or is maintained in developed settings for specific purposes. Herbaceous vegetation accounts for 75-100 percent of the cover.
81. Pasture/Hay - Areas of grasses, legumes, or grass-legume mixtures planted for livestock grazing or the production of seed or hay crops.
82. Row Crops - Areas used for the production of crops, such as corn, soybeans, vegetables, tobacco, and cotton.
83. Small Grains - Areas used for the production of graminoid crops such as wheat, barley, oats, and rice.
84. Fallow - Areas used for the production of crops that do not exhibit visable vegetation as a result of being tilled in a management practice that incorporates prescribed alternation between cropping and tillage.
85. Urban/Recreational Grasses - Vegetation (primarily grasses) planted in developed settings for recreation, erosion control, or aesthetic purposes. Examples include parks, lawns, golf courses, airport grasses, and industrial site grasses.

Wetlands - Areas where the soil or substrate is periodically saturated with or covered with water as defined by Cowardin et al.
91. Woody Wetlands - Areas where forest or shrubland vegetation accounts for 25-100 percent of the cover and the soil or substrate is periodically saturated with or covered with water.
92. Emergent Herbaceous Wetlands - Areas where perennial herbaceous vegetation accounts for 75-100 percent of the cover and the soil or substrate is periodically saturated with or covered with water.

Overview of Methods
The process used for development of Nationall Land Cover Databases (NLCD) differs from many conventional classification methods in its use of training data. Training data are point data, with each point corresponding to a specific pixel of the image to be classified. The pixels associated with training points are a subset of total pixels in the image, and each point is assigned one of the NLCD land cover classes by hand, with the operator looking at the Landsat imagery and Digital Ortho Quarter Quad (DOQQ)aerial photography to determine the best fit class. The points should be clustered on DOQQs and should be numerous enough to proportionally represent all land cover classes. The DOQQs should be spread evenly over the image to represent all different types of areas. Once each point is assigned a classification, the point file is converted to a raster and is used as training data in the See5 or Cubist programs, programs which create regression models for classification based on the training data. Urban classes are created in a separate process from non-urban classes, as the urban classification is processed using continuous data representing percent impervious cover (0% - 100%) and the non-urban classification is processed using thematic data (discrete land cover classes). The See5 model creates regressions based on thematic data and is used for the non-urban classification, while Cubist creates continuous data regressions and is used for the urban classification.

Classification is performed using three different Landsat images as independent variables: Leaf-on (June-August), Leaf-off (December-February), and Spring (March-April). Additionally, a Digital Elevation Model (DEM) and aspect and slope derived from the DEM may also be used as independent variables. These images are particularly helpful in mountainous areas, such as the Southern Appalachian area in which Coweeta is interested.

Description of Software Requirements
Primarily, ERDAS IMAGINE and the See5 and Cubist models are the software programs used to perform the classification. Additionally, ArcGIS is useful for classifying point training data and general data viewing purposes, while Arc/INFO and the associated GRID programs are easier than IMAGINE for a few specific coverage and raster processes.

Southern Appalachian Study Area
The Southern Appalachian study area that is of interest to Coweeta research covers parts of Georgia, Tennessee, North Carolina, and Virginia. The study area used for classification covered each of these counties plus a 5 kilometer buffer around the entire area (adding a small portion of Kentucky). The study area is shown below.



Chapter 2: Preparing Imagery
Before beginning the classification process, Landsat imagery must be selected and purchased from USGS and prepared for classifying. This section will cover pre-classification preparation.

Selection of Imagery
Imagery can be browsed and selected through USGS Online Catalog.

Some types of imagery can be ordered online, but for this classification, terrain-corrected imagery is required and must be ordered by phone. A US Government-Affiliated User form must be completed for access to this highly-accurate imagery. The appropriate satellite data set is either Landsat 4-5 TM or Landsat 7 ETM+, depending on the required dates. The Southern Appalachian study area is covered by 7 Landsat scenes: path 17 rows 34 and 35, path 18 rows 34, 35, and 36, and path 19 rows 35 and 36. Three seasons of imagery are required, leaf-on, leaf-off, and spring. Three images must be ordered for each path/row combination, for a total of 21 scenes. Ideally, images would contain no cloud, snow, or haze cover, but most images are somewhat cloudy. In order to find high-quality imagery, a three-year window is allowed around the target date, one year on each side. For instance, in the 1996 classification, images were selected from 1995, 1996, or 1997. Images selected should have as little cloud, snow, or haze cover as possible, as these areas cannot be classified. It should be noted that the study area does not cover all of any of the scenes, and clouds in locations not in the study area are not a factor. Nearly none of the images is completely cloud-free, and if the best image for one season contains some clouds, try to select images for the other seasons that do not have clouds in the same area. Once the 21 scenes are selected, order terrain-corrected imagery using Coweeta’s preferred format:

Data Format: BSQ
File Format: EOSAT Fast
Image Orientation: Map
Pixel Size: 30m
Resampling Tech.: NN
Projection: UTM
Zone: 17N
Datum: NAD27

In 2006, the price for terrain-corrected images was $625 per scene, with a reduced price of $310 for second and third scenes adjacent to another scene on the same path and on the same date.

Importing Imagery
After receiving imagery from USGS on CDs, copy each CD to the location on your hard drive/network drive where you will be working. Single scenes will be on one CD; multi-scenes will be on multiple CDs. Multi-scenes from USGS are single images, with different bands on different CDs, as opposed to different scenes on different CDs with all bands of the same scene on one CD. IMAGINE importer can import directly from EOSAT Fast format, but it has trouble with multi-scene images. Thus the single-scene images will be imported using a different method than the multi-scenes.

Single-scene importing
Open the import dialogue in IMAGINE by clicking the ‘Import’ icon on the IMAGINE toolbar. Select ‘import’ at the top, then select Landsat TM Fast (EOSAT) from the dropdown menu.

Mosaicing Imagery
After importing each scene into IMAGINE, the scenes for each season must be mosaiced together. For the classification, it is imperative that pixels from each scene line up exactly with pixels from adjacent scenes, as well as overlay exactly with scenes from different seasons. If all imagery was ordered at the same time and in the same projection from USGS, then pixels should already line up.

Aligning Images
To correct misaligned images, the misaligned image will be shifted slightly so that the pixels line up. Begin by opening a properly aligned and the misaligned image in two different IMAGINE viewers. Geolink the two viewers by right clicking in one, selecting geolink, and then left clicking in the second viewer. Zoom in to an area in which the two images overlap, showing a few pixels in each viewer. Locate a single pixel in the properly aligned image and its corresponding pixel in the misaligned image. Unlink the viewers and zoom in as far as possible to one corner of the chosen pixel in the aligned image. Open the inquire cursor in that viewer and move the crosshairs as close as possible to that corner; the inquire cursor will show the X and Y coordinates of the pixel corner. In the second viewer, zoom in as far as possible to the same corner of the corresponding chosen pixel; the coordinates will be different as the images are not aligned. Once zoomed in on the pixel corner of the misaligned image, be sure that the arrow cursor is selected, click on ‘Raster’ at the top of the viewer and select ‘Set Drop Point.’ Click as closely as possible to the pixel corner. The image offset box will open; type in new X and Y coordinates that are shown in the inquire cursor window of the properly aligned image and click OK. The image will be shifted to align with the other. Repeat this process for all misaligned images.

Mosaic
Once all images are properly aligned, the 7 scenes for each season should be mosaicked using the IMAGINE mosaic tool. Click on the ‘DataPrep’ icon on the IMAGINE toolbar, select ‘Mosaic Images’ and then ‘Mosaic Tool.’ Add all 7 images for a single season to the mosaic tool, then adjust the order of overlap so that better images (less cloud cover) are on top of worse images in areas of overlap. In the ‘Set Overlap Function’ dialogue, be sure to select ‘Overlay.’ All other default settings should be acceptable. Run the mosaic. Repeat for the two remaining seasons.

Subset mosaic
The mosaiced images should be subset to the area of interest. You will need a shapefile of the area of interest for this operation; for the southern Appalachian study area, the area of interest is all counties in the study area plus a 5 km buffer. Open a shapefile/coverage of the counties in ArcMap, merge them all into one polygon, then buffer the polygon by 5 kilometers. In this initial operation, subset only one of the mosaicked images. The two remaining images will be subset in the IMAGINE Modeler using the initial subset image to ensure that the exact pixels are contained in each subset.

Open the image to be subset (leaf-on) and the shapefile of the study area in the IMAGINE viewer. Select the study area polygon in the shapefile. In the AOI (area of interest) tools menu, select ‘convert polygon to aoi.’ This will create an *.aoi of the study area. Click on the ‘DataPrep’ icon on the IMAGINE toolbar and select ‘Subset Image.’ Add the image to be subset (leaf-on) to the dialogue box, click ‘aoi,’ and select ‘from viewer.’ The study area aoi must be selected in the viewer for this operation. Name the output file and run the subset operation.

Create image of study area
After subsetting the first image, you will need to create a thematic image of the study area containing 1 layer, with a value of 1 for pixels in the study area and a value of 0 for pixels outside of the study area. This image will be invaluable for many subsequent operations, and will ensure that every subset image contains exactly the same pixels.

Open IMAGINE Modeler. Insert one input raster and one output raster, with one process between. Use the initial subset image (leaf-on) as the input raster and create a name for the output raster such as ‘studyarea.img.’ Be sure that the output raster is thematic unsigned 8-bit. Click on the process circle. In the dropdown menu in the top right, select ‘conditional,’ then select the ‘EITHER’ statement below. The EITHER statement will appear in the lower box; change this statement to:

EITHER 1 IF (<subsetmosaic_layer1> gt 0) OR 0 OTHERWISE

Only one layer of the input image is necessary, any layer will work. The output image ‘studyarea.img’ will have a value of 1 if the pixel is in the study area, 0 if it is outside.

Subset remaining 2 mosaic images
The remaining 2 mosaicked images (leaf-off, spring) will be subset in IMAGINE Modeler. This time use two input rasters and one output raster with one process between. Use the image to be subset as one input and ‘studyarea.img’ as the second input. Run process with the conditional statement:

EITHER <inputmosaic> IF (<studyarea> gt 0) OR 0 OTHERWISE

Run model on both remaining mosaics. You will end up with subset images for all 3 seasons, each having the exact same pixel content.


Chapter 3: Digitize Cloud Areas, Create Cloud Masks
Areas in images covered by clouds, cloud shadows, snow, or haze cannot be classified. It is necessary to digitize an AOI for these areas in all 3 images, so that they can be removed later. You will create 3 different AOIs, one for each image.

Digitize Cloud Areas
Open the leaf-on image (subset to study area) in the IMAGINE viewer. Click on the AOI menu on the viewer toolbar and open the AOI Tools toolbar. Click on the icon representing ‘create polygon aoi.’ After zooming in to an appropriate scale, draw polygons around all clouds, cloud shadows, snow, or significant haze (probably no snow in leaf-on image). Save the AOI with a name such as ‘leafon_cloudaoi.aoi.’ Repeat for leaf-off and spring images, resulting in 3 *.aoi files.

Create Cloud Masks
In the IMAGINE viewer, open ‘studyarea.img’ and ‘leafon_cloudaoi.aoi.’ Select all polygons in the AOI. Using the subset function under the ‘DataPrep’ icon on the IMAGINE toolbar, subset ‘studyarea.img’ using the AOI in the viewer. Change the output name to leafon_clouds.img. The resulting thematic image will have a value of 1 for areas containing clouds in the leaf-on image and 0 for all other areas. This image will be used later to remove cloudy areas from the classification. Repeat the process for leaf-off and spring cloud AOIs, resulting in 3 cloud masks.


Chapter 4: Preparing DEMs
Digital elevation Models (DEMs) can be very useful in classification when used in combination with Landsat imagery, particularly in areas with significant topographical variation such as the Appalachian Mountains, and are available free from USGS. Additionally, aspect and slope images can be created from DEMs and also used in classification. Because DEMs do not change over time like land cover, the same DEM, aspect, and slope files can be used for multiple years of image classification, unless more accurate versions of the same data are released in the future.

Downloading Data
The DEM data that is best for purposes here is the USGS National Elevation Dataset (NED), and is downloadable free from http://seamless.usgs.gov. Select only NED in the download menu on the right, then click and drag a download box over the desired area in the viewer. There is a limit to the amount of data that can be downloaded at one time, so it may take multiple sessions to get the entire desired area. The NED files will download in multiple tiles, which must be mosaicked together.

Reprojecting Data
After NED data has been downloaded for the entire study area, each individual tile should be reprojected to Coweeta’s preferred projection, UTM zone 17N, NAD 27. Reproject in IMAGINE, using the ‘reproject’ tool under the ‘DataPrep’ icon on the imagine toolbar. Because the data is in multiple tiles, it may be useful to use the batch process function in the reprojection window.

Mosaic Tiles
The reprojected tiles should be mosaicked together in IMAGINE to create one image. Using the IMAGINE mosaic tool, add all tiles. Change the output image type to ‘unsigned 16-bit,’ as the default 8-bit does not have enough range of values. Run the mosaic.

Subset Mosaic
The resulting mosaic will not align properly with the Landsat imagery. Shift the DEM to line up with the imagery, using the inquire cursor on the Landsat image and set drop point on the DEM, as described previously in aligning images. Subset the properly aligned DEM in the IMAGINE modeler using the same method used for the second and third Landsat images.

Generating Aspect and Slope
The IMAGINE Modeler has functions to calculate aspect and slope from an elevation image. Open modeler and create a model with one input raster, one output raster, with one process between. Select the DEM image as the input raster. Slope and aspect functions are under the ‘Surface’ function list. For slope, select PERCENT SLOPE (<raster> , <units>). Replace <raster> with the input DEM and replace <units> with ‘meters.’ Change the output raster type to ‘continuous’ and ‘Unsigned 16-bit,’ then select a name and location for the output. Run the model.

The aspect model will look the same, but the function should be ASPECT (<raster>), also from the ‘Surface’ list. Replace <raster> with input DEM. Change output raster type to ‘continuous’ and ‘Unsigned 16-bit,’ then select a name and location for the output. Run the model.

Chapter 5: Preparing Point Data

Classifying Non-Urban, Non-Wetland Points
The point data will be the training data for the classification of non-urban areas of the Landsat imagery. The points are clustered on Digital Ortho Quarter-Quad (DOQQ) aerial photos, and the DOQQs are spread across the study area to form a representative sample of all types of land cover. The points are classified by the user with the help of the aerial photos and the Landsat imagery, and are then used as training data in the classification model. Points were generated for the NLCD 2001 classification, and the same points will be used for subsequent Coweeta classifications.

Classifying Points
The easiest way to classify points is to open an ArcMap document and add the shapefile containing the points, all three landsat images, and the DOQQs. Adding only a few DOQQs at a time will save time by keeping the ArcMap document small. Create a new field in the point shapefile for land cover class. Open the editor toolbar, click ‘Start Editing,’ and select the point shapefile. This will allow you to add values to the new landcover field. Working with one DOQQ at a time, zoom in to each point, looking at the entire pixel of the landsat image that it overlays and the DOQQ for that area and assign a classification to that point. The classes are described in more detail above, but the only ones of use at this time are: 11 – water, 31 – barren, 41 – deciduous forest, 42 – evergreen forest, 43 – mixed forest, 52 – scrub/shrub, 71 – grassland, 81 – pasture, and 82 – row crop. Some classes are not applicable to the Southern Appalachian area, and wetland and urban classes will be determined at a later time. After selecting a class for a particular point, input the class numeric value into the new land cover class field of the shapefile. This process takes a bit of practice. DOQQ’s are taken in winter and most are color infrared, so photosynthetic activity will appear red. Thus evergreen forests (42) will appear red, deciduous forests (41) will not. However, much deciduous forest has an evergreen understory, and classification becomes more difficult. Understory evergreen will have a finer texture than evergreen canopy; individual trees should be discernable on the DOQQ for evergreen forests. Row crop (82) and pasture (81) are also difficult to distinguish, and may change from year to year. In general, row crops will be brighter and not reddish on the landsat image as more bare soil shows than crop cover. Also, plow rows may be visible on DOQQs. Grassland (71) and scrub/shrub (52) are generally considered as a continuum between clearcut and forest regrowth. Most grassland will be recently clearcut areas, most scrub/shrub will be clearcuts a few years old, and older clearcuts will fall under forest classes.

Convert Point Shapefile to Point Coverage
Convert point shapefile to point coverage in Arc/INFO. Open Arc/INFO workstation and change workspace to the location of the point shapefile. On the Arc command line, type:

shapearc <inputpointshapefile> <outputpointcoverage>

This command will convert the shapefile to a point coverage, maintaining all field data. The new coverage must be built. On the Arc command line, type:

build <pointcover> point

The coverage will now be ready for use.


Convert Point Coverage to GRID
To convert the point coverage to a raster image, it will first need to be converted to GRID format. In order to make sure that the pixels of the point image line up precisely with the pixels of the Landsat imagery and the DEM created earlier, the study area image used to subset the imagery and DEM will need to be converted to GRID format. The point coverage will then be snapped to that GRID during conversion. Change the workspace to the location of the study area by typing on the Arc command line:

ws drive:path/foldercontainingstudyarea

Convert the study area image to a grid, placing the grid in the workspace containing the point coverage by typing:

imagegrid <studyareaimage.img> <drive:path/foldercontainingpointcover/studyareagrid>

Change the workspace to the location containing the new GRID and the point coverage. Enter GRID by typing ‘grid’ on the Arc command line. On the GRID: command line, enter:

setwindow <studyareagrid> <studyareagrid>

The first ‘studyareagrid’ sets the extent of your GRID window to the extents of the GRID ‘studyareagrid,’ the second sets the snap grid to ‘studyareagrid.’ This command will ensure that the pixels of the GRID created from the point coverage will align properly with the study area and the landsat images

Still in GRID, convert the point coverage to a GRID by entering on the command line:

<outputgrid> = pointgrid(<pointcoverage>, <classfield>, #, #, 30)

‘Outputgrid’ is the name of the new GRID, ‘pointcoverage’ is the name of the coverage of point data, ‘classfield’ is the name of the field in the coverage with the classification values, and 30 is the size of the output pixels.

Convert GRID to Image
Using IMAGINE importer, import the point GRID to an IMAGINE image.

Enter Wetland Points into Point image
Because wetland areas cannot be accurately discerned by eye from DOQQ’s or Landsat imagery, they will be put in after classifying points. This will be accomplished by finding any forest points (41, 42, 43) that lie over NWI wetlands and changing them to forested wetland class (90) and any grassland or scrub-shrub points that lie over wetlands and changing them to emergent wetland class (95). For 2001 NLCD classification, USGS-contracted labs created wetland classifications using these wetland points and a series of models setting parameters to limit areas allowed to be classified as wetland. This method is beneficial in that wetland classes are determined from the imagery. However, wetlands constitute a relatively small area in the Southern Appalachians, and this sort of modeling is not necessary for Coweeta’s purposes. Coweeta will take forested and non-forested wetlands directly from NWI data at the end of the classification rather than classify them from the imagery. With this method, the wetland points can essentially be eliminated from the point data. Nonetheless, the process of reclassifying them will be described here, though USGS-method models will not.

The NWI wetland image is divided into open water, forested wetland, and emergent wetland. We will ignore these divisions as our determination of classes should supersede these divisions and only consider wetland = yes or wetland = no. Open the IMAGINE modeler and enter 2 input images and one output image, with one process between. The modeler must be run twice, once for forested wetland and once for emergent; first will be forested. For the input files, use the NWI wetland image and the points image. For the process, enter the conditional statement:

EITHER 90 IF (<pointimage> gt 40 AND <pointimage> lt 44 AND <nwiimage> gt 0) OR <pointimage> OTHERWISE

The output of this model will be ‘pointsintermediate.’ Run the model a second time, with the inputs NWI wetland image and pointsintermediate. For the process, enter the conditional statement:

EITHER 95 IF (<pointsintermediate> gt 51 AND <pointsintermediate> lt 72 AND <nwiimage> gt 0) OR <pointsintermediate> OTHERWISE

Output ‘pointsfinal.’ The output image will have all forest points (41, 42, 43) occurring over wetlands converted to forested wetland (90), and all grassland (71) and scrub/shrub (52) points occurring over wetlands converted to emergent wetland (95).


Chapter 6: Classifying Non-Urban Areas using NLCD classifier and See5

Non-urban, non-wetland areas will be classified with the See5 model, using the point image as training data. This process involves three steps: the CART Sampling Tool, the See5 Model, and the CART Classifier.

Initial Classification

CART Sampling Tool
Click on the CART icon on the IMAGINE toolbar and select ‘CART Sampling Tool.’ For independent variables, select the leaf-on TM image, the leaf-off TM image, the spring TM image, the DEM image, the aspect image, and the slope image. For the dependent variable select the points image. Type 0 for ‘Ignore values.’ Under ‘Sampling Number,’ select ‘percent’ and type 80 under ‘Training(%)’ and 20 under ‘Validation (%).’ Select a location and name for the output *.names file (other output files will automatically go to same place). Select See5 at the bottom and click ‘OK.’ This process will create *.names, *.data, and *.test files.

See5 Model
After the CART Sampling Tool finishes, open See5 (outside of IMAGINE). Click the ‘Locate Data’ icon on the toolbar and run the See5 model. This model will create several other files with the same name as the *.names file. Elaborate

CART Classifier
After the See5 model finishes, click the CART icon on the IMAGINE toolbar and select ‘CART Classifier.’ Select the *.names file created earlier as the input and select ‘See5.’ Select ‘Tree’ and select a path and name for the output image. Do not use a mask file at this point; click ‘OK.’ The output will be a thematic, 1-layer IMAGINE image containing the classification.

Revise/Troubleshoot Classification
The initial classification will probably not be so good. Problematic, particularly in mountainous areas, is over-estimation of water areas due to shadows and over-estimation of wetlands. To rectify this problem, reduce the number of classes to water, forest, and non-forest and re-classify, then subdivide the forest and non-forest areas.

Create Base Classification
Reduce the number of classes in the point image to three: water (11), forest (41, 42, 43, 90), and non-forest (31, 52, 71, 81, 82, and 95). This will be done changing the values in the point image: 41, 42, 43, and 90 will be changed to 41; 31, 52, 71, 81, 82, and 95 will be changed to 81. Open IMAGINE modeler and insert one input raster and one output raster, with one process between. Choose the points image for the input raster and choose a name and path for the output. In the process, select ‘Conditional’ on the Functions list, then select ‘CONDITIONAL.’ Change the function to:
CONDITIONAL { (<pointsfinal> eq 11) 11 , (<pointsfinal> eq 31) 81 , (<pointsfinal> eq 41) 41 , (<pointsfinal> eq 42) 41 , (<pointsfinal> eq 43) 41, (<pointsfinal> eq 52 ) 81 , (<pointsfinal> eq 71 ) 81 , (<pointsfinal> eq 81) 81 ,( <pointsfinal> eq 82) 81 , (<pointsfinal> eq 90) 41 , (<pointsfinal> eq 95) 81 }

Run the model. When the model finishes, re-run CART Sampling Tool, See5, and CART Classifier, using the new reclassified point image as the dependent variable.

Masking Cloudy Areas
Check the base classification, particularly in areas that are cloudy on one or more of the images. Clouds tend to return barren (31, 81 in this case), and cloud shadows tend to return water (11). In problem areas, the model will need to be re-run omitting the image with clouds as an independent variable, then using the resulting classification as a replacement in the problem area of the base classification.

For cloudy areas in the leaf-on image, run Cart Sampling Tool using only leaf-off TM image, spring TM image, DEM image, aspect image, and slope image as independent variables, and use the 3-class points image as the dependent variable. Run See5 model and CART Classifier with the output files. This process will create a classification without the problem of clouds from the leaf-on image.

In areas of leaf-on clouds or shadows, substitute this classification into the base classification. Open IMAGINE modeler and insert 3 input rasters, 1 output raster, with one process between. The three input rasters will be the original base classification, the base classification omitting the leaf-on image, and the image of leaf-on clouds created earlier under ‘Create Cloud Masks.’ Choose a name and location for the output classification. Select ‘Conditional’ under the Function list and choose the ‘EITHER’ statement. Change the statement to:

EITHER <baseclass_leaf-on_omitted> IF (<leaf-oncloudmask> gt 0 ) OR <baseclass_original> OTHERWISE

Run the model. The output will have the leaf-on cloudy areas replaced. Repeat the process for problem areas such as clouds and shadows on other images.

Sub-dividing Forest classification
Once all problem areas have been improved in the base classification, create a mask of the forested areas using IMAGINE modeler. In the modeler, put in one input raster, one output raster, and one process between. Use the final base classification as the input and name the output ‘forest_mask.img.’ In the process box, use the EITHER statement from the conditional function list as follows:

EITHER 1 IF (<baseclass> eq 41 ) OR 0 OTHERWISE

The output will have a value of 1 in forested areas and 0 elsewhere, and will be used as a mask in the CART Classifier.

Also needed in this step is a points image containing only points with values of 41, 42, or 43. Create another model with one input raster, one output raster, and one process. Use the points image containing all values (11, 31, 41, 42, 43, 52, 71, 81, 82, 90, and 95) as the input raster and name the output raster ‘forest_points.img.’ In the process box, use the EITHER statement from the conditional function list as follows:

EITHER <points_image> IF (<points_image> gt 40 AND <points_image> lt 44) OR 0 OTHERWISE

The output image will contain only pixels classified as 41, 42, or 43 from the point data. These values are the only ones needed because you are only classifying forested areas.

Run the CART Sampling Tool exactly as done in creating the base classification, with the 3 landsat images, DEM, aspect, and slope as independent variables, forest_points as dependent variable, 80% training/20% verification, and selecting See5. When the sampling tool finishes, run the See5 model as before. When See5 finishes, run CART Classifier as before, but use ‘forest_mask.img’ as a mask layer.

The resulting classification will subdivide the original forested area into its constituent classes. Because the forest classes are much closer together, in terms of spectral signature, than the 3 classes of the base classification, be sure to check this classification closely for accuracy. Re-run CART Sampling Tool, See5, and CART Classifier 3 more times, each time omitting one of the Landsat images as an independent variable. This process was done earlier during base classification to mask cloudy areas in one or more of the images, and that may still be helpful, but due to inherent inconsistencies between images and the spectral closeness of these classes, one of these classifications may be better than the one using all 3 Landsat images. Check closely against imagery to find the best. It may be helpful to run CART Sampling Tool, See5, and CART Classifier 3 additional times, using only one of the Landsat images as independent variable each time. Find the best classification by checking against the imagery. A final note on this process: the leaf-off image can be very good at the deciduous/evergreen division, as evergreen areas will still be photosynthetically active at that time of year.

After selecting the best classification from the group, check for cloudy areas in the images used in that classification. If there are cloudy areas, replace them as done in the base classification, using a classification that omitted the image in question and the appropriate cloud mask.

Sub-dividing Non-forest Classification
Re-classifying non-forest, non-water areas is done exactly the same as dividing forest classification, only in several iterations. First scrub/shrub (52) will be separated from other non-forest areas, then grassland (71) will be separated from the remaining non-forest, then barren land, pasture, and cropland will be split at the same time.

To subdivide scrub/shrub (52), you will need a non-forest mask and a points image containing only points with classes 31, 52, 71, 81, and 82, but with 31, 71, 81, and 82 all coded as 81. Create the mask from the base classification in IMAGINE modeler as done earlier for ‘forest_mask.img,’ and call it ‘non-forest_mask.img.’ Recode the points image (containing all classified points) in modeler using the conditional statement:

CONDITIONAL { (<pointsfinal> eq 11) 0 , (<pointsfinal> eq 31) 81 , (<pointsfinal> eq 41) 0 , (<pointsfinal> eq 42) 0 , (<pointsfinal> eq 43) 0, (<pointsfinal> eq 52 ) 52 , (<pointsfinal> eq 71 ) 81 , (<pointsfinal> eq 81) 81 ,( <pointsfinal> eq 82) 81 , (<pointsfinal> eq 90) 0 , (<pointsfinal> eq 95) 0 }

Name this output file ‘class52_points.img.’ Run CART Sampling Tool, See5, and CART Classifier as before, using ‘class52_points.img’ as the dependent variable in CART Sampling Tool and ‘non-forest_mask.img’ as a mask layer in CART Classifier. Again, run the classification tools and models again up to 6 times, omitting one or two of the images each time. Check against imagery to select the best classification. Most of the scrub/shrub should appear in areas that were somewhat recently clearcut, and should be apparent on the imagery. The southwestern portion of the Southern Appalachian study area has the most intense clearcuts; check this area carefully.

Again check for clouds, but they should be less of a problem due to the relatively small number of pixels that will be class 52.

After creating the best scrub/shrub classification possible, begin on the grassland (71) classification. Create a non-forest, non-scrub/shrub mask in modeler using the final scrub/shrub classification as input and naming the output ‘forest52_mask.img.’ In the process box, enter the conditional statement:

EITHER 1 IF (<class52image> eq 81) OR 0 OTHERWISE

Create a new points image containing only classes 31, 71, 81, and 82, with 31, 81, and 82 coded as 81. Use the original points image as input and run the conditional statement:

CONDITIONAL { (<pointsfinal> eq 11) 0 , (<pointsfinal> eq 31) 81 , (<pointsfinal> eq 41) 0 , (<pointsfinal> eq 42) 0 , (<pointsfinal> eq 43) 0, (<pointsfinal> eq 52 ) 0 , (<pointsfinal> eq 71 ) 71 , (<pointsfinal> eq 81) 81 ,( <pointsfinal> eq 82) 81 , (<pointsfinal> eq 90) 0 , (<pointsfinal> eq 95) 0 }

Name the output file ‘class71_points.img.’ Run CART Sampling Tool, See5, and CART Classifier just as before, as many times as necessary to find the best classification. Check closely against imagery; class 71 grassland should also be concentrated at clearcuts, but in areas more recently clearcut than class 52. Check the southwestern area well. Check for clouds, and replace areas as necessary.

Finally, classes 31, 81, and 82 will be divided. Repeat the processes as before, creating a points image containing only classes 31, 81, and 82 and creating a mask of non-forest, non-scrub/shrub, and non-grassland areas. Run the models with as many variations on input imagery as necessary and check the classifications carefully against imagery. Pasture and row crop are difficult to distinguish, and the same area may be different on different images. However, it should be apparent that some areas have generally more row crop, and some have more pasture. (check agriculture statistics). Barren areas should be apparent on imagery as exceptionally bright.

Overlay Non-urban classification
All of the non-urban classifications should now be overlaid using IMAGINE modeler. Begin using 71 classification and 31/81/81 classification as inputs. Name the output ‘class31718182.img’ and run the EITHER statement from the conditional function list as follows:

EITHER <classification_318182> IF (<classification_71> eq 81) OR <classification_71> OTHERWISE

Run again using ‘class31718182.img’ and 52 classification as inputs. Name the ouput ‘nonforestclassification.img’ and run the EITHER statement from the conditional function list as follows:

EITHER <class31718182.img> IF (<classification_52> eq 81) OR <classification_52> OTHERWISE

Finally, run IMAGINE modeler again, this time with 3 input rasters: base classification, forest classification, and non-forest classification. Name the output ‘landcover_nourban.img’ and choose the CONDITIONAL statement from the conditional function list in the process box. Enter in the process box:

CONDITIONAL { (<baseclass> eq 11) 11 , (<baseclass> eq 41) <forestclassification> , (<baseclass> eq 81) <nonforestclassification> }

The result will be the final classification of non-urban, non-wetland areas.




Chapter 7: Preparing Wetland Classification
As mentioned earlier, USGS used fairly complex models to classify wetlands from imagery, but for Coweeta’s purposes, National Wetland Inventory data will be used instead of models.

(in-progress)


Chapter 8: Classifying urban areas using NLCD classifier and Cubist

Description of urban classification (in-progress)