extraction of buildings in urban areas from very high resolution satellite images

Upload: miusay

Post on 03-Apr-2018

213 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/28/2019 Extraction of Buildings in Urban Areas From Very High Resolution Satellite Images

    1/5

    AUTOMATIC EXTRACTION OF BUILDINGS IN URBAN AREAS FROM VERY HIGH

    RESOLUTION SATELLITE IMAGES

    M. Ghanea a, *, P. Moallem b, M. Momeni a

    a

    Dept. of Surveying Eng., Faculty of Eng., University of Isfahan, Isfahan, Iran - (m.ghane, [email protected])b Dept. of Electrical Eng., Faculty of Eng., University of Isfahan, Isfahan, Iran - [email protected]

    Session C1

    KEY WORDS: Building Extraction, Very High Resolution (VHR) Satellite Images, Urban Area, K-means Clustering, Region

    Growing, Boundary Improvement

    ABSTRACT:

    Due to developing satellite technology, extraction of man-made features such as roads and buildings has become a topic of interest

    for photogrammetric and remote sensing communities. Extraction of buildings in urban areas is a complicated problem due to the

    complexity of shapes, textures, and contexts. In order to dealing with this problem, an object-based image analysis is used togenerate homogeneous regions from a pixel-based image, which is called image segmentation. This article introduces a region-based

    image segmentation algorithm based on k-means clustering in order to extract buildings. The algorithm contains three steps: (1)

    clustering procedure, (2) image segmentation, and (3) boundary improvement. The approach is evaluated using a case study in

    Tehran, Islamic Republic of Iran. Very high resolution GeoEye satellite imagery was used in the case study. Experimental results

    show that the proposed algorithm extracts 76.1 % of building areas with a quality percentage 62.9 % in an urban area.

    * Corresponding author. This is useful to know for communication with the appropriate person in cases with more than one author.

    1. INTRODUCTIONAutomatic extraction of man-made objects from satellite images

    is a significant research task in photogrammetry and remote

    sensing. With a rapid increase of satellite technology, it is

    urgent to extract man-made objects, such as buildings and

    roads, from satellite images. Data acquisition for mapping andGeospatial Information System (GIS) by photogrammetric

    methods has traditionally been carried out in manual extraction

    of cartographic features from images of the terrain surface

    ranging in scale from 1:3000 to 1:90000 (Sowmya and Trinder,

    2000). Although this strategy is efficient under the viewpoints

    of accuracy and precession, it is generally time-consuming and

    expensive, what certainly have limited the amount, resolution,

    and revision cycles of terrain information that can be extracted

    by using current digital photogrammetric systems. (Dal Poz and

    do Vale, 2003). In this context, building extraction has

    remained as an important issue of research.

    Up to now, fully automatic algorithms for building extraction

    seem to be far away from a plenary state and, consequently, no

    such operational system is expected to be available in nearfuture. Image segmentation is the process of dividing an image

    into homogenous regions, which is a neccessary step for higher

    level image processing such as image analysis, pattern

    recognition, and automatic image interpretation (Blaschke and

    Strobl, 2001). So far, there are over 1000 types of developed

    segmentation methods (Zhang, 2001). General segmentation

    methods consist of global behaviour-based and local behaviour-

    based methods (Kartikeyan et al., 1998). Global behaviour-

    based methods group the pixels based on the analysis of the

    data in the feature space. Typical examples are clustering and

    histogram thresholding. Local behaviour-based methods

    analyze the variation of spectral features in a small

    neighbourhood. Typical examples are edge detection and

    region extraction (Fu and Mui, 1981).

    However, as reported in (Li et al., 2008), not all of the

    segmentation methods are possible for high resolution satellite

    imagery due to the following facts:

    1. This imagery is multi-spectral and multi-scale, so boththe complexity and redundancy are increased

    obviously.

    2. The imagery provides the more details such as color,shape, context and texture.

    3. Different class has its inherent features in differentscale. E.g. at coarse scales we may find fields, while

    at finer scales we may find individual trees or plants.

    So the segmentation model on one scale must be

    modified when used on the other scale.

    In this article, a fully automatic algorithm is introduced for

    building extraction from high resolution multispectral GeoEye

    imagery. For this purpose, we present a region-based image

    segmentation procedure together with a boundary improvement

    algorithm. The proposed algorithm is evaluated for a case study

    in Tehran, Islamic Republic of Iran.

    The remainder of this article is organized as follows: Section 2

    explains the proposed algorithm in detail. Section 3 presentsexperimental results, and Section 4 concludes the work.

    2. OVERVIEW OF THE ALGORITHMFigure 1 depicts a flow diagram of the proposed algorithm. At

    first, clustering procedure is used in order to divide an original

    image into a binary clustered image including building and non-

    building layers. A k-means clustering algorithm is applied

    because it is able to use all of the multi-spectral bands for

    providing a single band clustered image. In the second step, we

    apply a kind of region-based image segmentation which

    includes k-means clustering and single thresholding, seed point

    generation, and region growing. In the final step, boundary of

  • 7/28/2019 Extraction of Buildings in Urban Areas From Very High Resolution Satellite Images

    2/5

    extracted buildings is improved by a logical process. Next

    sections describe each step in detail.

    Figure 1. Flow diagram of the proposed algorithm

    2.1 Clustering ProcedureIn order to divide an original image (see Figure 3.a) into a

    clustered image (see Figure 3.b), a well-known clustering

    procedure referred to as k-means clustering (Gray and Linde,

    1982; Seber, 1984; Spath, 1985) is used. An overview of the k-

    means clustering algorithm is shown in Figure 2. The

    procedure is initialised by selecting the predefined number of

    centeroids (k) in the multi spectral space. The selection of these

    centeroids at this stage is arbitrary with exception that no two

    may be the same. The location of each pixel in the image is

    examined and the pixel is assigned to the nearest candidate

    centeroid. This assignment would be made on the basis of the

    Euclidean distance measure. Then, the new set of centeroids is

    computed. This two-phase procedure is repeated until it reaches

    the maximum iteration (m). Two parameters k and m must be

    specified beforehand by the user. The generated clustered

    image includes building and non-building layers.

    Figure 2. Overview of the k-means clustering algorithm

    (a)

    (b)

    Figure 3. Applying k-means clustering with k = 2 and m = 4 on

    the original image. (a) Original image, and (b)

    Clustered image

    2.2 Image SegmentationSince the generated building layer contains buildings and their

    adjacent similar spectral features such as roads, shadows, cars,

    yards etc, firstly, we must separate all features as it is possible

    and then extract building features. For this purpose, a region-

    based image segmentation algorithm is used. The image

    Predefined number

    of clusters

    Centroids determination

    Compute distance to centroids for

    each building pixel

    Assigning each building pixel to

    its nearest centroid

    > Maximum

    iterationNo

    Yes

    Original image

    Clustered image

  • 7/28/2019 Extraction of Buildings in Urban Areas From Very High Resolution Satellite Images

    3/5

    segmentation consists of three steps. At first, k-means

    clustering divides the building layer into predetermined number

    of clusters (see Figure 4.a). The predetermined number is

    selected on the basis of the building variation in size, shape, and

    color. Each cluster contains a set of discontinuous regions. In

    order to separate small regions from big ones, the histogram of

    all regions is plotted in order to represent cumulative

    distribution of pixels for each region (see Figure 4.b). If thenumber of pixels belonging to a region is less than a user-

    defined threshold, then this region is removed. The resulting

    image is shown in Figure 4.c.

    (a)

    (b)

    (c)Figure 4. Applying k-means clustering and thresholding on the

    building layer, respectively. (a) K-means clustering,

    (b) Histogram for the discontinuous regions, and (c)

    Thresholding (regions are shown with different

    colors)

    At second, seed point generation is implemented automatically

    in order to provide seed points for region growing. Firstly, the

    original image is converted into an 8-bit gray scale image. Then

    the position of a seed point for each region is defined as

    follows:

    =

    rgni )i,yig(x

    rgni )i,yi.g(xiyySeed

    rgni )i,yig(x

    rgni )i,yi.g(xix=xSeed

    (1)

    where rgn is an evaluation region and g(xi,yi) is the gray scale

    value of the ith point which belongs to the region. Both spatial

    (i.e. (xi,yi)) and homogeneity (i.e. g(xi,yi)) information take part

    in defining seed points. So seed points as starting points are

    more suitable than centeroids for a region growing stage. The

    generated image depicts regions with their seed points (see

    Figure 5).

    Figure 5. Seed point generation and region growing. (a)

    Regions, and (b) Regions with their seed points

    In the final stage, a region growing method is performed to

    extract buildings. For this purpose, the process starts at each

    seed point in the image with one-pixel objects, and in numerous

    subsequent steps, smaller image objects are merged into bigger

    ones (Carleer et al., 2005). The similarity condition is defined

    by means of a logical statement if pixels in the region are

    similar enough in terms of spectral variance property as follows: