Building Extraction by Stroke Width Transform from Satellite Imagery

Xu, Li; Kong, Mingming; Pan, Bing

doi:10.1007/978-981-10-7305-2_30

Li Xu^16,17,
Mingming Kong¹⁸ &
Bing Pan¹⁸

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 773))

Included in the following conference series:

CCF Chinese Conference on Computer Vision

2557 Accesses
4 Citations

Abstract

This paper proposes a novel building extraction method from satellite imagery. Intuitively, the symmetry and regularity of architecture could be used to detect as building area, which benefited from the stroke width transform (SWT) algorithm. Meanwhile, the roof of building are very different with rural area (such as vegetation, wild, etc.) in color space, which can be partitioned by K-means clustering method. The area clustering can obtain the consistency region to complement the discontinuity from SWT algorithm detection. Then, A superpixel generation algorithm is adapted to yield the color distribution of building region, and final building area is able to extract accurately. Different with existing methods, the proposed method performs on a single source satellite image without any other supplement information. Experiment on a large number of satellite imagery demonstrates the efficiency of the proposed method for building extraction.

You have full access to this open access chapter, Download conference paper PDF

Building extraction and change detection from remotely sensed imagery based on layered architecture

Article 13 August 2019

Urban Extraction Based on Multi-scale Building Information Extra-Segmentation and SAR Coherence Image

Building Detection from Satellite Aerial Images Using Morphological Operations

Keywords

1 Introduction

Wireless communication system is explosively growth application in urban environment. An appropriate radio propagation model is significant to predict the signal communication which is influenced by building’s shadowing, reflection, absorption and fading. Building distribution is an important factor to modify the radio propagation model in simulation environment. Building density analysis is valuable for predicting the signal propagation and visualization, and simulation parameters of radio propagation model can be modified [1,2,3,4,5]. Therefore, accuracy automation building extraction from satellite imagery is helpful to design the wireless propagation model.

The satellite imagery has many advantages, such as the macroscopic imaging, low cost of approximating in horizontal projection objects and easily to be digitized, which became more and more popular to research in computer vision [6, 7]. In recent years, many researches have been focusing on building extraction from image. These methods can be coarsely divided into three classes. The first one is extracting building by some human interaction (semi-automatic) [8,9,10,11]. Based segmentation method requires much more user workload, which is impractical for the large segmentation task. Tian [11] adopts optimal parameters (shape, spectral bands, from-related) from the given type of objects (road, building, sports filed) to process the multi-resolution segmentation on high-resolution satellite image. The experiment proves that this algorithm is capable of producing with the comparable result. However, Suggested optimal parameters are usefulness and should be preliminarily demonstrated.

In order to decrease the user workload, some methods are proposed to extract building from different types imagery information (single-view or multi-views), such as optical image and radar data [10, 12]. Rottensteiner [13] creates a 3D building model form the high-resolution lidar data. The skew error distribution function is introduced to compute the digital terrain mode by separating points of building from other object classes. After analyzing the height difference with digital surface model, building is derived by curvature-based segmentation techniques. Guo [14] locates the boundary of building by a snake-based approach from the high-resolution IKONOS satellite imagery and captures the height data by airborne laser scanning system. Teimouri [15] identifies an optimal fusion of radar and optical image. The decision-level fusion is used to assess the classification of panchromatic, multi-spectral, and radar images. Then, author applies an artificial neural network to classify those feature, and the decision level of integration for the building detection purpose.

Since multi-source image is expensive to obtain, which limited to be widely applied for ordinary users. Then, the third method according to the principle of elements acquirement (region-based or edge-based) are proposed, that could be reduced the cost of equipment. There are many methods to automatically extract the building area by this way, such as morphological-based methods [16,17,18], region and edge detection-based methods [19,20,21,22]. Huang and zhang [16, 17] rely on the morphological shadow index (MSI), and the jointly use morphological building index (MBI) for the spatial constraint of building. The dual-threshold approach of their research improves for building extraction form background without collection any training samples and process of supervised leaning. Their method was proved more accurately than the support vector machine interpretation. Jin [18] presented an automated building-extraction strategy via mathematical morphology informations (the structural, contextual, and spectral) from the high-resolution satellite imagery. This system need not provide any initial algorithm parameters, training set and pre-classification. Jiang and Zhang [19] propose a semi-automatic building extraction method to get building by edge detection and region-driven. This method result heavily relies on the image’s spectral information. Lin [20] describes a system to detect the building in aerial scenes. After Low-level segmentation, a perceptual grouping approach is used to collect fragment segments from shape properties, and the shadow is used to form and verify the hypotheses region. Gamba [21] classifies the urban area mapping separately by a neural network and an adaptive Markov random field model. Decision fusion process combines with the classification mapping to exploit objects boundaries. Lee [22] uses the classification result of Ikonos multi-spectral image and Hough transformation to detect and form the building boundary. Extraction error is mainly caused by the road class as well as the occlusion and shadow. These methods usually utilize color, space, textual, and spectral information to extract building area. In other works, the successful building extraction of these existing region-based or edge-based methods is usually based on the assumption that the source image is in high-resolution and easily distinguished form the target area. However, when the image is in low resolution and with some noise information, such as the available google or network satellite mapping with slight fog and intense illumination, these methods will be not suitable to segment building region from around the environment.

Stroke width is related to the linear feature which has applied into extracting road’s networks. Doucette [23] presents a self-supervised road classification method to detect road centerline from high resolution multi-spectral imagery. Candidate road centerline is detected by the anti-parallel-edge. It is crucial for this method that needing the sufficient number of correct roads as training samples. The author reviews a variety of techniques which identified the linear, planimetric feature form imagery based on spectral, spatial and radiometric rules. Epshtein [24] detected the text in many fonts and languages use stroke width transform (SWT). This method suggests that the SWT algorithm combined with pixel dense estimation is efficient to extract the object with same stroke (symmetry region) in natural images. Quackenbush [25] also presents a review of the system for the linear feature extraction from imagery, that regarded as significantly requirement to automation detecting interest objects ongoing research from remotely sensed imagery.

In this paper, we propose a novel automatic method to extract building region from the single-view and in low-resolution satellite imagery, such as Fig. 1. The figure contains a complex environment, such as old building, playground, green landscape and waste land. Some of building is low and covered with vegetation. Our method is motivated by segmenting the symmetry and regularity architecture from this satellite mapping, and analyzed this region’s building density to modify the wireless propagation model. This study introduces SWT algorithm [24] to detect the hypothetical region, and fuses with k-means algorithm to achieve the presence of building from satellite imagery. The framework of building segmentation contains several steps. The first step uses the morphology transformation to add redundant boundary points to Canny operator [26], which benefited for calculating image pixels’ stroke width. The second step calculates images’ SWT spectral to detect the symmetry region from satellite imagery. K-means algorithm is used to obtain building region by color information. Then, the possibility building region is generated. Integrated SWT spectral with the classification result, building boundary can be totally acquired. The third step fusing second result could wipe out some interference region. Then, combined with the originate imagery mapping, Building region is grown into a fine segmentation result by a simple linear iterative clustering algorithm. For speciality of this satellite imagery, we compare our method result with K-means and Isodata clustering algorithm. The experiment results prove that our method is efficiently improvement building exaction.

The paper is organized as follows. In Sect. 2, our proposed method and the detailed of stroke width generation are introduced. In Sect. 3, experiment result and performance evaluation are presented. Finally, conclusions are given in Sect. 4.

2 Proposed Method

Our goal is to obtain the building region and evaluation its density. Therefore, this method could be performed as jointly estimating the symmetry region and color clustering problem. Intuitively, the symmetry region is applied to distinguish the character of building, and it can be detected by SWT algorithm. Besides, building always has different features with rural region in color space. We perform k-means algorithm in color clustering to removing interference. After intergrading those detected result, the simple linear iterative clustering algorithm [27] is used to maintain the building region consistency. Based on the consideration above, an unsupervised jointly framework is proposed as shown in Fig. 2. From the flowchart, we known that this method is designed simply and could be automation segmentation building region from satellite imagery.

2.1 Stroke Width Generation

The stroke width of image pixel is a locate description operator based on Canny edge detector. It has been extensively applied to extract the symmetry region (letter or road) in nature scenes. Neighboring pixels with the same stroke width may be grouped together. Relied on those similar stroke width of pixels, approximately symmetry region would be clustering from image. Building’s roof has a good symmetry in geometry morphology. Therefore, focusing on building’s roof is helpful to reduce a number of interference region.

Stroke width is defined as the length of line and rough perpendicular to the orientation of stroke. Its calculation should be firstly detected image edge by Canny operator. Those symmetry pixels located in edge are found. Then, the length of pixels in line is assigned as the same stroke width. Image pixels with same width would be in high spectral distribution. Building region is mainly identified by those image pixels’ stroke width.

Intuitively, stroke width is calculated from the image edge boundary. A gradient direction $d_p$ belongs to the pixel p on edge. If other pixel can be found from edge and signed as q, and the orientation of $d_q$ follow the ray $r = p + n \cdot d_p$, where $n > 0$. The gradient direction $d_q$ is roughly opposite to $d_q = -d_p \pm \pi /6$. Then, the SWT spectral of output image along with the segment [p, q] is assigned the width with $\Vert \overrightarrow{p-q}\Vert $. If a lower value already is appeared, pixel’s width would be chose the lower one as shown in Fig. 3.

If the pixel q is not existed, or $d_p$ does not opposites with $d_q$, the ray would be discarded. If pixels are in corner, and the average width of pixels is m, the length of pixels longer than m would be replaced by m as shown in Fig. 3.

The output of the SWT is a image, which contained the width of the stroke with each pixel as the same size of input image. Based on those theories, a detailed description of each step of the method is provided in the following section.

2.2 Building Boundary Extraction and Fusion

After calculating image pixels’ stroke width, Those symmetry region appear more intensity than others. In this section, pixels group together into building candidates via the distribution of pixels’ width. Edge detection determines the calculation scope of SWT algorithm, and is mainly influenced by the bright light and noise (shadow) in satellite imagery. Several tests prove that the more region’s edge detection, the better for the later building region extraction. Therefore, satellite imagery is necessary to be preprocessed before finding the building region.

In mathematical morphology, open operator is used to expand feature and close some of gaps in image. It followed with subtract operator could remove some of small or narrow elements, without influencing the large one. After those preprocessing, the uniformity of image background is added. Edge detection would become more complicated, so that building region would be appeared more obviously. Road and lawn region would be lower density in the distribution of SWT spectrum. The aim of this step could be to reduce several dissymmetry region.

Based on the edge detection, image’s SWT spectral is calculated. Neighbor pixels would be grouped together with the similar stroke width. For allowing more elaborate and perspective distortions of the symmetry region. If two neighboring pixels ratio does not exceed 3.0, the stroke with smoothly varying width will be grouped together. Those component region may contain building. A small set of rule is also employed by extracting the geometric character of building. The threshold of pixels’ stroke width (b) is set less than 30. After this threshold, road and lawn region (with less stroke wind) would be rejected from image.

Image’s SWT spectral obtains a lot of scattered region by Canny operator. To reduce the hypothesis building region, we classify image into two groups by K-means algorithm based on image’s color mapping. Building’s roof and road are homogenous in color and with much higher light than others. Therefore, the classification result is easily obtaining building and road from satellite imagery. We combine SWT spectral with classification result, and then the outline of building can be detected clearly. After this merged, irregular objects and mostly of roads are eliminated from the building’s region.

After above preprocessing, the hypothesis building region is brought up and scattered distribution in region. In this section, building boundary is extracted by a superpixel generation method [27]. This method generates superpixel based on K-means clustering with less memory requirement and strong performance. Superpixels contain similarly color distribution and labeled, respectively. Then, building region would be identified completely by fusing the building boundary with superpixel distribution.

In order to analysis the advantage of proposed method, an example of subjective result is shown in Fig. 4. We can see that this area contains many complex object region, such as building, road, tree and others shadow in Fig. 4(a), it can be seen that the probability map by SWT obtains the building symmetric region, but introduces some disconnected region in Fig. 4(b). Meanwhile, the result obtained by kmeans extracts building region as consistency in color. However, it introduces many noise region such as the flat region in (Fig. 4(c)). Combined the SWT with k-means result, most of dissymmetry region is wiped out. Building region would be recognized. While, the result in (Fig. 4(d)) successfully segments building region and reduces noise region after integrating building boundary and superpixel distribution.

3 Experiment and Analysis

The experiment is tested on Google earth satellite map with 1.07 metric/pixel spatial resolution. The dataset contains 224 images are special representative to predict the signal communication as shown in Fig. 4. The building region contains many different shapes, such as square and strip. Moreover, there are many tree and grass surrounding the old building, which seriously interfered building segmentation.

Iterative self-organizing data analysis technique algorithm (Isodata) is clustering algorithm based on the k-means, which added the merged and divided of clustering result. Isodata is an adaptive method to adjust the clustering number. Isodata and K-means are basic clustering methods. For segmenting the building density of satellite image with low resolution and single information, most of building excitation methods are failure. Therefore, we use the Isodata and k-means clustering algorithm comparative with our purposed method.

Performance Evaluation. In order to demonstrate the effective of our proposed method, we use the intersection-over-union (IOU) to evaluate the segmentation performance, which is widely applied for image segmentation. The groundtruth of building region is manual annotation. IOU is defined as follows:

$$\begin{aligned} precision = \frac{A\bigcap {B}}{A\bigcup {B}} \end{aligned}$$

(1)

where A is the binary mask which denoted the segmentation region by our method. B denotes the groundtruth. The higher the iou is, the better the performance is.

Some subjective result and objective result are shown in the following parts. Figure 5(a) is the groundtruth of satellite imagery. Figure 5(b)–(d) show the different segmentation results of Isodata, K-means, and our method, respectively. From this figure, we can see that the result of Isodata algorithm produces much noise due to the light point in original image. Meanwhile, the result of K-means algorithm obtain much more compact building than the result of Isodata. However, it produce some false positive in the middle bottom of the image, i.e., extracted the grass region as building. The result of proposed method is better than K-means and Isodata. It could not only remove the noise point but also avoid to segment the grass region as building. Meanwhile, the objective results are shown in Table 1. It can be seen that our method performs better than Isodata and K-means algorithm. The result demonstrates the effectiveness of the proposed method. The K-means result contain lots of building region, which contained some Non-architectural area. Therefor, it has much larger rise of IOU than Isodata. The value IOU of our method is improved limitedly. However, the segmentation result show that our method proves more similarly building region. The main reason is that building region loses some part of true territory caused by in the last step fused with superpixels region. As shown in Fig. 5(d), the new building region could be partitioned completely with much higher floor.

Table 1. IOU result.

Full size table

In this paper, stroke width of image pixel is used to provide the gradient information of building boundary. K-means algorithm detects more compactness region than Isodata. So we combined image pixels’ width with the K-means classification result to discard some noise. The building region could be detected very well by our algorithm. Comparing the extraction performance with K-means algorithm, Most of roads, grassland and bare ground could be deserted. The shadow of building could add the contrast with surrounding areas. Test result shows that image with much higher building could be enhanced the performance, which influenced by the shadow. Nevertheless, some of artificial road with good symmetry is hardly eliminated completely by this method. The experiment result also shows that our method is poor performance for the image only containing road or lawn region. Form the original satellite imagery, A lots of old building with a dark brown color roof was also made to be labeled mistakenly in test.

4 Conclusions

For modifying the wire radio propagation model parameters, loaded those satellite images is used to extracted the building density from google earth with low resolution and single spectre. In this paper, we proposed a simple automation method to segment building regions epically for an arbitrarily satellite mapping. We introduce SWT algorithm to detect the symmetry region, distinguished the character of building from around region, and cluster the similar color region via K-means algorithm. Finally, fused those result with the originate imagery mapping, building region is grown into a fine segmentation result. The proposed method could extract building region automatically and doesn’t need any high-level information or others prior knowledge. Compared with the object-oriented remotely sensed building extraction, it has a more application area. Based on the value of building density, wireless transform models will be modified in further researcher.

References

Erceg, V., Rustako, A.J., Roman, R.S.: Diffraction around corners and its effects on the microcell coverage area in urban and suburban environments at 900 MHz, 2 GHz, and 4 GHz. IEEE Trans. Veh. Technol. 43(3), 762–766 (1994)
Article Google Scholar
Kim, S.C., Guarino, B.J., Willis, T.M., et al.: Radio propagation measurements and prediction using three-dimensional ray tracing in urban environments at 908 MHz and 1.9 GHz. IEEE Trans. Veh. Technol. 48(3), 931–946 (1999)
Article Google Scholar
Souley, A.K.H., Cherkaoui, S.: Realistic urban scenarios simulation for ad hoc networks. In: 2nd International Conference on Innovations in Information Technology (IIT05) (2005)
Google Scholar
Sommer, C., Eckhoff, D., German, R., et al.: A computationally inexpensive empirical model of IEEE 802.11 p radio shadowing in urban environments. In: 2011 Eighth International Conference on Wireless On-Demand Network Systems and Services (WONS), pp. 84–90. IEEE (2011)
Google Scholar
Letourneux, F., et al.: Dual-polarized channel measurement and modeling in urban macro- and small-cells at 2 GHz. In: 2017 IEEE Wireless Communications and Networking Conference (WCNC), San Francisco, CA, pp. 1–6 (2017)
Google Scholar
Albert, A., Kaur, J., Gonzalez, M.: Using convolutional networks and satellite imagery to identify patterns in urban environments at a large scale. arXiv preprint arXiv:1704.02965 (2017)
Pelta, R., Chudnovsky, A.A.: Spatiotemporal estimation of air temperature patterns at the street level using high resolution satellite imagery. Sci. Total Environ. 579, 675–684 (2017)
Article Google Scholar
Mayunga, S.D., Zhang, Y., Coleman, D.J.: Semi-automatic building extraction utilizing Quickbird imagery. In: Proceedings of the ISPRS Workshop CMRT, vol. 13, pp. 1–136 (2005)
Google Scholar
Bypina, S.K., Rajan, K.S.: Semi-automatic extraction of large and moderate buildings from very high-resolution satellite imagery using active contour model. In: IEEE International on Geoscience and Remote Sensing Symposium (IGARSS), pp. 1885–1888. IEEE (2015)
Google Scholar
Khesali, E., Zoej, M.J.V., Mokhtarzade, M., et al.: Semi automatic road extraction by fusion of high resolution optical and radar images. J. Indian Soc. Remote Sens. 44(1), 21–29 (2016)
Article Google Scholar
Tian, J., Chen, D.M.: Optimization in multi-scale segmentation of high-resolution satellite images for artificial feature recognition. Int. J. Remote Sens. 28(20), 4625–4644 (2007)
Article Google Scholar
Hedman, K., Wessel, B., Stilla, U.: A fusion strategy for extracted road networks from multiaspect SAR images. U. Stilla, F. Rottensteiner und S. Hinz (Herausgeber), Int. Arch. Photogramm. Remote Sens. CMRT05 36(Part 3), W24 (2005)
Google Scholar
Rottensteiner, F., Briese, C.: A new method for building extraction in urban areas from high-resolution LIDAR data. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 34(3/A), 295–301 (2002)
Google Scholar
Guo, T., Yasuoka, Y.: Snake-based approach for building extraction from high-resolution satellite images and height data in urban areas. In: Proceedings of the 23rd Asian Conference on Remote Sensing, pp. 25–29 (2002)
Google Scholar
Teimouri, M., Mokhtarzade, M., Valadan Zoej, M.J.: Optimal fusion of optical and SAR high–resolution images for semiautomatic building detection. GISci. Remote Sens. 53(1), 45–62 (2016)
Article Google Scholar
Huang, X., Zhang, L.: A multidirectional and multiscale morphological index for automatic building extraction from multispectral GeoEye-1 imagery. Photogramm. Eng. Remote Sens. 77(7), 721–732 (2011)
Article Google Scholar
Huang, X., Zhang, L.: Morphological building/shadow index for building extraction from high–resolution imagery over urban areas. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 5(1), 161–172 (2012)
Article MathSciNet Google Scholar
Jin, X., Davis, C.H.: Automated building extraction from high-resolution satellite imagery in urban areas using structural, contextual, and spectral information. EURASIP J. Adv. Sig. Process. 2005(14), 745309 (2005)
Article MATH Google Scholar
Jiang, N., Zhang, J.X., Li, H.T., et al.: Semi-automatic building extraction from high resolution imagery based on segmentation. In: International Workshop on Earth Observation and Remote Sensing Applications, EORSA 2008, pp. 1–5. IEEE (2008)
Google Scholar
Lin, C., Huertas, A., Nevatia, R.: Detection of buildings using perceptual grouping and shadows. In: Proceedings of the 1994 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 1994, pp. 62–69. IEEE (1994)
Google Scholar
Gamba, P., Dell’Acqua, F., Lisini, G., et al.: Improved VHR urban area mapping exploiting object boundaries. IEEE Trans. Geosci. Remote Sens. 45(8), 2676–2682 (2007)
Article Google Scholar
Lee, D.S., Shan, J., Bethel, J.S.: Class-guided building extraction from Ikonos imagery. Photogramm. Eng. Remote Sens. 69(2), 143–150 (2003)
Article Google Scholar
Doucette, P., Agouris, P., Stefanidis, A.: Automated road extraction from high resolution multispectral imagery. Photogramm. Eng. Remote Sens. 70(12), 1405–1416 (2004)
Article Google Scholar
Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2963–2970. IEEE (2010)
Google Scholar
Quackenbush, L.J.: A review of techniques for extracting linear features from imagery. Photogramm. Eng. Remote Sens. 70(12), 1383–1392 (2004)
Article Google Scholar
Canny, J.: A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell. 6, 679–698 (1986)
Article Google Scholar
Achanta, R., Shaji, A., Smith, K., et al.: SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Trans. Pattern Anal. Mach. Intell. 34(11), 2274–2282 (2012)
Article Google Scholar

Download references

Acknowledgments

This work is supported by the Typical Spectrum of Radio Signal (szjj2016-092). The Postdoctoral Station at Xihua University Based on Collaboration Innovation Center of Sichuan Automotive Key Parts. The National Natural Science Foundation (61372187, 61473239), and Laboratory of Intelligent Network Information Processing, (No. szjj2015-061).

Author information

Authors and Affiliations

The Postdoctoral Station, Southwest Jiaotong University, Chengdu, 610039, China
Li Xu
The Postdoctoral Station, Xihua University Based on Collaboration Innovation Center of Sichuan Automotive Key Parts, Chengdu, 610039, China
Li Xu
The Center for Radio Administration Technology Development, Xihua University, Chengdu, 610039, China
Mingming Kong & Bing Pan

Authors

Li Xu
View author publications
You can also search for this author in PubMed Google Scholar
Mingming Kong
View author publications
You can also search for this author in PubMed Google Scholar
Bing Pan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Li Xu .

Editor information

Editors and Affiliations

Civil Aviation University of China, Tianjin, China
Jinfeng Yang
Tianjin University, Tianjin, China
Qinghua Hu
Nankai University, Tianjin, China
Ming-Ming Cheng
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Liang Wang
Nanjing University of Information Science and Technology, Nanjing, China
Qingshan Liu
Huazhong University of Science and Technology, Wuhan, China
Xiang Bai
Xi’an Jiaotong University, Xi’an, China
Deyu Meng

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xu, L., Kong, M., Pan, B. (2017). Building Extraction by Stroke Width Transform from Satellite Imagery. In: Yang, J., et al. Computer Vision. CCCV 2017. Communications in Computer and Information Science, vol 773. Springer, Singapore. https://doi.org/10.1007/978-981-10-7305-2_30

Download citation

DOI: https://doi.org/10.1007/978-981-10-7305-2_30
Published: 08 December 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7304-5
Online ISBN: 978-981-10-7305-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics