Abstract
Data-driven housing-market segmentation has been given increasing prominence for its objectiveness in identifying submarkets based on the housing data’s underlying structures. However, when handling high-dimensionality housing dataset, traditional statistical-clustering methods have been found to tend to lose low-variance information of the dataset and be deficient in deriving the globally optimal number of submarkets. Accordingly, with the intention of achieving more rigorous high-dimensionality housing market segmentation, a swarm-inspired projection (SIP) algorithm is introduced by this study. Using a high-dimensionality Taipei city’s housing dataset in a case study, a comparison of the proposed SIP algorithm and a statistical-clustering method using the combination of principal component analysis (PCA) and K-means clustering is conducted in evaluating the predictive accuracy of hedonic price models of the housing submarkets. The results show that, as compared to the original single market, the segmented submarkets resulting from SIP algorithm are more homogenous and distinctive, where the resulted hedonic price models have high-level statistical explanation and disparate sets of hedonic prices for different submarkets. In addition, as compared to the use of a statistical-clustering method, SIP algorithm is found to obtain a more optimal number of submarkets, where the resulted hedonic price models are found to achieve greater improvement of statistical explanation and more stable reduction of prediction error. These findings highlight the advantages of our proposed SIP algorithm in high-dimensionality housing market segmentation, and thus it is hoped that the present research will serve as a practical tool to better inform further studies aimed at market-segmentation-related problems.
Similar content being viewed by others
Data availability
The datasets generated and analysed during the current study are not publicly available due to privacy issues and the subjection to a confidential agreement but are available from the corresponding author on reasonable request.
Notes
The code for SIP algorithm, PCA and K-means clustering, and hedonic price modeling can be found online in this link https://github.com/Tingting202/Swarm-intelligence-Housing-Market-Segmentation.
It is noteworthy that the recorded transactions in the database may be smaller than the actual transactions happened in Taipei city during this period. Transaction records from 2008 to 2010 were obtained mainly because this set of data were agreed to be disclosed to the authors, while more updated transaction records have not been disclosed yet.
References
Adair, A. S., Berry, J. N., & McGreal, W. S. (1996). Hedonic modelling, housing submarkets and residential valuation. Journal of Property Research, 13(1), 67–83. https://doi.org/10.1080/095999196368899
Bates, L. K. (2006). Does neighborhood really matter? Comparing historically defined neighborhood boundaries with housing submarkets. Journal of Planning Education and Research, 26(1), 5–17. https://doi.org/10.1177/0739456x05283254
Bourassa, S. C., Hamelink, F., Hoesli, M., & MacGregor, B. D. (1999). Defining housing submarkets. Journal of Housing Economics, 8(2), 160–183. https://doi.org/10.1006/jhec.1999.0246
Chen, J.-H., Ong, C. F., Zheng, L., & Hsu, S.-C. (2017). Forecasting spatial dynamics of the housing market using support vector machine. International Journal of Strategic Property Management, 21(3), 273–283. https://doi.org/10.3846/1648715X.2016.1259190
Chen, P.-F., Chien, M.-S., & Lee, C.-C. (2011). Dynamic modeling of regional house price diffusion in Taiwan. Journal of Housing Economics, 20(4), 315–332. https://doi.org/10.1016/j.jhe.2011.09.002
Comrey, A. L., & Lee, H. B. (1992). Interpretation and application of factor analytic results. A. L. Comrey, H.B. Lee, A first course in factor analysis, 2, 1992.
Dale-Johnson, D. (1982). An alternative approach to housing market segmentation using hedonic price data. Journal of Urban Economics, 11(3), 311–332. https://doi.org/10.1016/0094-1190(82)90078-X
Goodman, A. C., & Thibodeau, T. G. (2003). Housing market segmentation and hedonic prediction accuracy. Journal of Housing Economics, 12(3), 181–201.
Han, J., Kamber, M., & Pei, J. (2012). Data mining concepts and techniques (3rd ed.). Waltham: Elsevier.
Helbich, M., Brunauer, W., Hagenauer, J., & Leitner, M. (2013). Data-driven regionalization of housing markets. Annals of the Association of American Geographers, 103(4), 871–889. https://doi.org/10.1080/00045608.2012.707587
Hui, E. C. M., Zhong, J., & Yu, K. (2016). Heterogeneity in spatial correlation and influential factors on property prices of submarkets categorized by urban dwelling spaces. Journal of Urban Planning and Development, 142(1), 04014047. https://doi.org/10.1061/(ASCE)UP.1943-5444.0000270
Islam, K. S., & Asami, Y. (2009). Housing market segmentation: A review. Review of Urban and Regional Development Studies, 21(2–3), 93–109. https://doi.org/10.1111/j.1467-940X.2009.00161.x
James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning (Vol. 112). New York: Springer.
Jolliffe, I. T., & Cadima, J. (2016). Principal component analysis: a review and recent developments. Philosophical Transactions. Series A, Mathematical, Physical, and Engineering Sciences, 374(2065), 20150202. https://doi.org/10.1098/rsta.2015.0202
Kana, M. (2020). Handling missing data for advanced machine learning. https://medium.com/towards-artificial-intelligence/handling-missing-data-for-advanced-machine-learning-b6eb89050357. Accessed October 3, 2020.
Kassambara, A. 2018. Machine learning essentials: Practical guide in R: STHDA.
Kauko, T. O. M., Hooimeijer, P., & Hakfoort, J. (2002). Capturing housing market segmentation: an alternative approach based on neural network modelling. Housing Studies, 17(6), 875–894. https://doi.org/10.1080/02673030215999
Kennedy, J., & Eberhart, R. 1995. Particle swarm optimization. In Proceedings of ICNN'95—International conference on neural networks, IEEE Perth, WA, Australia
Labroche, N., Monmarché, N., & Venturini, G. (2003). Visual clustering based on chemical recognition system of ants. In Genetic and Evolutionary Computation Conference, Chicago, Ilinois, France: HAL.
Manganelli, B., Pontrandolfi, P., Azzato, A., & Murgante, B. (2014). Using geographically weighted regression for housing market segmentation. International Journal of Business Intelligence and Data Mining, 9(2), 161–177.
Mundfrom, D. J., Shaw, D. G., & Ke, T. L. (2005). Minimum sample size recommendations for conducting factor analyses. International Journal of Testing, 5(2), 159–168.
O’Sullivan, S., & Morrall, J. (1996). Walking distances to and from light-rail transit stations. Transportation Research Record, 1538(1), 19–26. https://doi.org/10.1177/0361198196153800103
Picarougne, F., Azzag, H., Venturini, G., & Guinot, C. 2004. On data clustering with a flock of artificial agents. In 16th IEEE international conference on tools with artificial intelligence, Boca Raton, FL, USA
Rana, S., Jasola, S., & Kumar, R. (2011). A review on particle swarm optimization algorithms and their applications to data clustering. Artificial Intelligence Review, 35(3), 211–222. https://doi.org/10.1007/s10462-010-9191-9
Reif, R. (2018). Limitations of applying dimensionality reduction using PCA. https://www.robertoreif.com/blog/2018/1/9/pca. Accessed 27 Feb 2021.
Rosen, S. (1974). Hedonic prices and implicit markets: Product differentiation in pure competition. Journal of Political Economy, 82(1), 34–55.
Schnare, A. B., & Struyk, R. J. (1976). Segmentation in urban housing markets. Journal of Urban Economics, 3(2), 146–166.
Su, M.-C., & Chang, H.-T. (2001). A new model of self-organizing neural networks and its application in data projection. IEEE Transactions on Neural Networks, 12(1), 153–158. https://doi.org/10.1109/72.896805
Su, M.-C., Su, S.-Y., & Zhao, Y.-X. (2009). A swarm-inspired projection algorithm. Pattern Recognition, 42(11), 2764–2786. https://doi.org/10.1016/j.patcog.2009.03.020
Taipei City Government. 2017. "Demographic structure and composition". Department of Information Technology, Taipei City Government. https://english.gov.taipei/cp.aspx?n=C619997124A6D293. Accessed December 10, 2019.
Thrun, M. C. (2018). Projection-based clustering through self-organization and swarm intelligence: Combining cluster analysis with the visualization of high-dimensional data. Marburg: Springer Vieweg.
Tu, Y. (1997). The local housing sub-market structure and its properties. Urban Studies, 34(2), 337–353. https://doi.org/10.1080/0042098976203
Watkins, C. A. (2001). The definition and identification of housing submarkets. Environment and Planning A: Economy and Space, 33(12), 2235–2253. https://doi.org/10.1068/a34162
Wilson, P., White, M., Dunse, N., Cheong, C., & Zurbruegg, R. (2011). Modelling price movements in housing micro markets: Identifying long-term components in local housing market dynamics. Urban Studies, 48(9), 1853–1874. https://doi.org/10.1177/0042098010380960
Wu, C., & Sharma, R. (2012). Housing submarket classification: The role of spatial contiguity. Applied Geography, 32(2), 746–756. https://doi.org/10.1016/j.apgeog.2011.08.011
Wu, C., Ye, X., Ren, F., & Du, Q. (2018). Modified data-driven framework for housing market segmentation. Journal of Urban Planning and Development, 144(4), 04018036. https://doi.org/10.1061/(ASCE)UP.1943-5444.0000473
Funding
This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Chen, JH., Ji, T., Su, MC. et al. Swarm-inspired data-driven approach for housing market segmentation: a case study of Taipei city. J Hous and the Built Environ 36, 1787–1811 (2021). https://doi.org/10.1007/s10901-021-09824-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10901-021-09824-1