Abstract
In this paper, we present an algorithm for cluster detection using modified Watershed model. The presented model for cluster detection works better than the k-means algorithm. The proposed algorithm is also computationally inexpensive compared to the k-means, agglomerative hierarchical clustering and DBSCAN algorithm. The clustering results can be considered as good as the results of DBSCAN and sometimes the result obtained by the proposed model is better than the DBSCAN results. The presented algorithm solves the conflicts faced by the DBSCAN in case of varying density. This paper also presents a way to reduce high dimensional data to low dimensional data with automatic association analysis. This algorithm can reduce high dimensional data to even a single dimension. Using this algorithm the challenges faced in multidimensional clustering by different algorithms such as DBSCSN is solved. This dimensionality reduction with automatic association algorithm is then applied to the Watershed model to detect cluster in Homicide Data and finding out murder prone zones and suggest a person with murder avoiding areas.
References
Drake, J., Hamerly, G.: Accelerated k-means with adaptive distance bounds. In: 5th NIPS Workshop on Optimization for Machine Learning, pp. 42–53 (2012)
Celebi, M.E., Kingravi, H.A., Vela, P.A.: A comparative study of efficient initialization methods for the k-means clustering algorithm. Expert Syst. Appl. 40(1), 200–210 (2013)
Cousty, J., Bertrand, G., Najman, L., Couprie, M.: Watershed cuts: minimum spanning forests and the drop of water principle. IEEE Trans. Pattern Anal. Mach. Intell. 31(8), 1362–1374 (2009)
Bellman, R.E.: Perturbation Techniques in Mathematics, Engineering and Physics. Courier Corporation, North Chelmsford (2003)
Hahsler, M., Grün, B., Hornik, K.: A computational environment for mining association rules and frequent item sets (2005)
Kriegel, H.P., Schubert, E., Zimek, A.: The (black) art of runtime evaluation: are we comparing algorithms or implementations? Knowl. Inf. Syst. 52(2), 341–378 (2016)
MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, no. 14, pp. 281–297, June 1967
Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)
Stevens, W.P., Myers, G.J., Constantine, L.L.: Structured design. IBM Syst. J. 13(2), 115–139 (1974)
Jain, A.K.: Data clustering: 50 years beyond K-means. Pattern Recognit. Lett. 31(8), 651–666 (2010)
Richards, P.: Homicide Statistics, Research Paper 99/56, Social And General Statistics Section, House Of Commons Library, 27 May 1999
Barnes, R., Lehman, C., Mulla, D.: Priority-flood: an optimal depression-filling and watershed-labeling algorithm for digital elevation models. Comput. Geosci. 62, 117–127 (2014)
Pearson, K.: LIII. On lines and planes of closest fit to systems of points in space. Lond. Edinb. Dublin Philos. Mag. J. Sci. 2(11), 559–572 (1901)
Homicide Data. https://www.kaggle.com/murderaccountability/homicide-reports. Accessed 9 June 2017
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Khisha, J., Zerin, N., Choudhury, D., Rahman, R.M. (2017). Determining Murder Prone Areas Using Modified Watershed Model. In: Nguyen, N., Papadopoulos, G., Jędrzejowicz, P., Trawiński, B., Vossen, G. (eds) Computational Collective Intelligence. ICCCI 2017. Lecture Notes in Computer Science(), vol 10448. Springer, Cham. https://doi.org/10.1007/978-3-319-67074-4_30
Download citation
DOI: https://doi.org/10.1007/978-3-319-67074-4_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67073-7
Online ISBN: 978-3-319-67074-4
eBook Packages: Computer ScienceComputer Science (R0)