1 Introduction

Landslide is considered to be one of the most typical natural hazards that have been affecting human lives and properties in India (Sarkar et al. 2011). Guha-Sapir et al. (2014) reported that India is one of the most affected countries in Asia by landslides after China. According to the Geological Survey of India, landslide affected area is about 0.49 million km2. Approximately 15 % of land area of India is vulnerable to landslide hazard (Onagh et al. 2012). Landslide has been occurring mainly in the Himalaya mountain range including the Uttarakhand state, in the northern India. Many areas of Uttarakhand state have been subjected to severe erosions of rock and soil, and under cutting of slopes by rivers and anthropogenic activities. With the developmental activities, landslide is turning into a serious hazard issue requiring urgent attentions to mitigate its damages. Landslide susceptibility map is considered as a standard tool in land management decision-making for mitigating damages arising from landslides. In addition, these maps are helpful for planners to select suitable areas to implement development schemes (Akgun 2012; Ozdemir and Altural 2013)

Over the decades, the topic of landslide susceptibility mapping has been turning into one of the most attentions to many researchers throughout the world (Dou et al. 2015c; Hong et al. 2015b; Kavzoglu et al. 2015; Pareek et al. 2013; Shahabi et al. 2014; Tien Bui et al. 2014b). Geographical Information System (GIS) is an excellent and useful tool in order to produce landslide susceptibility map. Nowadays, the data used in GIS is easier and quicker to receive due to the development of Global Positioning Systems (GPS) and Remote Sensing (RS) techniques. Therefore, the utilization of GIS becomes more and more popular in the spatial analysis of landslides (Althuwaynee et al. 2012).

Over the last decades, many studies in landslide susceptibility mapping using GIS have been carried out in many regions over the world such as weights of evidence (Kayastha et al. 2012; Mohammady et al. 2012; Ozdemir and Altural 2013; Pourghasemi et al. 2013b; Regmi et al. 2014), frequency ratio (Choi et al. 2012; Shahabi et al. 2014; Yalcin et al. 2011), logistic regression (Bai et al. 2011; Conoscenti et al. 2014; Das et al. 2010; Garcia-Rodriguez et al. 2008; Ohlmacher and Davis 2003), evidential belief function (Althuwaynee et al. 2012; Jebur et al. 2015; Pradhan et al. 2014), fuzzy logic (Aksoy and Ercanoglu 2012; Pourghasemi et al. 2012; Pradhan 2011; Saboya Jr et al. 2006; Tien Bui et al. 2012d), neuro-fuzzy (Oh and Pradhan 2011; Pradhan et al. 2010b; Sezer et al. 2013; Tien Bui et al. 2012c; Vahidnia et al. 2010), support vector machine (Hong et al. 2015a; Marjanovic et al. 2011; Peng et al. 2014; Pourghasemi et al. 2013a; Tien Bui et al. 2012b; Xu et al. 2012).

Overall, these methods can be grouped into three approaches: (i) analytic approach, (ii) statistic approach, (iii) soft computing approach (Pradhan 2013). The analytic approach (safety factor – based approach) is only feasible in small areas where landslide types are simple and having fairly homogeneous geologic properties (Dou et al. 2014; Scolobig and Pelling 2015). It is almost impossible to be applied in large areas (Pradhan 2013). Therefore, statistical and soft computing approaches usually are chosen for landslide susceptibility assessment in large areas. Moreover, these approaches can be implemented comparatively easily using GIS (Akgun et al. 2012; Pradhan 2013). Recently, some relative new soft computing – based models have been developed and applied successfully in classification task such as Naïve Bayes (NB), Multilayer Perception Neural Network (MLP Neural Nets), Functional Tree (FT) which can be applied in spatial prediction of landslide occurrence. In general, no agreement is reached on which one is best for all areas, therefore, the comparison of the performance of these methods should be carried out (Aksoy and Ercanoglu 2012; Alimohammadlou et al. 2014; Kavzoglu et al. 2014; Regmi et al. 2014; Shahabi et al. 2014; Zare et al. 2013).

NB is a new technique in classification and was selected for the present study to construct landslide susceptibility map. This method has been already used widely and successfully in many studies (Aguiar-Pulido et al. 2012; Lepora et al. 2010; Lu et al. 2010; Rosen et al. 2011; Zhang and Gao 2011). However, its application is still rare in landslide problem (Tien Bui et al. 2012a). Tien Bui et al. (2012a) also stated that NB giving relatively good prediction capability in landslide susceptibility assessment in Hoa Binh province in Viet Nam.

MLP Neural Nets is an artificial neural network that has been employed widely in many fields including landslide susceptibility assessment (Conforti et al. 2014; Dou et al. 2015d; Lee et al. 2004; Li et al. 2012; Melchiorre et al. 2008; Yilmaz 2009). Zare et al. (2013) was carried out the study in comparison of two artificial neural networks such as MLP Neural Nets and Radial Basic Function Network (RBF Neural Nets) for landslide problems and giving a statement that MLP Neural Nets outperforms RBF Neural Nets. Since, MLP Neural Nets was selected in this study for comparing with other models (NB and FT).

FT is an efficient method for classification in many fields such as forestry (Heinimäki and Elomaa 2013), patent (Cascini and Zini 2008), oceanology (Nerini and Ghattas 2007). Gama (2004) stated that the FT model might be seen as a generalization of prior multivariate trees for decision problems. However, this technique has seldom been explored for landslide susceptibility assessment; therefore, it was selected in this present study to compare the prediction capability with other models (FT and MLP Neural Nets).

The above three proposed methods have advantages and disadvantages depending on the available various datasets and site conditions. The comparison of these methods in landslide susceptibility assessment is still limited and thus requiring more studies. The main objective of this present study is to compare the prediction capability of the NB, MLP Neural Nets, and FT techniques for spatial prediction of landslides. Weka 3.7.12 and ArcGIS 10.2 software were used in data analysis, development of models and landslide susceptibility maps.

2 Description of study area

The part of Uttarakhand state has been selected as a study area (Fig. 1). It lies between latitudes 29°56’38’’N to 30°09’37’’N  and longitudes 78°29’01’E to 78°37’06”E covering an area of about 323,815 m².

Fig. 1
figure 1

Landslide inventory map of study area

The study area located in the middle of the Tehri Garhwal and Pauri Garhwal districts, as it is having number of landslides and identical meteorological conditions of these two districts. The daily mean temperature remains 19.6 oC in the Tehri Garhwal district, whereas the maximum temperature 36.5 °C in June, and the lowest temperature 4.6 °C in January (Bagchi 2011). The relative humidity in the Tehri Garhwal district is generally very high with the highest one of 85 % in monsoon season (June to September) and the lowest one of 25 % in summer season (April to June) (Bagchi 2011). The daily mean temperature remains 25 °C in the Pauri Garhwal district whereas the maximum temperature of 45 °C in June, and the minimum temperature of 1.3 °C in January. The relative humidity varies between 54 % and 63 % in the Pauri Garhwal district (http://pauri.nic.in/pages/display/55-the-land). The study area is situated in subtropical moon soon region which frequently experiences heavy rainfall. The heavy rainfall usually occurs in monsoon season. The annual mean rainfall varies from 770 mm to 1684 mm.

Broadly, in this study area, there are four types of land cover including dense forest, non-forest, open-forest, and scrub land. Non-forest covers the biggest area with 39.02 %, followed by dense-forest (31.96 %), open-forest (22.36 %), and scrub land (6.67 %), respectively.

Topographically, the study area is mountainous having very high mountain ranges. Elevation ranges from 380 m to 2180 m above the sea level, with average elevation of 1,081 m. Most of area falls above elevation 900 m. Slope angles in this area are smaller than 70 degrees. Most of the slope angles are in classes of 15 – 25 degrees (21.34 %), 25 – 35 degrees (38.08 %), 35 – 45 degrees (26.03 %). There is only 2.89 % in very gentle slope angles smaller than 8 degrees.

Geologically, there are six main lithological groups namely, Blaini and Krol group (boulder bed and limestone), Amri group (quartzite, phyllite), Bijni group (quartzite, phyllite), Jaunsar group (phyllite and quartzite), Tal group (sandstone, shale, quartzite, phyllite, and limestone), Manikot shell limestone (limestone). Baliana and Krol group, Bijni group present dominantly with 30.1 % and 28.1 % of the study area, respectively. Soil types are mainly fine-silt (26.27 %) and loamy (73.73 %). Loamy soils can be divided into four classes such as course-loamy (20.1 %), fine-loamy (8.02 %), skeletal- loamy (42.02 %), and mixed-loamy (3.6 %).

3 Methodology

Methodology in this study was carried out in six steps (Fig. 2): (i) data collection and interpretation, (ii) preparing of the landslide conditioning factors and landslide inventory map and creating of the training and testing datasets as inputs for landslide susceptibility modeling, (iii) considering of influence of the eleven conditioning factors on landslide occurrence using linear support vector machine, (iv) applying the NB, MLP Neural Nets, and FT models to assess the susceptibility of landslides in the study area, (v) evaluating and comparing the performance of these models using the success rate and predictive rate curves, (vi) generating landslide susceptibility maps, and selecting of the best model in the study area.

Fig. 2
figure 2

Methodological flow chart employed in this study

3.1 Data collection and interpretation

The dataset in the current study was collected mostly from published literatures such as Geological Survey of India including topography, lithology, soil, and land cover maps of Uttarakhand state at a scale of 1:1000000 (http://www.ahec.org.in/wfw/maps.htm). Meteorological data was collected and compiled for 30 years (1984 to 2014) from Climate Forecast System Reanalysis (CFSR) in Global Weather data for SWAT (NCEP 2014). The LANDSAT-8 satellite images with 30 m spatial resolution were downloaded from United State Geological Survey (USGS) database (http://earthexplorer.usgs.gov/). The Google Earth satellite images up to 10 m spatial resolution were used with the help of Google Earth pro 7.0 software. All of the raw data were processed to create the input data for landslide susceptibility assessment using ArcGIS 10.2.

3.1.1 Landslide inventory map

Landslide inventory map is the compilation of the landslide locations occurred in the past and its characteristics. It is indispensable in landslide susceptibility assessment that is carried out on the base of the statistical assumption that the conditions for landslide occurrence in the future is same with landslides happened in the past (Guzzetti et al. 2005). One of the most helpful methods for identification of landslides is using of remote sensing techniques (Dou et al. 2015b; Guzzetti et al. 2012; Singhroy et al. 1998).

Information for historical landslides for the study areas was first collected from reports and newspaper records and then this information were used to identify landslides by using “zooming out” and “zooming in” tools in Google Earth pro 7.0 software. Landslide locations were identified by Google Earth images in 2014. LANDSAT-8 satellite images were also used to help identification of very large landslides in the study area. The interpretation of landslides was based on evidences such as bare soil, breaks in the forest canopy, and other typical geomorphic characteristics (Pradhan 2013). Landslide locations were then validated using extensive field investigation.

A total of 430 landslide locations were finally identified whereas 236 landslides with area larger than 400 m2 (corresponding to the area of pixels in DEM 20 m) were mapped in polygon format, and remaining 194 landslides with area smaller than 400 m2 were mapped as points (Fig. 1). The biggest landslide polygon was identified with area of 199,574 m2. Approximately 75.6 % landslides (325 locations) are translational type whereas about 24.4 % of landslides (105 locations) are rotational type.

Most of these landslides have occurred in cut-slopes, embankments alongside roads, and highways. Examples of landslide photos in Uttarakhand state is shown in Fig. 3.

Fig. 3
figure 3

Examples of landslide photos in the study area (source: (a) http://www.indiawaterportal.org/articles/recent-landslides-uttarakhand-natures-fury-or-human-folly-paper-current-science, (b) http://indiatoday.intoday.in/gallery/uttarakhand-floods-2013-army-iaf-rescue-operations-continue/23/9667.html, (c) http://www.indiawaterportal.org/articles/recent-landslides-uttarakhand-natures-fury-or-human-folly-paper-current-science, (d) http://indiatoday.intoday.in/uttarakhand/deluge.jsp)

3.1.2 Landslide conditioning factors

Eleven landslide conditioning factors were used for landslide susceptibility assessment in the study area namely slope angle, slope aspect, elevation, curvature, lithology, soil type, land cover, distance to roads, distance to lineaments, distance to rivers, and rainfall.

Slope angle

Slope angle is one of the most causative factors to landslide occurrence (Van Den Eeckhaut et al. 2006). Slope angle map was extracted from a digital elevation model (DEM) with a 20 m*20 m grid size, which was generated using elevation contour lines and points extracted from the state topographical map at a scale of 1:1000000. Slope angles were classified into six classes including 0-8, 8-15, 15-25, 25-35, 35-45; and > 45 degrees (Fig. 4a). It can be observed from the statistical analysis that landslide occurred mainly between slope angles of 35 – 45 degrees (41.54 %) and 25–35 degrees (29.13 %). There are no landslides reported in very gentle slope angles smaller than 8 degrees (Fig. 4b).

Fig. 4
figure 4

(a) Slope angle map, (b) Distribution of pixels in slope angle map, (b) Slope aspect map, (d) Distribution of pixels in slope aspect map

Slope aspect

Slope aspect that controls topographic moisture due to impaction of solar radiation and rainfall (Sadr et al. 2014) was considered as a conditioning factor to landslide occurrence in this study. Slope aspect map was extracted from a DEM and were classified into nine classes such as flat (-1), north (0-22.5 and 337.5-360), northeast (22.5-67.5), east (67.5-112.5), southeast (112.5-157.5), south (157.5-202.5), southwest (202.5-247.5), west (247.5-292.5), northwest (292.5-337.5) (Fig. 4c). Landslides in the study area have been observed concentrated mainly in southwest (23.65 %), and south (14.5 %), and there are no landslide shown in flat area (Fig. 4d).

Elevation

Elevation is the deviation of maximum and minimum heights within terrains that is considered as a conditioning factors to landslide occurrence (Ayalew and Yamagishi 2005b). Elevation map was generated from DEM and categorized into different classes such as 0 - 600, 600 - 750, 750 - 900, 900 – 1050, 1050 – 1200, 1200 – 1350, 1350 – 1500, 1500 – 1650, 1650 – 1800, and > 1800 m (Fig. 5a). Elevation with the highest number of landslide is in class smaller than 600 m (26.49 %). Landslides in 900 – 1050 m and 1050 – 1200 m classes are also high (18.28 % and 14.65 %), respectively. No landslides were observed on elevation greater than 1800 m (Fig. 5b).

Fig. 5
figure 5

(a) Elevation map, (b) Distribution of pixels in elevation map, (c) Curvature map, (d) Distribution of pixels in curvature map

Curvature

Curvature is considered as one of the conditioning factors in this study due to it controls the water flow on terrain surface affecting to landslide occurrence. It can be stated that convex slopes are more stable than concave slopes because runoff disperses uniformly and easily down (Stocking 1972). Therefore, curvature map was derived from DEM and divided into three classes: concave (< -0.05), flat (-0.05 – 0.05), and convex (> 0.05) (Fig. 5c). Landslides occurred mostly in concave (57.79 %) and convex (42.21 %). Obviously, there are no landslides appearing in flat class (Fig. 5d).

Lithology

Geology includes lithological and structural variations which often lead to a difference in strength and permeability of rocks and soils, has greatly influences on occurrence of landslides (Ayalew and Yamagishi 2005b). Lithological map in the study area was extracted from the geological map of the state and then constructed with six formations based on lithological similarities (Ayalew and Yamagishi 2005a) including group 1 (Amri group with quartzite, phyllite), group 2 (Blaini and Krol group with boulder bed and limestone), group 3 (Bijni group with quartzite, phyllite), group 4 (Jaunsar group with phyllite and quartzite), group 5 (Manikot shell limestone with limestone), group 6 (Tal group with sandstone, shale, quartzite, phyllite, and limestone) (Fig. 6a). Lithological formation that has the highest number of landslides is group 6 (37.28 %). The number of landslides is also high in group 3 (22.11 %), group 4 (18.83 %), and group 2 (18.64 %). Landslides in group 5 are very small (3.14 %) whereas landslide did not happen in group 1 (Fig. 6b).

Fig. 6
figure 6

(a) Lithological map, (b) Distribution of pixels in lithological map, (c) Soil map, (d) Distribution of pixels in soil map

Soil

Soil type reflects textures and compositions of soil materials affecting on landslide occurrence (Sassa and Canuti 2008). Soil map was constructed from the soil map of state and were classified into fine-silt, course-loamy, fine-loamy, mixed-loamy, skeletal-loamy (Fig. 6c). It can be seen that landslides happened mainly in course-loamy (CL) (40.76 %), skeletal-loamy (SL) (36.13 %). There are few landslides in fine-silt (FS) (16.15 %), fine-loamy (FL) (3.14 %), and mixed-loamy (ML) (3.82 %) (Fig. 6d).

Land cover

Land cover map for the study area was extracted from the land cover map of the state. Land cover classes in the study area include dense forests, non-forests, open forests, and scrub lands (Fig. 7a). It is obvious that landslides took place mostly in non-forest area (31.86 %), following by dense-forest (29.13 %), open-forest (25.87 %), and scrub land (13.14 %), respectively (Fig. 7b). In the area covering by vegetation, the roots of vegetation act as anchors to keep stability of soil and rock masses on the slope thus preventing landslides (Prandini et al. 1977).

Fig. 7
figure 7

(a) Land cover map, (b) Distribution of pixels in land cover map, (c) Rainfall map, (d) Distribution of pixels in rainfall map

Rainfall

Rainfall is the most affecting factor for landslide occurrence (Glade et al. 2000; Polemio and Petrucci 2000). Rainfall map was constructed from the meteorological data. The rainfall was classified into eight classes including 0 - 900, 900 – 1000, 1000 – 1100, 1100 – 1200, 1200 – 1300, 1300 – 1400, 1400 – 1500, and > 1500 mm (Fig. 7c). It has been observed that landslides occurred mainly in areas where road cutting is there and rainfalls is more than 900 mm that is between 900 mm - 1000 mm (21.38 %), and 1000 – 1100 mm (20.65 %) (Fig. 7d).

Distance to roads

Distance to roads map was built with six classes including 0-40, 40 - 80, 80 - 120, 120 - 160, 160 – 200, and > 200 m (Fig. 8a). Roads were extracted from Google Earth satellite images, and then the road sections with slope angles larger than 15 degrees were buffered within the study area to construct distance to roads map. It has been observed that number of landslides occurring near to the roads. Landslides occurred mostly at distances of 40 – 80 m (40.27 %), 80 – 120 m (20.74 %), 0 – 40 m (15.97 %), respectively. Few landslides were observed at distances of 120 – 160 m (12.22 %), 160 – 200 m (7.23 %), and greater than 200 m (3.56 %) (Fig. 8b).

Fig. 8
figure 8

(a) Distance to roads map, (b) Distribution of pixels in distance to roads map, (c) Distance to rivers map, (d) Distribution of pixels in distance to rivers map

Distance to rivers

Distance to rivers map was also generated with six classes including 0-40, 40-80, 80 - 120, 120-160, 160 – 200, and > 200 m (Fig. 8c). Rivers were also extracted from Google Earth satellite images, and then the river sections with slope angles larger than 15 degrees were buffered within the study area to construct distance to rivers map. Landslides placed mostly at distance of 0 – 40 m (65.12 %), and very small amount of landslides occurred at greater distances (Fig. 8d).

Distance to lineaments

Distance to lineaments map was constructed with different classes as 0 - 50, 50 - 100, 100-150, 150-200, 200 – 250, 250 – 300, 300 – 350, 350 – 400, 400-450, 450-500, and > 500 m (Fig. 9a). Lineaments are linear features such as faults, fractures, geomorphologic ridges and other topographic and tectonic structures which affect continuity of rocks and soils masses thus causing instability of slope (Ayalew and Yamagishi 2005b). Lineaments were extracted from LANDSAT-8 satellite images using Geomatica software 2015 version. Most of landslides took place in distance of 0 – 50 m (68.9 %), and very few landslides occurred at greater distances (Fig. 9b).

Fig. 9
figure 9

(a) Distance to lineaments map, (b) Distribution of pixels in distance to lineaments map

3.2 Preparation of training and testing dataset

Chung and Fabbri (2003) stated that it is impossible to validate the model performance without splitting of the dataset. Therefore, it is important to split dataset into two parts for landslide susceptibility assessment. One part that is used for model building called training dataset, and another that is employed for validating/testing the performance of model called testing dataset. However, there are no thumb rule for selection of the proportion of training and testing datasets (Pradhan 2013). In this study, a ratio of 70 % and 30 % was selected according to randomly partition for training and testing dataset, respectively (Pourghasemi et al. 2012; Pourghasemi et al. 2013b; Xu et al. 2012).

In order to construct training dataset, 70 % of landslides were first extracted randomly from landslide inventory map including 301 landslide locations. These landslides were then converted into 6133 pixels with 20 m* 20 m size. The 6133 non-landslide pixels were also extracted randomly from the study area. Subsequently, the landslide pixels and non-landslide pixels were combined into sampling data. The training dataset was finally constructed by overlaying sampling data onto the eleven landslide conditioning factors. Also, 30 % of remaining landslides were used for making the testing dataset. Similarly, the testing dataset was built with 1614 landslide pixels from 129 landslide locations and 1614 non-landslide pixels.

Landslide pixels were assigned with value of 1 and non-landslide pixels were assigned with value of 0 in dataset for classification (Tien Bui et al. 2015).

3.3 Landslide conditioning factors selection

The quality of landslide susceptibility assessment depends not only on the selected models but also on the quality of the input data (Pradhan 2013). Therefore, the predictive ability of landslide conditioning factors has to be tested before conducting the learning process of the landslide models. The factors that are tested as low or null predictive capability should be removed for improving of results accuracy (Doshi and Chaturvedi 2014).

Feature selection is an effective method in selection of the input data for the model (Dash and Liu 1997). It was used for selection of landslide conditioning factors in the current study. Several feature selection methods used in the literature review such us information gain ratio (Dai and Xu 2013), consistency (Dash and Liu 2003), gain ratio (Karegowda et al. 2010), chi-square statistic (Chen et al. 2009). In this study, linear support vector machine (LSVM) was selected for evaluation of prediction capability of landslide conditioning factors. The LSVM method was proposed by Guyon and Elisseeff (2003) that improves the classification accuracy for modeling by reducing irrelevant and unnecessary input variables (Lin et al. 2008).

According to training dataset and eleven landslide conditioning factors, the test of the predictive capability of these factors was carried out using the LSVM method (Eq. 1):

$$ g(x)=sgn\left({w}^{\mathrm{T}}a+b\right) $$
(1)

Where, w T is the inverse matrix of weight matrix assigned in each landslide conditioning factor, a = (a 1, a 2,…, a 11) is the vector of inputs that contains eleven landslide conditioning factors, b is the offset from the origin of the hyper-plane. The landslide conditioning factor ith with the weight w i close to 0 has a smaller effect on the prediction than the one with larger values of w i (Mladenić et al. 2004).

3.4 Landslide susceptibility models

3.4.1 Naive Bayes (NB)

NB is a statistical classifier based on hypothesis that there is not any dependency between attributes to maximize the posterior probability in determination of the class for classification (Soni et al. 2011). The NB model use Bayes’ theorem that can be constructed in following steps: (i) collecting the examples, (ii) estimating a prior probabilities of each class, (iii) estimating means of classes, (iv) constructing covariance matrices and finding the inverse and determinant for each class, (v) forming the discriminant function for each lass (Bhargavi and Jyothi 2009). The advantage of the NB model is that it requires a small amount of training data to estimate the necessary parameters for classification (Bhargavi and Jyothi 2009). In this study, the performance of the NB model was tested in spatial prediction of landslide occurrence in the study area.

Let x = (x 1, x 2, …, z 11) is the vector of the eleven landslide conditioning factors, y = (y 1, y 2) is the vector of the classifier variables (landslide, non-landslide). The NB classifier is based on the following equation:

$$ {y}_{NBC}=\underset{y_{\mathrm{i}}=\left[\mathrm{landslide},non\ \mathrm{landslide}\right]}{\mathrm{argmax}\ P\left({y}_{\mathrm{i}}\right)}\ {\displaystyle \prod_{i=1}^{10}P\left({x}_{\mathrm{i}}/{y}_{\mathrm{i}}\right)} $$
(2)

where, P(y i) is the prior probability of y i that can be estimated based on the proportion of the observed cases with output class y i in the training dataset. P(x i/y i) is the conditional probability that can be calculated according to Eq. 3:

$$ P\left({x}_{\mathrm{i}}/{y}_{\mathrm{i}}\right)=\frac{1}{\sqrt{2\uppi}\alpha }{e}^{\frac{\hbox{-} {\left({x}_{\mathrm{i}}\hbox{-} \eta \right)}^2}{2{\alpha}^2}},\eta\ \mathrm{is}\ \mathrm{the}\ \mathrm{mean}\ \mathrm{and}\ \alpha\ \mathrm{is}\ \mathrm{the}\ \mathrm{standard}\ \mathrm{deviation}\ \mathrm{of}\ {x}_{\mathrm{i}} $$
(3)

3.4.2 Multilayer Perceptron Neural Networks (MLP Neural Nets)

MLP Neural Nets is one of the artificial neural networks which is a branch of the artificial intelligence (AI) (Gardner and Dorling 1998). MLP Neural Nets has been the most widely used in ANN techniques (Zare et al. 2013). The structure of MLP Neural Nets includes three classes which are input layers, hidden layers, and output layers. The meaning of these classes is various in terminologies for different fields (Yee and De Silva 2002). The training process in MLP Neural Nets is carried out in two steps: (i) the inputs are propagated forward through the hidden layers to result the output values, and then the output values are compared to pre-values in order to estimate the difference, (ii) the connection weights were adjusted to optimize the best results with the least difference (Tien Bui et al. 2015). In this study, MLP Neural Nets was employed for landslide susceptibility assessment in the study area. For this purpose, the input layers are viewed as the landslide conditioning factors, the output layers are understood as the classified results which are landslide or non-landslide classes, and the hidden layers are classifying layers to transform inputs to outputs (Fig. 10).

Fig. 10
figure 10

Multilayer Perceptron Neural Networks used in this study

Suppose that t = t i , i = 1, 2, ..., 11 is the vector of the eleven landslide conditioning factors, Φ = Φ j , j = 1, 2 that represents landslide, non landslide classes. MLP Neural Nets function for classification in this study indicating as following:

$$ \varPhi =f(t) $$
(4)

where f(t) is an unknown function that is optimized by the adjustable network weights during training process for given network architecture.

MLP Neural Nets has been shown as an efficient approach for classification which is better than traditional ones (Benediktsson et al. 1990). There are some advantages when using this approach: (i) no pre-assumptions regarding to the distribution of training dataset, (ii) no decision needs to be made related to the relative importance of the different input measurements, and (iii) the weights are adjusted to select the most input measurements during training process (Gardner and Dorling 1998).

However, there is an important and difficult problem when using MLP Neural Nets for classification that is to determine the optimal number of the hidden layers. It is because an increase in the number of the hidden layers that lead to lessen the output errors for the training examples, but increases the errors for novel examples that can create the “over-fitting” problem (Murata et al. 1994).

3.4.3 Functional Trees (FT)

FT is one of the hierarchical models, which is a framework to construct multivariate trees for regression and classification problems (Gama 2004). FT could use functional inner nodes or functional leaves or combining attributes at both functional inner nodes and leaves in prediction problems (Gama 2004). FT using functional leaves is a variance reduction process whereas FT using functional inner nodes is a bias reduction method, and FT using both functional inner nodes and leaves presents very well in large datasets (Gama 2004). The main difference between FT and other traditional hierarchical models is that FT use a logistic regression function for the splitting in the functional inner nodes and for prediction at the functional leaves instead of dividing the inputs at a tree node by comparing the value of some input attributes with a constant value (Lan et al. 2011). The performance of the FT model depends on the minimum number of instance per leaf, the number of bootstrap iterations and the selection of function trees (Gama 2004).

In this study, the FT model was applied on the first time in landslide susceptibility assessment and its performance was compared with other models such as NB and MLP Neural Nets.

Assigning z = z i , i = 1, 2, ..., 11 are the attributes of the eleven landslide causal factors, f = f j, j = landslide, non landslide classes. The FT model for classification in this study was carried out in following steps: (1) selecting the constructor of Linear Bayes discriminate function (Gama 2000) to construct the model f = f(z i ) which is the probability of distribution of landslide and non-landslide classes, (2) extending z i with new landslide conditioning factors to create new constructed dataset whereas each new landslide conditioning factor is the probability that z i belong to landslide or non-landslide, (3) selecting the landslide conditioning factors from original dataset and all new constructed dataset to build the tree for classification.

Finally, landside susceptibility maps were constructed after training the above mentioned models. In order to generate these maps, landslide susceptibility indices were first calculated for all pixels of the study area. Subsequently, these indices were reclassified into different susceptible categories. Ayalew and Yamagishi (2005b) stated that the classification of the landslide susceptibility indices can be carried out using some methods namely equal interval, natural break, and the standard deviation. Pradhan and Lee (2010) also reported that it can be finalized based on percentage of area of the study region. Schicker and Moon (2012) reported that the natural break method has the best representation of landslide distribution. Since, the natural break method was selected in this study that has been widely used in many researches (Constantin et al. 2011; Irigaray et al. 2007; Zare et al. 2013). According to this method, landslide susceptibility maps can be classified into five categories very low, low, moderate, high, and very high.

3.5 Models performance validation

The performance of three landslide susceptibility models was validated by applying the success rate and prediction rate curves. Success rate curve indicates how suitable the built model is for the landslide susceptibility assessment (Gaprindashvili et al. 2014). Success rate curve is constructed based on a comparison of the landslide susceptibility maps with the number of landslides used in training dataset (6133 landslide pixels) (Pradhan et al. 2010a).

Prediction rate curve that allows estimating the probabilities of landslide occurrence in the future, therefore, it indicates how good the model is and it can be used for validate of the prediction capability of the models (Brenning 2005). Predictive rate curve is generated using the same procedure of success rate curve excepting that it employs the testing dataset (1614 landslide pixels) instead of training dataset (Fabbri et al. 2002).

The AUC value is the area under the curve that is a base for assessing accuracy of the landslide susceptibility models. The AUC values are range from 0.5 to 1.0. The AUC value equals to 1 indicating the ideal model whereas AUC value is equal to 0.5 illustrating the inaccuracy model. If the AUC value is close to 1, the results of the model are very good. In contrast, if the AUC value is close to 0.5, the results of the model is not good (Pradhan 2013).

4 Results and discussion

4.1 Selection of landslide conditioning factors

The prediction capability of the landslide conditioning factors using LSVM method is shown in Fig. 11. It can be observed that distance to roads has the highest predictive capability for landslide model (AM = 10.6). It is because landslides that have been detected in this study mostly taking place beside the roads or highway or slope cutting (Fig. 1). This is also in line with other research was carried out by Tien Bui et al. (2014a).

Fig. 11
figure 11

The prediction capability of the eleven landslide conditioning factors in this study

Slope angle is also having very high contribution to landslide model in this study (AM = 10.4). It is also comparable with another studies including Ohlmacher and Davis (2003) and Van Den Eeckhaut et al. (2006) who stated that slope angle is one of the most important factors to landslide occurrence.

Other factors like elevation (AM = 9.0), rainfall (AM = 8), land cover (AM = 6.9), distance to lineaments (AM = 6.1), curvature (AM = 4.8), soil type (AM = 3.6), lithology (AM = 3.3), slope aspect (AM = 2.2), distance to rivers (AM = 1.1), respectively, also having significantly contribution to landslide models. They were widely used in many landslide susceptibility researches (Chen et al. 2015; Dou et al. 2015a; Feizizadeh et al. 2014; Fourniadis et al. 2007; Martha et al. 2013; Vijith and Madhu 2008).

Overall, all of the eleven landslide conditioning factors proposed in this study are having a contribution to landslide model, and it were all selected for building the landslide model. As a remark, it is necessary to test the prediction capability of the landslide conditioning factors before conducting the learning process of the landslide models.

4.2 Landslide susceptibility mapping using the MLP Neural Nets, FT, and NB models

Using the NB model, landslide susceptibility map was generated and shown in Fig. 12. Landslide susceptibility classes were reclassified into five classes with the respective susceptible index intervals such as very low (0 – 0.106), low (0.106 – 0.298), moderate (0.298 – 0.537), high (0.537 – 0.787), very high (0.787 – 0.999). It can be observed that very low class has largest area (51.15 %), following by low (17.62 %), very high (11.46 %), moderate (10.82 %), and high (8.95 %), respectively. The performance of landslide susceptibility map using the NB model also indicates in Fig. 12. Obviously, it indicates very good performance because landslides are shown the most in very high (59.95 %), and high classes (16.52 %). There are very few landslides presenting in moderate, low, and very low classes.

Fig. 12
figure 12

Landslide susceptibility map and its performance using the NB model

Landslide susceptibility map using the MLP Neural Nets model was constructed and shown in Fig. 13. Landslide susceptibility classes were reclassified into five classes with the respective susceptible index intervals such as very low (0.008 – 0.113), low (0.113 – 0.261), moderate (0.261 – 0.481), high (0.481 – 0.752), very high (0.752 – 0.929). Fig. 13 shows that low class has largest area (46.68 %), following by very low (20.12 %), very high (14.43 %), moderate (13.27 %), and high (5.5 %), respectively. Landslide susceptibility map using the MLP Neural Nets model also performs very well. The highest number of landslides is in very high class (71.59 %), followed by high class (9.2 %), moderate (8.98 %), low (8.49 %), and very low (1.73 %) classes, respectively.

Fig. 13
figure 13

Landslide susceptibility map and its performance using the MLP Neural Nets model

Also, landslide susceptibility map built using the FT model is shown in Fig. 14. Landslide susceptibility classes were reclassified into five classes with the corresponding susceptible index intervals including very low (0– 0.114), low (0.114 – 0.345), moderate (0.345 – 0.592), high (0.592 – 0.847), very high (0.847 – 1). Very low class has largest area (71.74 %), followed by very high (9.49 %), low (6.86 %), moderate (6.07 %), and high (5.84 %), respectively. Using the FT model, landslide susceptibility map performs very well as well (Fig. 14). Very high class has the highest number of landslides (68.14 %). High class also has the high number of landslides. Landslides are shown very less in the remaining classes.

Fig. 14
figure 14

Landslide susceptibility map and its performance using the FT model

It can be concluded that, all of three landslide susceptibility maps were produced using three models indicating a very good accuracy, whereas landslide susceptibility map produced by the MLP Neural Nets model is slightly higher than others.

4.3 Models performance and their comparison

Regarding to the FT model, an optimal test was first carried out with various parameters, and the best performance was obtained by using functional leaves with 50 instances per leaf, and 30 bootstrap iterations. For the MLP Neural Nets model, the parameters for momentum, learning rate, and training time were set up is 0.2, 0.3, and 500, respectively, the logistic sigmoid was employed as the activation function (Sasikala et al. 2014; Şenkal and Kuleli 2009). To determine the number of the hidden layers in this study, a test was carried out with various numbers of hidden layers based on predictive accuracy (ACC) and the area under the receiver operating characteristics curve (AUC) using testing data. The optimal number of the hidden layers was determined is 2 in this study as shown in Table 1.

Table 1 The performance of the MLP Neural Nets model with the different number of hidden neurons

The final results are shown in Fig. 15. It can be observed from Fig. 15a that all of the three landslide models are very suitable for landslide susceptibility assessment according to the performance of success rate curve. The FT model indicates the highest reasonable degree (AUC = 0.9), the MLP Neural Nets model is less suitable (AUC = 0.876), and the NB model (AUC = 0.838) presents the lowest suitable comparing to others. However, Fig. 15b shows that the prediction capability of the MLP Neural Nets model is the highest with AUC value of 0.851, following by the FT model (AUC = 0.849), and the NB model (AUC = 0.838), respectively.

Fig. 15
figure 15

(a) Success rate curve, (b) Predictive rate curve

The NB model is fast and efficient to deal with discrete and continuous attributes. It has very well performance in solving real-life problems. However, due to independent assumptions on the attributes which is not always true in fact. Especially, in landslide studies, the landslide conditioning factors have high correlation (Tien Bui et al. 2015), therefore, the assumption of non-independence of these factors in the NB model might result poor performance (Zhang and Su 2004). Indeed, the performance of the NB model in this study is slightly less than the MLP Neural Nets and FT models.

Meanwhile, the MLP Neural Nets is a method, that is based on back-propagation algorithm, which is very flexible and adaptable in modeling (Basheer and Hajmeer 2000). It is considered as a standard algorithm for any supervised-learning pattern recognition process, however, it uses ‘black box” for classification that is very difficult to explain the performance of results in some cases (Soria et al. 2008). In this study, the MLP Neural Nets model outperforms the FT and NB models. It can be explained that the MLP Neural Nets model uses loops to optimize weights assigning for input data which is not used in the FT and NB models, thus its prediction capability can be improved (Gardner and Dorling 1998). These results also are comparable with another research such as Kapur et al. (2005) which shown that the MLPN model is really better than the NB model in motion capture data.

Moreover, one of the most attractive features of the FT model is that it uses the bootstrap aggregating (bagging) (Breiman 1996) which is simple variance reduction method to reduce its error (Gama 2004). Furthermore, the FT model was considered as an effective multivariable decision tree method for classification (Gama 2004). Chen et al. (2014) also stated that the FT model has the highest prediction capability compared with decision tree and logistic regression models in biodegradability prediction of chemicals. In this study, for the first time, the FT model was applied successfully in landslide problem. Its prediction capability is even better than the NB model.

Overall, all of these three landslide models are suitable for landslide susceptibility assessment in the study area. Therefore, land planners and engineers can select these models to construct landslide susceptibility map that can also be helpful in mitigating of landslide destruction.

5 Conclusion

Landslide susceptibility assessment is one of the hottest topics in recent decades due to its usefulness in identification of landslide susceptible areas that can be helpful for land use planning and decision makers and in warning landslide hazards. Even though NB, MLP Neural Nets, and FT were applied individually in many researches for solving various problems, these models had not been compared in landslide susceptibility assessment. Especially, the FT is the model that has not so far employed in landslide research. The main goal of the current study is to make a comparison on predictive abilities of these three models for landslide susceptibility mapping in the study area. Eleven landslide conditioning factors was exploited that have affected to landslide occurrence in this study area including slope angle, slope aspect, elevation, curvature, lithology, soil type, land cover, distance to roads, distance to lineaments, distance to rivers, and rainfall. These factors were used to analyze the spatial relationship between them and landslide occurrence. The prediction capability of three models was validated and compared using the success rate and predictive rate curves. The results show that three landslide models perform very reasonable for landslide susceptibility assessment. Out of these, the MLP Neural Nets model performs slightly better than the FT and NB models. Also, the predictive capability of the NB model is less than the MLP Neural Nets and FT models. As a final conclusion, all of three landslide models employed in this study perform as promising methods for landslide susceptibility assessments. It can be a good selection for the same purposes in other areas where are prone to landslide occurrence. More importantly, the construction of the road is the main reason for landslide occurrence in this study area. Since, it is essential to use of protective measures to keep slope stability when constructing roads or highways.