
1 Introduction

1.1 Motivation

Machine learning methods involving large graphs face a common problem, namely the natural sparsification of data as the number of dimensions d increases. In this regard, obtaining the proximity structure of the data is a key step for the subsequent analysis. This problem has been considered from two complementary perspectives: efficiency and utility. On the one hand, an efficient, i.e. scalable, proximity structure typically emerges from reducing the \(O(dn^2)\) time complexity of kNN graphs, where n is the number of samples. The classical approach for dealing with large graphs is the Nyström method. It consists of sampling either the feature space or the affinity space so that the eigenproblems associated with clustering relaxations become tractable. For instance, a variational version of this method is proposed in [10]. In [6] an approximate kNN graph is obtained in \(O(dn^t)\) with \(t\in (1,2)\) by recursively dividing and gluing the samples. More recently, anchor graphs [13, 15] provide data-to-anchor kNN graphs, where \(m\ll n\) representatives (anchors) are typically obtained through K-means clustering, in \(O(dmnT + dmn)\), where O(dmnT) is due to the T iterations of the K-means process. These graphs tend to make out-of-sample predictions compatible with those of Nyström approximations, and in turn their approximate adjacency/affinity matrices are ensured to be positive semidefinite.
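To make this construction concrete, the following is a minimal sketch of a data-to-anchor mapping Z (assuming numpy and scikit-learn; the function name, the choice of connecting each sample to its s closest anchors, and the Gaussian weighting are illustrative assumptions, not prescriptions of [13, 15]):

```python
import numpy as np
from sklearn.cluster import KMeans

def anchor_graph_Z(X, m=100, s=3, sigma=1.0, T=10):
    """Sketch of a data-to-anchor kNN mapping: returns the n x m matrix Z.
    Anchors come from T iterations of K-means (the O(dmnT) term);
    the data-to-anchor distances account for the O(dmn) term."""
    n = X.shape[0]
    anchors = KMeans(n_clusters=m, n_init=1, max_iter=T).fit(X).cluster_centers_
    d2 = ((X[:, None, :] - anchors[None, :, :]) ** 2).sum(-1)  # n x m squared distances
    Z = np.zeros((n, m))
    for i in range(n):
        nearest = np.argsort(d2[i])[:s]                   # s closest anchors of sample i
        w = np.exp(-d2[i, nearest] / (2.0 * sigma ** 2))  # Gaussian affinities (bandwidth is our choice)
        Z[i, nearest] = w / w.sum()                       # each row of Z sums to one
    return Z
```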

On the other hand, the utility of the kNN representation refers to its suitability for predicting or inferring some properties of the data. These properties include (a) the underlying density and (b) the geometry induced by both the shortest path distances and the commute time distances. Concerning the density, it is well known that it can be estimated from the degrees of the kNN graph if its edges contain the local similarity information between the data, i.e. when the graph is weighted. However, when the kNN graph is unweighted the estimation is only acceptable for reasonably dense graphs, for instance when \(k^{d+2}/(n^2\log ^dn)\rightarrow \infty \) as proposed in [20]. Such densities are unrealistic, though, since the typical regime, the one adopted in practice, is \(k\approx \log n\). A similar conclusion is reached when shortest path distances are analyzed in both weighted and unweighted kNN graphs. The shortest path distance computed from an unweighted kNN graph typically diverges from the geodesic distance, whereas this is not the case for the one computed from a weighted kNN graph. The solution proposed in [1] consists of assigning proper weights to the edges of the unweighted kNN graph. Since these weights depend heavily on the ratio \(r=(k/(n\mu _d))^{1/d}\), where \(\mu _d\) is the volume of a \(d-\)dimensional unit ball, one expects \(r\rightarrow 0\) even for moderate values of d, meaning that for high-dimensional data both unweighted and weighted graphs yield similar, i.e. diverging, estimations. Finally, it is well known that for large (unweighted) kNN graphs the commute time distance can be misleading since it only relies on the local densities (degrees) of the nodes [21, 22].
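The classical kNN density estimate behind this discussion is \(\hat{p}(x)=k/(n\,\mu _d\,r_k(x)^d)\), where \(r_k(x)\) is the distance from x to its k-th neighbour. A brute-force numpy sketch (illustrative only, quadratic in n):

```python
import numpy as np
from math import gamma, log, pi

def knn_density(X, k=None):
    """kNN density estimate p(x) = k / (n * mu_d * r_k(x)^d), with mu_d the
    volume of the d-dimensional unit ball and r_k the k-th neighbour distance."""
    n, d = X.shape
    k = k or max(1, int(log(n)))                 # the practical regime k ~ log n
    mu_d = pi ** (d / 2.0) / gamma(d / 2.0 + 1)  # volume of the unit d-ball
    D = np.sqrt(((X[:, None, :] - X[None, :, :]) ** 2).sum(-1))
    r_k = np.sort(D, axis=1)[:, k]               # column 0 is the distance to the point itself
    return k / (n * mu_d * r_k ** d)
```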

Therefore, for a standard machine learning setting (\(n\rightarrow \infty \), \(k\approx \log n\) and large d) we have that kNN graphs result in a sparse, globally uninformative representation. This extends to \(\epsilon -\)graphs and Gaussian graphs as well. As a result, machine learning algorithms for graph-based embedding, clustering and label propagation tend to produce misleading results unless we are able to preserve the distributional information of the data in the graph-based representation. In this regard, recent experimental results with anchor graphs suggest a way to proceed. In [5], the predictive power of non-parametric regression rooted in the anchors/landmarks provides a way of constructing very informative weighted kNN graphs. Since anchor graphs are bipartite (only data-to-anchor edges exist), this representation bridges the sparsity of the pattern space because a random walk traveling from node u to node v must pass through one or more anchors in advance. In other words, for a sufficient number of anchors it is possible to find links between distant regions of the space. This opens a new perspective for computing meaningful commute distances in large graphs. It is straightforward to check that the spectral properties of the approximate weight matrix \(W=Z\varLambda Z^T\), where \(\varLambda ={{\mathrm{diag}}}(Z^T1)\) and Z is the data-to-anchor mapping matrix, rely on its low rank. It is then possible to compute a reduced number of eigenvalue-eigenvector pairs associated with a small \(m\times m\) matrix, where m is the number of anchors (see [16] for details). In this way, the spectral expression of the commute distance [18] can accommodate these pairs to produce meaningful distances. Our interpretation is that the goodness of the eigenvalue-eigenvector pairs is a consequence of performing a kernel PCA process over \(ZZ^T\), where the columns of Z act as kernel functions. This interpretation is consistent with the good hashing results obtained with anchor graphs [14, 16], where the kernel encoded in the columns of Z is extensively exploited.
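A sketch of this low-rank computation follows (numpy). Caveat: we adopt the inverse-degree normalization \(W=Z\varLambda ^{-1}Z^T\) with \(\varLambda ={{\mathrm{diag}}}(Z^T1)\), common in the anchor-graph literature [16], and the reduction of the commute distance to the m anchor eigenpairs is our own simplified reading of [18]:

```python
import numpy as np

def anchor_spectra(Z):
    """Eigenpairs of W = Z Lam^{-1} Z^T obtained from a small m x m problem.
    With B = Z Lam^{-1/2} and M = B^T B = V diag(sig) V^T, the columns of
    U = B V diag(sig)^{-1/2} are orthonormal eigenvectors of W = B B^T."""
    lam = Z.sum(axis=0)                             # Lam = diag(Z^T 1)
    B = Z / np.sqrt(lam)
    sig, V = np.linalg.eigh(B.T @ B)                # m x m eigenproblem instead of n x n
    U = B @ V / np.sqrt(np.maximum(sig, 1e-12))     # lift eigenvectors to R^n
    return sig, U

def commute_distances(sig, U, vol):
    """Spectral commute distance (sketch): since rows of W sum to one,
    L = I - W has eigenvalues nu_i = 1 - sig_i, and
    CT(u,v) ~ vol * sum_i (1/nu_i) (U_ui - U_vi)^2 over nonzero nu_i."""
    nu = 1.0 - sig
    keep = (nu > 1e-9) & (sig > 1e-9)               # drop trivial and numerically null pairs
    F = U[:, keep] / np.sqrt(nu[keep])
    return vol * ((F[:, None, :] - F[None, :, :]) ** 2).sum(-1)
```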

Although anchor graphs provide meaningful commute distances with low-complexity spectral representations, some authors have proposed more efficient methods where anchor graphs are bypassed for computing these distances. For instance, Chawla and coworkers [9, 11] exploit the fact that commute distances can be approximated by a randomized algorithm in \(O(n\log n)\) [19]. Then, using standard kNN graphs with low k to avoid intra-class noise, their method beats anchor graphs, in terms of clustering accuracy, on several databases. These results seem to contradict von Luxburg and Radl's fundamental bounds (in principle, commute distances cannot be properly estimated from large kNN graphs [22]). The authors argue that this can only be explained by the fact that their graphs are quite different from those explored for defining the fundamental bounds (particularly the \(\epsilon -\)geometric graphs). Their estimator works better than anchor graphs on dense datasets, i.e. in settings with a low number of classes and many samples. Our preliminary experiments with the NIST database, with ten classes, confirm that their technique does not improve on anchor graphs when the data is sparse enough, as happens in a standard machine learning setting.

1.2 Contributions

We claim that one way of providing meaningful estimations of commute distances is to transform the input sparse graph into a densified graph. This implies the inference of novel links between data from existing ones. This is exactly what anchor graphs do when they incorporate data-to-anchor edges. In this paper, we show that the inference of novel edges can be done by applying recent results in theoretical computer science, namely cut densification, which in turn is an instance of graph densification. Graph densification consists of populating an input graph G with new edges (or weights if G is weighted) so that the output graph H preserves or enforces some structural properties of G. Graph densification offers a principled way of dealing with sparse graphs arising in machine learning so that commute distances can be properly estimated. In this paper we will introduce the main principles of densification and explore their implications in Pattern Recognition (PR). In our experiments (see the Discussion section) we will show how the associated optimization problems (primal and dual) lead to a reasonable densification (in terms of PR). To the best of our knowledge this is the first application of densification principles to the estimation of the commute distance.

2 Graph Densification

2.1 Combinatorial Formulation

Graph densification [8] is a principled study of how to significantly increase the number of edges of an input graph G so that the output, H, approximates G with respect to a given test function, for instance whether there exists a given cut. This study is motivated by the fact that certain NP-hard problems have a PTAS (Polynomial Time Approximation Scheme) when their associated graphs are dense. This is the case for the MAX-CUT problem [2]. Frieze and Kannan [7] raise the question of whether this “easiness” is explained by the Szemerédi Regularity Lemma, which states that large dense graphs have many properties of random graphs [12].

For a standard machine learning setting, we have that G is typically sparse either when a kNN representation is used or when a Gaussian graph, usually constructed with a bandwidth parameter t satisfying \(t\rightarrow 0\), is chosen. Then, the densification of G so that the value of any cut is at most C times the value of the same cut in G is called a one-sided C-multiplicative cut approximation. This (normalized) cut approximation must satisfy:

$$\begin{aligned} \frac{cut_H(S)}{m(H)}\le C\cdot \frac{cut_G(S)}{m(G)}, \end{aligned}$$
(1)

for any subset \(S\subset V\) of the set of vertices V, where \(cut_G(S)=\sum _{u\in S,v\in V\setminus S}x_{uv}\) considers edge weights \(\{x_{uv}\}_{u,v\in V}\) with \(x_{uv}\in [0,1]\). For H we have \(cut_H(S)=\sum _{u\in S,v\in V\setminus S}x'_{uv}\) for edge weights \(\{x'_{uv}\}_{u,v\in V}\) also satisfying \(x'_{uv}\in [0,1]\). Cuts are normalized by the total edge weight m(.) of each graph, i.e. \(m(G)=\sum _{u,v}x_{uv}\) and \(m(H)=\sum _{u,v}x'_{uv}\).
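This condition can be checked by brute force on small graphs; a numpy sketch, enumerating all \(2^n\) cuts (only viable for tiny n):

```python
import numpy as np
from itertools import combinations

def cut_value(W, S):
    """cut(S) = sum of weights x_uv with u in S and v in V \\ S."""
    mask = np.zeros(W.shape[0], dtype=bool)
    mask[list(S)] = True
    return W[mask][:, ~mask].sum()

def is_cut_approximation(W_G, W_H, C, tol=1e-12):
    """Brute-force check of Eq. 1 over every nonempty proper subset S."""
    n = W_G.shape[0]
    mG, mH = W_G.sum(), W_H.sum()       # total edge weights m(G), m(H)
    for r in range(1, n):
        for S in combinations(range(n), r):
            if cut_value(W_H, S) / mH > C * cut_value(W_G, S) / mG + tol:
                return False
    return True
```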

Cut Similarity and Optimization Problem. The cut approximation embodies a notion of similarity referred to as \(C-\)cut similarity. Two graphs G and H are C-cut similar if \(cut_H(S)\le C\cdot cut_G(S)\) for all \(S\subset V\), i.e. if the sum of the weights over the cut edges is approximately the same for every partition. Considering the normalized version in Eq. 1, finding the optimal one-sided \(C-\)multiplicative cut densifier can be posed as the following linear program:

$$\begin{aligned} \mathbf {P1}:\quad \max _{x'}\; \sum _{u,v}x'_{uv} \quad \text {s.t.}\;\; \sum _{u\in S,v\in V\setminus S}x'_{uv} \le C\cdot \frac{cut_G(S)}{m(G)}\sum _{u,v}x'_{uv}\;\;\forall S\subset V, \quad 0\le x'_{uv}\le 1, \end{aligned}$$
(2)
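For very small graphs, \(\mathbf {P1}\) can be solved directly; note that the normalization of Eq. 1 stays linear in the variables because \(m(H)=\sum _{u,v}x'_{uv}\). A sketch with scipy (the function name and the brute-force enumeration of the \(2^n\) constraints are ours):

```python
import numpy as np
from itertools import combinations
from scipy.optimize import linprog

def densify_P1(W_G, C):
    """Brute-force LP for the one-sided C-multiplicative cut densifier.
    Variables: x'_uv on the n(n-1)/2 unordered pairs, one constraint per cut."""
    n = W_G.shape[0]
    pairs = list(combinations(range(n), 2))
    mG = W_G.sum()
    A, b = [], []
    for r in range(1, n):
        for S in combinations(range(n), r):
            inS = np.zeros(n, dtype=bool)
            inS[list(S)] = True
            cutG = W_G[inS][:, ~inS].sum()
            # cut_H(S) <= (C * cut_G(S) / m(G)) * m(H); m(H) counts ordered pairs, hence the 2
            A.append([float(inS[u] != inS[v]) - 2.0 * C * cutG / mG for (u, v) in pairs])
            b.append(0.0)
    res = linprog(c=-np.ones(len(pairs)),            # maximize the total edge weight
                  A_ub=np.array(A), b_ub=np.array(b),
                  bounds=[(0, 1)] * len(pairs))      # assumes the LP is feasible
    W_H = np.zeros((n, n))
    for x, (u, v) in zip(res.x, pairs):
        W_H[u, v] = W_H[v, u] = x
    return W_H
```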

Herein, the term one-sided refers to satisfying only the upper bound in Eq. 1. The program \(\mathbf {P1}\) has \(2^n\) constraints, where \(n=|V|\), since for every possible cut induced by S, the sum of the corresponding edge weights \(\sum _{u\in S,v\in V\setminus S}x'_{uv}\) is bounded by C times the (normalized) sum of the weights for the same cut in G. The solution is the set of edge weights \(x'_{uv}\) with maximal sum such that the resulting graph H is \(C-\)cut similar to G. The NP-hardness of this problem can be better understood if we formulate the dual LP. To this end we must consider a cut metric \(\delta _S(.,.)\), where [4]

$$\begin{aligned} \delta _S(u,v) = \left\{ \begin{array}{ll} 1 & \text {if}\;\; |\{u,v\}\cap S| = 1\\ 0 & \text {otherwise} \end{array} \right. \end{aligned}$$
(3)

i.e. \(\delta _S\) accounts for pairs of nodes (not necessarily defining an edge) with exactly one end-point in S. As there are \(2^n\) subsets S of V, we can define the following metric \(\rho \) on \(V\times V\): \(\rho = \sum _{S}\lambda _S\delta _S\), with \(\lambda _S\ge 0\), a non-negative combination of an exponential number of cut metrics. For a particular pair \(\{u,v\}\) we have that \(\rho (u,v)= \sum _{S}\lambda _S\delta _S(u,v)\) accounts for the (weighted) number of subsets of V where either u or v (but not both) is an end-point. If a graph G has many cuts where \(\frac{cut_G(S)/m(G)}{\sum _{u,v}\delta _S(u,v)}\rightarrow 0\) then we have that \(\rho (u,v) \ge \mathbb {E}_{(u',v')\in E}\;\rho (u',v')\) since

$$\begin{aligned} \mathbb {E}_{(u',v')\in E}\;\rho (u',v')= \sum _{S}\lambda _S\mathbb {E}_{(u',v')\in E}\;\delta _S(u',v') = \sum _{S}\lambda _S \frac{cut_G(S)}{m(G)}. \end{aligned}$$
(4)

Such cuts are called sparse cuts, since the number of pairs \(\{u,v\}\) involved in edges is a small fraction of the overall number of pairs associated with a given subset S, i.e. the graph stretches at a sparse cut. The existence of sparse cuts, more precisely non-overlapping sparse cuts, allows the separation of a significant number of vertex pairs \(\{u,v\}\) whose distance, for instance \(\rho (u,v)\), is larger (to some extent) than the average distance taken over edges. This rationale is posed in [8] as satisfying the condition

$$\begin{aligned} \sum _{u,v\in V}\min \left\{ \rho (u,v) - C\cdot \mathbb {E}_{(u',v')\in E}\;\rho (u',v'), 1\right\} \ge (1-\alpha )n^2, \end{aligned}$$
(5)

where C is a constant as in the cut approximation, and \(\alpha \in (0,1)\). This means that a quadratic number of non-edge pairs are bounded away from the average length of an edge. In other words, it is then possible to embed the nodes involved in these pairs in such a way that their distances in the embedding do not completely collapse. This defines a so-called \((C,\alpha )\) humble embedding.

Finding the metric \(\rho (u,v)\) that best defines a humble embedding is the dual problem of \(\mathbf{P1}\):

$$\begin{aligned} \mathbf {P2}:\quad \min _{\lambda ,\sigma }\; \sum _{u,v}\sigma _{uv} \quad \text {s.t.}\;\; \rho (u,v) - C\cdot \mathbb {E}_{(u',v')\in E}\;\rho (u',v')\ge 1 - \sigma _{uv},\;\; \rho = \sum _{S\subseteq V}\lambda _S\delta _S,\;\; \lambda _S\ge 0,\;\; \sigma _{uv}\ge 0, \end{aligned}$$
(6)

where the search space is explicitly the power set of V.

Since the optimal solution of \(\mathbf{P2}\) must satisfy

$$\begin{aligned} \sigma _{uv} = \max \left\{ 0, C\cdot \mathbb {E}_{(u',v')\in E}\;\rho (u',v') + 1 - \rho (u,v)\right\} , \end{aligned}$$
(7)

we have that \(\mathbf{P2}\) can be written in a more compact form:

$$\begin{aligned} \min _{\rho } \sum _{u,v}\max \left\{ 0, C\cdot \mathbb {E}_{(u',v')\in E}\;\rho (u',v') + 1 - \rho (u,v)\right\} , \end{aligned}$$
(8)

which is equivalent to \(n^2 - \max _{\rho } \sum _{u,v}\min \left\{ 1, \rho (u,v) - C\cdot \mathbb {E}_{(u',v')\in E}\;\rho (u',v') \right\} .\)

Therefore, a solution satisfying \(\sum _{u,v}\sigma _{uv} = \alpha n^2\) implies that the graph has a humble embedding since

$$\begin{aligned} \max _{\rho } \sum _{u,v}\min \left\{ 1, \rho (u,v) - C\cdot \mathbb {E}_{(u',v')\in E}\;\rho (u',v') \right\} =(1-\alpha )n^2. \end{aligned}$$
(9)

Since the \(\sigma _{uv}\) variables in the constraints of \(\mathbf {P2}\) are the dual variables of \(x'_{uv}\) in \(\mathbf {P1}\), the existence of a \((C,\alpha )\) humble embedding rules out a C-densifier with edge weight greater than \(\alpha n^2\), and vice versa.

2.2 Spectral Formulation

Since \(Q_G(z)=z^TL_Gz=\sum _{e_{uv}\in E}x_{uv}(z_u - z_v)^2\), if z is the characteristic vector of S (1 inside and 0 outside) then Eq. 1 is equivalent to

$$\begin{aligned} \frac{z^TL_Hz}{m(H)}\le C\cdot \frac{z^TL_Gz}{m(G)}, \end{aligned}$$
(10)

for \(0-1\) valued vectors z, where \(L_G\) and \(L_H\) are the respective Laplacians. However, if H satisfies Eq. 10 for any real-valued vector z, then we have a one-sided C-multiplicative spectral approximation of G. This spectral approximation embodies a notion of similarity between the Laplacians \(L_G\) and \(L_H\). We say that G and H are \(C-\)spectrally similar if \(z^TL_Hz\le C\cdot z^TL_Gz\), which is denoted by \(L_H \preceq C\cdot L_G\). Spectrally similar graphs share many algebraic properties [3]. For instance, their effective resistances (rescaled commute times) are similar. This similarity is bounded by C and it leads to nice interlacing properties: the eigenvalues \(\lambda _1,\ldots ,\lambda _n\) of \(L_G\) and the eigenvalues \(\lambda '_1,\ldots ,\lambda '_n\) of \(L_H\) satisfy \(\lambda '_i\le C\cdot \lambda _i\). This implies that H does not necessarily increase the spectral gap of G, and the eigenvalues of \(L_G\) are not necessarily shifted (i.e. increased).
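For small graphs, the ordering \(L_H \preceq C\cdot L_G\) can be verified directly, since it is equivalent to \(C\cdot L_G - L_H\succeq 0\); a minimal numpy sketch:

```python
import numpy as np

def laplacian(W):
    """Combinatorial Laplacian L = D - W."""
    return np.diag(W.sum(axis=1)) - W

def is_spectrally_similar(W_G, W_H, C, tol=1e-9):
    """Check L_H <= C * L_G in the PSD order, which implies lambda'_i <= C * lambda_i."""
    M = C * laplacian(W_G) - laplacian(W_H)
    return np.linalg.eigvalsh(M).min() >= -tol
```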

Whereas the spectral similarity of two graphs can be estimated to precision \(\epsilon \) in time polynomial in n and \(\log (1/\epsilon )\), it is NP-hard to approximately compute the cut similarity of two graphs. This is why existing theoretical advances in the interplay of these two concepts are restricted to existence theorems as a means of characterizing graphs. However, the semidefinite programs associated with finding both optimal cut densifiers and, more realistically, optimal spectral densifiers are quite inspirational, since they suggest scalable computational methods for graph densification.

Spectral Similarity and Optimization Problem. When posing \(\mathbf {P1}\) and \(\mathbf {P2}\) so that they are tractable (i.e. polynomial in n), the cut metric \(\rho \), which has a combinatorial nature, is replaced by a norm in \(\mathbb {R}^n\). In this way, the link between the existence of humble embeddings and that of densifiers is more explicit. Then, let \(z_1,\ldots ,z_n\in \mathbb {R}^n\) be the vectors associated with a given embedding. The concept of a \((C,\alpha )\) humble embedding can be redefined in terms of satisfying:

$$\begin{aligned} \sum _{u,v\in V}\min \left\{ ||z_u - z_v ||^2- C\cdot \mathbb {E}_{(u',v')\in E}\; ||z_u' - z_v' ||^2, 1\right\} \ge (1-\alpha )n^2, \end{aligned}$$
(11)

where distances between pairs should not globally collapse when compared with those between pairs associated with edges. Then the constraint in \(\mathbf {P2}\) associated with the pair \(\{u,v\}\) should be rewritten as:

$$\begin{aligned} ||z_u - z_v ||^2- C\cdot \mathbb {E}_{(u',v')\in E}\; ||z_u' - z_v' ||^2\ge 1 -\sigma _{uv}. \end{aligned}$$
(12)
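The left-hand side of Eq. 11, i.e. the humility used later in Fig. 2, can be computed directly from an embedding. A numpy sketch (we assume unordered pairs and an edge expectation weighted by the edge weights; both are our reading of the text):

```python
import numpy as np

def humility(Z, edges, weights, C):
    """Sum over pairs of min{ ||z_u - z_v||^2 - C * E_edges ||z_u' - z_v'||^2, 1 }.
    Z holds the embedding coordinates column-wise, as in Fig. 2."""
    n = Z.shape[1]
    P = Z.T                                               # one row per vertex
    D2 = ((P[:, None, :] - P[None, :, :]) ** 2).sum(-1)   # pairwise squared distances
    w = np.asarray(weights, dtype=float)
    exp_edge = sum(wi * D2[u, v] for (u, v), wi in zip(edges, w)) / w.sum()
    total = sum(min(D2[u, v] - C * exp_edge, 1.0)
                for u in range(n) for v in range(u + 1, n))
    # a (C, alpha) humble embedding requires total >= (1 - alpha) * n**2
    return total
```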
Fig. 1.

Densification example. Graph G with \(n=6\) nodes. Top-left: cuts associated with the sets \(S=\{2,4\}\) and \(S=\{1,2\}\). We define \(Sep(S)=\sum _{u,v}\delta _S(u,v)\). For the cut \(S=\{2,4\}\) there are 4 pairs associated with edges and 4 pairs not associated with edges (top-right). This means that this cut is sparse, since \(\frac{cut(S)}{Vol(G)Sep(S)}=0.0622\). Bottom-left: the densification H resulting from solving the spectral version of problem \(\mathbf {P1}\) (Eq. 2) through the dual problem \(\mathbf {P2}\) (Eq. 6) for \(C=2\). Red-dotted lines have weight 0.001. Some cuts have lower values, for instance the one for \(S=\{2,4\}\), whereas others, such as the cut for \(S=\{1,2\}\), increase (bottom-right). This is important since the new volume has also increased. All cuts satisfy \(\frac{cut_H(S)}{m(H)}\le C\cdot \frac{cut_G(S)}{m(G)}\). (Color figure online)

Therefore, \(\mathbf {P2}\) is a linear problem with quadratic constraints. For \(Z=[z_1,\ldots ,z_n]\) we have that \(|| z_u - z_v||^2 = b_{uv}^TZ^TZb_{uv}\), where \(b_{uv}=e_u-e_v\). Then, a semidefinite programming (SDP) relaxation leads to expressing the first term of the left part of each inequality as \(b_{uv}^TZb_{uv}\), provided that \(Z\succeq 0\). Similarly, for the relaxation corresponding to the expectation part of each inequality, we consider the fact that the Laplacian of the graph can be expressed as \(L_G=\sum _{u,v}w_{uv}b_{uv}b_{uv}^T\). Since \(z^TL_Gz=\sum _{(u',v')\in E}w_{u'v'}|| z_{u'} - z_{v'}||^2\), if \(z\sim \mathcal{N}(0,Z)\), i.e. z is assumed to be a zero-mean vector in \(\mathbb {R}^n\) with covariance \(Z\succeq 0\), we have that \(\mathbb {E}_{(u',v')\in E}\; ||z_{u'} - z_{v'}||^2\) can be expressed in terms of \({{\mathrm{tr}}}(L_GZ)\) (see [17] for details). Therefore the SDP formulation of \(\mathbf {P2}\) is as follows

$$\begin{aligned} \mathbf {P2_{SDP}}:\quad \min _{Z,\sigma }\; \sum _{u,v}\sigma _{uv} \quad \text {s.t.}\;\; b_{uv}^TZb_{uv} - C\cdot {{\mathrm{tr}}}(L_GZ) \ge 1 - \sigma _{uv},\;\; \sigma _{uv}\ge 0,\;\; Z\succeq 0, \end{aligned}$$
(13)
Fig. 2.

SDP Dual Solution. Middle: Z-matrix whose columns \(z_1,\ldots ,z_n\) are the embedding coordinates. Such an embedding is optimal insofar as it assigns similar coordinates to vertices separated by the sparse cut \(S=\{1,2,3\}\). Intra-class pairwise distances between columns are close to zero, whereas inter-class distances are close to 3.0. The Z matrix thus encodes the sparse cut itself. Right: to estimate to what extent the columns of Z define a humble embedding, we commence by computing the distances associated with the edges of the graph. This yields \(\mathbb {E}_{(u',v')\in E}\; ||z_u' - z_v' ||^2=0.6\), where the average is distorted due to the edge (2, 4). Regarding edge pairs, deviations from the expectation are \(-1.2\) for intra-class edges and \(+1.8\) for the only inter-class edge. When considering non-edge pairs, for inter-class pairs we have a deviation of \(3.0-0.6=2.4\), whereas for intra-class non-edge pairs, basically (1, 3) and (5, 6), the deviation is negative: \(-0.6\). Therefore, for computing the humility of the embedding (see text) we have only 6 deviations smaller than the unit: 4 of them correspond to intra-class edges and 2 to intra-class non-edge pairs. The remainder corresponds to the 9 inter-class pairs. The resulting humility is 1.8, meaning that \((1-\alpha )n^2=1.8\), i.e. \(\alpha = 0.95\). Therefore, the graph does not have a one-sided C-multiplicative spectral densifier with edge weight more than \(\alpha n^2=34.2\). Actually, the weight of the obtained spectral densifier is 6.12. Left: summary of the process in the graph. The colors of the vertices define the grouping given by Z. The colors of the squares indicate whether the \(\sigma _{uv}\) are close to 0 (unsaturated constraint) or close to 1 (saturated constraint). Only \(\sigma _{24}\) is unsaturated, since (2, 4) distorts the expectation. Variables corresponding to non-edges linking intra-class vertices are also saturated, namely \(\sigma _{13}\) and \(\sigma _{56}\) (both have a negative deviation). The remaining pairs are unsaturated and are not plotted for the sake of simplicity.

Then, the dual problem of \(\mathbf {P2_{SDP}}\), i.e. the SDP relaxation of \(\mathbf {P1}\), is

$$\begin{aligned} \mathbf {P1_{SDP}}:\quad \max _{x'}\; \sum _{u,v}x'_{uv} \quad \text {s.t.}\;\; \sum _{u,v}x'_{uv}\,b_{uv}b_{uv}^T \preceq C\cdot \left( \sum _{u,v}x'_{uv}\right) L_G,\;\; 0\le x'_{uv}\le 1, \end{aligned}$$
(14)

As in the combinatorial version of densification, we first solve the dual and then the primal. The solution of \(\mathbf {P2}_{SDP}\) provides the \(\sigma _{uv}\) as well as the coordinates of the optimal embedding (in terms of avoiding the collapse of distances) in the columns of Z. In Fig. 2 we explain how the dual solution is obtained for the graph in Fig. 1. We refer to the left-hand side of Eq. 11 as the humility of the embedding. The higher the humility, the lower the maximum weight of the spectral densifier (as in the combinatorial case).
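On small graphs this pipeline can be reproduced with an off-the-shelf SDP solver. A sketch with cvxpy (the modeling mirrors Eq. 13; function name and solver defaults are ours):

```python
import cvxpy as cp
import numpy as np

def solve_P2_sdp(W_G, C):
    """Sketch of P2_SDP: minimize the total slack subject to
    b_uv^T Z b_uv - C * tr(L_G Z) >= 1 - sigma_uv, sigma >= 0, Z PSD."""
    n = W_G.shape[0]
    L_G = np.diag(W_G.sum(axis=1)) - W_G
    Z = cp.Variable((n, n), PSD=True)
    sigma = cp.Variable((n, n), nonneg=True)
    expect = C * cp.trace(L_G @ Z)               # SDP surrogate of the edge expectation
    cons = []
    for u in range(n):
        for v in range(u + 1, n):
            d_uv = Z[u, u] + Z[v, v] - 2 * Z[u, v]   # b_uv^T Z b_uv
            cons.append(d_uv - expect >= 1 - sigma[u, v])
    prob = cp.Problem(cp.Minimize(cp.sum(sigma)), cons)
    prob.solve()                                 # default conic solver, e.g. SCS
    return Z.value, sigma.value
```

An eigendecomposition \(Z=U\varSigma U^T\) of the recovered Gram matrix then yields embedding coordinates \(\varSigma ^{1/2}U^T\), whose columns play the role of the \(z_u\) in Fig. 2.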

Fig. 3.

Densification results for the NIST database.

3 Discussion and Conclusions

With the primal SDP problem \(\mathbf {P1}_{SDP}\) at hand, we have that \(\lambda _i'\le \left( C\cdot \sum _{u,v}x'_{uv}\right) \lambda _i\), where the \(\lambda _i'\) are the eigenvalues of the Laplacian \(L_H = \sum _{u,v}x'_{uv}b_{uv}b_{uv}^T\) associated with the densified graph H. For \(C>1\) we have that densification tends to produce a quasi-complete graph \(\mathcal{K}_n\). Adding the term \(-K\log \det (Z)\) (a log-barrier) to the cost of the dual problem \(\mathbf {P2}_{SDP}\) enforces choices of \(Z\succeq 0\) (i.e. ellipsoids) with maximal volume, which also avoids \(\mathcal{K}_n\). In this way, given a fixed \(K=1000\), the structure of the pattern space emerges as we modify the \(C<1\) bound so that the spectral gap is minimized in such a way that reasonable estimations of the commute distance emerge. In Fig. 3 we summarize some experiments done by subsampling the NIST digit database. Given the densifications (denser in red), the commute time matrix is estimated and the accuracy w.r.t. the ground truth is plotted. Accuracy decreases with the number of classes, and in many cases the optimal value is associated with low values of C. The quality of the results is conditioned by the simplicity of the optimization problem (guided only by a blind cut similarity, which does not necessarily reduce inter-class noise), but it offers a promising path to explore.
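For completeness, the log-barrier variant amounts to a one-line change in the objective of the cvxpy sketch of Sect. 2.2 (reusing its variables Z and sigma; this is our rendering, not the exact experimental code):

```python
# Log-barrier variant: favor maximal-volume ellipsoids Z >> 0 and avoid K_n
K = 1000.0
objective = cp.Minimize(cp.sum(sigma) - K * cp.log_det(Z))
```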