Canonical labelling of random regular graphs
📝 Original Info
- Title: Canonical labelling of random regular graphs
- ArXiv ID: 2602.17567
- Date: 2026-02-19
- Authors: ** 정보 제공되지 않음 (논문에 저자 명시가 없으므로 확인 불가) **
📝 Abstract
We prove that whenever $d=d(n)\to\infty$ and $n-d\to\infty$ as $n\to\infty$, then with high probability for any non-trivial initial colouring, the colour refinement algorithm distinguishes all vertices of the random regular graph $\mathcal{G}_{n,d}$. This, in particular, implies that with high probability $\mathcal{G}_{n,d}$ admits a canonical labelling computable in time $O(\min\{n^ω,nd^2+nd\log n\})$, where $ω<2.372$ is the matrix multiplication exponent.💡 Deep Analysis
📄 Full Content
Colour refinement (CR) is a simple algorithmic routine that operates on vertex-coloured graphs. For an input graph G with initial colouring C 0 : V (G) → Z, CR iteratively computes new colourings. At round t, C t (v) is a pair (C t-1 (v), C t-1 (N (v))), where C t-1 (N (v)) is the multiset of C t-1 -colours of neighbours of v. That is, the process refines the initial partition C 0 and halts once the partition stabilises. Let us call a colouring discrete if every pair of vertices is coloured differently. If CR runs on an uncoloured graph (i.e., there is only one initial colour) and outputs a discrete colouring, then, since the vertex colours are isomorphism-invariant, this yields a canonical labelling by numbering the colour names in the lexicographic order. In [9,19,26], it was proved that CR run on a binomial random graph G(n, p) 1 outputs a discrete colouring with high probability (whp, in what follows) 2 whenever (1 + ε) ln n n < p ≤ 1 2 , which implies a near linear time algorithm 3 for canonical labelling of G(n, p) [13].
We stress that for regular uncoloured graphs G, CR terminates immediately with a trivial colouring, and is therefore unsuitable for canonical labelling. For a positive integer n, we denote [n] := {1, . . . , n}. Let d ≤ n -1 be a non-negative integer such that dn is even. Let G n,d be a uniform distribution over all d-regular graphs on [n]. We write G n ∼ G n,d for a graph sampled from this distribution, i.e. G n is a uniformly random d-regular graph on [n]. Since, for constant d, efficient canonical labelling algorithms are known, we focus on the case d = ω (1). Moreover, since the edge complement of a d-regular graph is (n -1 -d)-regular, the edge complement of G n,n-1-d is distributed as G n,d . Therefore, we may restrict ourselves to d ≤ n/2. Our main result shows that the triviality of the initial colouring is the only obstacle for a complete refinement: once a non-trivial initial colouring is produced, whp CR run on G n ∼ G n,d outputs a discrete colouring which is suitable for canonical labelling.
Before we state the main result of this paper, we need one more definition. For a connected graph G, we denote by diam(G) its diameter.
Theorem 1. Let d 0 be large enough, let d = d(n) be such that d 0 ≤ d ≤ n/2, and let G n ∼ G n,d . Then, the following holds whp: for every non-trivial partition [n] = V 1 ⊔ V 2 of the vertex set of G n , CR runs at most 2 diam(G n ) + 3 steps on G n and outputs a discrete colouring.
Remark 2. We did not try to optimise the bound on the number of rounds, and we believe that 2 diam(G n ) + 3 is suboptimal. In particular, for d ≥ n 1/2+ε , we show that 2 diam(G n ) + 1 = 5 rounds is enough. Actually, it is natural to suspect that the total number of rounds needed is (1 + o diam(Gn) (1)) diam(G n ) whp. We also note that our technique does not directly generalise to small values of d; in particular, there appears to be a natural barrier at d = 3. Nevertheless, we believe that the statement of Theorem 1 can be extended to all 3 ≤ d ≤ n/2.
We now describe a possible approach to canonical labelling of G n ∼ G n,d based on our main result. Since the output of CR can be computed in time O((n + |E(G)|) log n) on an n-vertex graph G [13], Theorem 1 reduces the problem to efficiently finding an isomorphism-preserving partition of [n]. To this end, recall that, for any d = ω(1), whp G n contains a triangle [32]. Let t i be the number of triangles that contain the vertex i ∈ [n] in G n , and let t = max{t 1 , . . . , t n }. Using fast square matrix multiplication, it is possible to compute the vector (t 1 , . . . , t n ) in time n ω , where ω < 2.372 [1]. Alternatively, this vector can be computed in time O(nd 2 ) using the standard triangle-listing algorithm [17], which is faster when d < n 0.685 , assuming the best known upper bound on the matrix multiplication exponent ω (and yields the bound o(n 2 ) for