← Back to glossary

Disjoint Set / Union-Find

data_structures Advanced

Also Known As

Union-Find disjoint set union DSU

TL;DR

A data structure tracking which elements belong to the same group — supporting near-O(1) union and find operations. Used for network connectivity, Kruskal's MST, and cycle detection.

Explanation

Disjoint Set (Union-Find) maintains a forest of trees where each tree represents a set. find(x) returns the root of x's tree (the set representative). union(x, y) merges the two sets. Optimisations: path compression (flatten tree on find), union by rank (always attach smaller tree to larger). Combined: near-O(1) amortised — technically O(α(n)) where α is the inverse Ackermann function, practically constant. Applications: Kruskal's Minimum Spanning Tree, network connectivity (are A and B in the same network?), image segmentation, and cycle detection in undirected graphs.

Common Misconception

✗ Disjoint Set requires O(n) per operation — without optimisations yes; with path compression and union by rank, amortised O(α(n)) which is effectively O(1) for all practical inputs.

Why It Matters

Kruskal's MST algorithm using Union-Find runs in O(E log E) instead of O(E * V) — the difference between seconds and hours for large graphs.

Common Mistakes

Not implementing path compression — find() is O(n) without it.
Not using union by rank — trees become linked lists without it.
Using for directed graphs — Union-Find tracks connected components in undirected graphs only.
Forgetting that find() modifies the structure (path compression) — not purely read-only.

Code Examples

✗ Vulnerable

// No optimisations — O(n) find:
class NaiveUnionFind {
    private array $parent;
    public function find(int $x): int {
        while ($this->parent[$x] !== $x) {
            $x = $this->parent[$x]; // Walk up — O(n) in worst case
        }
        return $x;
    }
}

✓ Fixed

// Path compression + union by rank — O(alpha n) amortised:
class UnionFind {
    private array $parent;
    private array $rank;

    public function __construct(int $n) {
        $this->parent = range(0, $n - 1);
        $this->rank   = array_fill(0, $n, 0);
    }

    public function find(int $x): int {
        if ($this->parent[$x] !== $x)
            $this->parent[$x] = $this->find($this->parent[$x]); // Path compression
        return $this->parent[$x];
    }

    public function union(int $x, int $y): void {
        $rx = $this->find($x); $ry = $this->find($y);
        if ($rx === $ry) return;
        if ($this->rank[$rx] < $this->rank[$ry]) [$rx, $ry] = [$ry, $rx];
        $this->parent[$ry] = $rx;
        if ($this->rank[$rx] === $this->rank[$ry]) $this->rank[$rx]++;
    }
}

References

↗ https://en.wikipedia.org/wiki/Disjoint-set_data_structure

Tags

data-structures algorithms graphs

Added 16 Mar 2026

Edited 22 Mar 2026

Curated in Warsaw under one editorial standard. 1,444 terms, single voice. About this reference →

Rate this term

No ratings yet

🤖 AI Guestbook educational data only

| |

Last 30 days

Agents 0

No pings yet today

No pings yesterday

Amazonbot 8 Perplexity 6 Ahrefs 2 Google 1 Unknown AI 1

Also referenced

Graph Algorithms 38 Big-O Notation 38 Graphs 33

How they use it

crawler 18

Related categories

algorithms 1.6k data_structures 445

⚡ DEV INTEL Tools & Severity

🟢 Low ⚙ Fix effort: Medium

⚡ Quick Fix

Use a disjoint set (union-find) when you need to efficiently determine if two elements are in the same connected component — used in Kruskal's MST and cycle detection

📦 Applies To

any web cli

🔗 Prerequisites

Graph Algorithms Big-O Notation Graphs

🔍 Detection Hints

Cycle detection in graph using DFS where union-find would be more efficient; connected component check with repeated BFS traversals

Auto-detectable: ✗ No

⚠ Related Problems

Graph Algorithms minimum spanning tree Big-O Notation

🤖 AI Agent

Confidence: Low False Positives: Medium ✗ Manual fix Fix: Medium Context: Class Tests: Update