
B-Trees & B+ Trees

Data Structures Advanced

debt(d7/e5/b5/t7)

d7 Detectability Operational debt — how invisible misuse is to your safety net

Closest to 'only careful code review or runtime testing' (d7) — EXPLAIN plans (db_explain_advanced) reveal index misuse, but it requires a human reading query plans; no automated linter flags bad index design or column ordering.

e5 Effort Remediation debt — work required to fix once spotted

Closest to 'touches multiple files / significant refactor' (e5) — fixing index design (reordering composite columns, switching from UUID v4 to ordered IDs) involves schema migrations, possibly application code changes, and reindexing large tables, not a one-line patch.

b5 Burden Structural debt — long-term weight of choosing wrong

Closest to 'persistent productivity tax' (b5) — index strategy applies across web/cli contexts and shapes query performance everywhere; poor choices (UUID PKs, over-indexing write-heavy tables) impose ongoing costs on many work streams.

t7 Trap Cognitive debt — how counter-intuitive correct behaviour is

Closest to 'serious trap' (t7) — the misconception conflates BSTs with B-trees, and the leftmost-prefix rule plus range-before-equality gotcha contradicts intuition; developers reasonably expect indexes to 'just work' regardless of column order.

About DEBT scoring → scored by claude-opus-4-7 · 2026-05-06 · reviewed by human

Also Known As

B-tree · B+ tree · database index structure · InnoDB index

TL;DR

Self-balancing tree structures used in database indexes. Each node holds many keys, which keeps the tree shallow and minimises disk I/O for both point lookups and range queries.

Explanation

A B-Tree stores sorted keys in nodes that each have many children. B+ Trees, the variant most databases use, store all data in leaf nodes; internal nodes hold only keys for routing. Leaf nodes are linked to their neighbours, so a range scan can walk from leaf to leaf without returning to the root. Properties: balanced (all leaves at the same depth), fan-out in the hundreds (each node holds many keys), and O(log n) search, insert, and delete. InnoDB, PostgreSQL, and most other databases use B+ Tree variants for their default indexes. With a fan-out of several hundred, even a billion-row table has a tree only 3-4 levels deep.
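As a rough check on that depth figure, assume every node has the same fan-out f and the index holds N keys. This is a simplification, since real pages vary in fill and key size. The height h is then approximately:

```latex
h \approx \left\lceil \log_f N \right\rceil

\begin{aligned}
f = 100:  &\quad \log_{100} 10^{9} = 4.5          &&\Rightarrow h = 5 \\
f = 500:  &\quad \log_{500} 10^{9} \approx 3.33   &&\Rightarrow h = 4 \\
f = 1000: &\quad \log_{1000} 10^{9} = 3           &&\Rightarrow h = 3
\end{aligned}
```

Under this assumption, a billion keys fit in 3-4 levels once the fan-out reaches a few hundred; a fan-out of exactly 100 would need a fifth level.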

Common Misconception

✗ Binary search trees and B-trees are the same. A BST node has at most 2 children, whereas a B-tree node has hundreds, which keeps the tree extremely shallow and well suited to page-based disk storage.

Why It Matters

Understanding B+ Trees explains why a database index on a billion-row table still returns in milliseconds (3-4 disk reads), why composite index column order matters (leftmost prefix), and why random writes fragment the B-tree.
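The leftmost-prefix rule can be sketched with a hypothetical `orders` table. MySQL 8.0 syntax is assumed, and the table, column, and index names are invented for illustration. Real plans depend on table size and statistics, so the comments describe what is typically seen, not guaranteed output.

```sql
-- Hypothetical table, used only for illustration.
CREATE TABLE orders (
    id          BIGINT UNSIGNED AUTO_INCREMENT PRIMARY KEY,
    customer_id BIGINT UNSIGNED NOT NULL,
    status      VARCHAR(16)     NOT NULL,
    created_at  DATETIME        NOT NULL,
    -- Index entries are sorted by customer_id first, then by created_at.
    KEY idx_customer_created (customer_id, created_at)
);

-- Filter starts at the leftmost column: the B-tree can seek directly.
EXPLAIN SELECT * FROM orders WHERE customer_id = 42;

-- Equality on the first column, range on the second: both columns are used.
EXPLAIN SELECT * FROM orders
WHERE customer_id = 42 AND created_at >= '2026-01-01';

-- Leftmost column skipped: matching entries are scattered across the index,
-- so there is no single position to seek to. Expect a full scan (type = ALL)
-- in most cases; MySQL 8.0.13+ may occasionally choose a skip scan instead.
EXPLAIN SELECT * FROM orders WHERE created_at >= '2026-01-01';
```

In the EXPLAIN output, the `key` and `key_len` columns show which index was chosen and how many bytes of it were usable for the seek.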

Common Mistakes

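Adding indexes on low-cardinality columns: a boolean column has only 2 values, so the B-tree rarely narrows the search enough to beat a scan (a heavily skewed distribution is the main exception).
Not realising that a full-table scan can be faster than an index when a query returns a large share of the table, because each index match may cost an extra random read.
UUID v4 primary keys causing B-tree fragmentation: random inserts land on arbitrary leaf pages and split them constantly.
Too many indexes on a write-heavy table: every insert and delete must update every B-tree, and every update must maintain each index that covers a changed column.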
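Two of these mistakes can be checked from SQL. The sketch below assumes MySQL 5.7 or later with the bundled `sys` schema and Performance Schema enabled; the `users` table, its `is_active` column, and the `app_db` schema name are hypothetical placeholders.

```sql
-- Hypothetical table with a low-cardinality column.
CREATE TABLE IF NOT EXISTS users (
    id        BIGINT UNSIGNED AUTO_INCREMENT PRIMARY KEY,
    email     VARCHAR(255) NOT NULL,
    is_active TINYINT(1)   NOT NULL
);

-- Selectivity = distinct values / rows. Values close to 0 mean an index on
-- this column alone is unlikely to help. Returns NULL on an empty table.
SELECT COUNT(DISTINCT is_active) / COUNT(*) AS selectivity
FROM users;

-- Indexes with no recorded reads since the server last started.
-- Candidates for removal on write-heavy tables, after manual review.
SELECT object_schema, object_name, index_name
FROM sys.schema_unused_indexes
WHERE object_schema = 'app_db';

-- Indexes whose columns are already a prefix of another index.
SELECT table_name, redundant_index_name, dominant_index_name
FROM sys.schema_redundant_indexes
WHERE table_schema = 'app_db';
```

The unused-index view only reflects activity since the last restart, so a short uptime can make a rarely used but important index look idle.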

Code Examples

✗ Vulnerable

-- B-tree fragmented by a random (UUID v4) primary key:
CREATE TABLE events (
    id CHAR(36) PRIMARY KEY,  -- UUID v4 generated by the application (random)
    data JSON
);
-- Note: MySQL's built-in UUID() returns a version 1 UUID, not v4
-- Every INSERT goes to a random leaf position
-- Constant page splits leave pages partly empty
-- Fragmentation grows over time

✓ Fixed

-- B-tree stays ordered with UUID v7 (time-sorted):
CREATE TABLE events (
    id BINARY(16) PRIMARY KEY,  -- UUID v7: timestamp-prefixed
    data JSON
);
-- Inserts always append near the rightmost leaf
-- Minimal fragmentation
-- Same O(log n) lookup, better write performance

-- Check fragmentation:
SHOW TABLE STATUS LIKE 'events';  -- Data_free: allocated but unused bytes, a rough fragmentation indicator
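MySQL has no built-in UUID v7 generator, so the value is assumed to come from the application. A minimal sketch of writing and reading such a key with the MySQL 8.0 functions `UUID_TO_BIN()` and `BIN_TO_UUID()` follows; the UUID literal is a made-up example value, not real data.

```sql
-- Same shape as the fixed table above; created here so the sketch runs alone.
CREATE TABLE IF NOT EXISTS events (
    id   BINARY(16) PRIMARY KEY,
    data JSON
);

-- Convert the 36-character text form to 16 bytes on the way in.
-- No byte swapping is requested: a v7 value already starts with its timestamp.
INSERT INTO events (id, data)
VALUES (
    UUID_TO_BIN('018f4e6a-7b3c-7d21-9a4f-3c5e8b2a1d07'),
    JSON_OBJECT('type', 'signup')
);

-- Convert back to text for display; the lookup itself uses the binary key.
SELECT BIN_TO_UUID(id) AS id, data
FROM events
WHERE id = UUID_TO_BIN('018f4e6a-7b3c-7d21-9a4f-3c5e8b2a1d07');
```

Storing 16 bytes instead of 36 characters also makes every index entry smaller, which raises fan-out.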

Tags

data-structures database performance

Added 16 Mar 2026

Edited 22 Mar 2026

Curated in Warsaw under one editorial standard. 1,506 terms, single voice. About this reference →


⚡ DEV INTEL Tools & Severity

🔵 Info ⚙ Fix effort: Medium

⚡ Quick Fix

Understanding B-tree internals explains why composite index column order matters and why putting equality columns before range columns usually lets the index narrow the search further
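The equality-before-range guideline can be compared directly by forcing each column order in turn. This is a sketch against a hypothetical `payments` table (MySQL 8.0 syntax assumed; all names are invented), and actual row estimates depend on the data.

```sql
-- Hypothetical table with the same two columns indexed in both orders.
CREATE TABLE payments (
    id      BIGINT UNSIGNED AUTO_INCREMENT PRIMARY KEY,
    status  VARCHAR(16)   NOT NULL,
    paid_at DATETIME      NOT NULL,
    amount  DECIMAL(10,2) NOT NULL,
    KEY idx_range_first    (paid_at, status),
    KEY idx_equality_first (status, paid_at)
);

-- Range column first: the seek is bounded by paid_at only, so every entry
-- in the date range is read and status is checked afterwards.
EXPLAIN SELECT * FROM payments FORCE INDEX (idx_range_first)
WHERE status = 'failed' AND paid_at >= '2026-03-01';

-- Equality column first: the seek starts at ('failed', '2026-03-01') and
-- reads one contiguous run of entries that all match both conditions.
EXPLAIN SELECT * FROM payments FORCE INDEX (idx_equality_first)
WHERE status = 'failed' AND paid_at >= '2026-03-01';
```

On populated data, the second plan would be expected to show a larger `key_len` (both columns used for the seek) and a smaller `rows` estimate. `FORCE INDEX` is used here only to make the comparison explicit.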

📦 Applies To

any web cli

🔗 Prerequisites

Database Indexing · Binary Trees · Big-O Notation

🔍 Detection Hints

Composite index with range column before equality column; queries not using index despite its existence

Auto-detectable: ✗ No · mysql-workbench · db_explain_advanced

⚠ Related Problems

Database Indexing · Composite Index Design · Query Optimisation

🤖 AI Agent

Confidence: Low · False Positives: High · ✗ Manual fix · Fix: High · Context: File

References

https://en.wikipedia.org/wiki/B%2B_tree