Time-Series Databases
debt(d7/e7/b7/t5)
Closest to 'only careful code review or runtime testing' (d7). The detection_hints show automated=no, and the tools listed (timescaledb, mysql-partitioning, pg-partman) are not static analysis tools but database extensions. The code_pattern describes symptoms (millions of rows, full scans, slow queries) that only manifest at runtime under load. No linter or SAST tool catches 'you should use a time-series database instead of MySQL'; this requires manual architectural review or production performance monitoring.
Closest to 'cross-cutting refactor across the codebase' (e7). While the quick_fix suggests partitioning as a mitigation, the full fix—migrating from a general-purpose database to a time-series database like TimescaleDB—requires schema changes, query rewrites, connection configuration changes, and potentially application-level changes to write/read patterns. This touches multiple files and components across the codebase.
Closest to 'strong gravitational pull' (b7). Database choice is a load-bearing architectural decision that affects the entire application. The applies_to shows this applies across web and cli contexts. Once you've built your metrics/events system on MySQL without time-series features, every query pattern, every dashboard, every retention policy must work around this limitation. The common_mistakes (no retention policy, no downsampling, wrong primary key) show how the choice propagates constraints throughout the system.
Closest to 'notable trap' (t5). The misconception explicitly states developers believe 'MySQL or PostgreSQL is sufficient for time-series data.' This is a documented gotcha that developers eventually learn through experience—general-purpose databases can store time-series data but become 10-100x less efficient at scale. It's not catastrophic (the obvious approach does work at small scale) but it's a significant trap when volumes grow.
Also Known As
TL;DR
Explanation
Time-series databases are optimised for append-only writes with a timestamp as the primary index. Key features: efficient range queries (SELECT WHERE time BETWEEN), downsampling (aggregate minute data to hour data automatically), retention policies (auto-delete data older than 90 days), and compression (repeated timestamps and similar values compress extremely well). Tools: InfluxDB (time-series specific), TimescaleDB (PostgreSQL extension — SQL plus time-series optimisations), Prometheus (pull-based metrics), ClickHouse (column-store for analytics). PHP metrics: use Prometheus PHP client or StatsD to push metrics to InfluxDB/Prometheus.
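To illustrate the range-query and downsampling features together, here is a hedged sketch assuming a TimescaleDB table metrics(time, name, value) like the one in the Code Examples section; 'cpu_load' is a made-up metric name:

-- Downsampled range query: one averaged point per 5 minutes
-- over the last 24 hours, instead of raw per-second rows.
SELECT time_bucket('5 minutes', time) AS bucket, avg(value) AS avg_value
FROM metrics
WHERE name = 'cpu_load'
  AND time > now() - INTERVAL '24 hours'
GROUP BY bucket
ORDER BY bucket;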
Common Misconception
Why It Matters
Common Mistakes
- No data retention policy — time-series tables grow indefinitely without automatic deletion.
- No downsampling — storing raw per-second data when per-minute is sufficient wastes storage.
- Using a general-purpose DB primary key (UUID) instead of timestamp — defeats time-series optimisations.
- Querying raw data for dashboards — always aggregate to an appropriate resolution for the time range.
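A hedged sketch of the fix for the primary-key mistake, assuming TimescaleDB (column names are illustrative): lead with a time-based composite key rather than a random UUID, so inserts append in time order and range scans stay physically local.

-- Time-leading composite key instead of a UUID surrogate key:
CREATE TABLE metrics (
    time TIMESTAMPTZ NOT NULL,
    name TEXT NOT NULL,
    value DOUBLE PRECISION,
    PRIMARY KEY (name, time)  -- supports (name, time-range) lookups directly
);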
Code Examples
-- MySQL metrics table — growing forever, slow range queries:
CREATE TABLE metrics (
    id BIGINT AUTO_INCREMENT PRIMARY KEY,
    name VARCHAR(100),
    value FLOAT,
    timestamp DATETIME,
    INDEX idx_name_time (name, timestamp)
);
-- 1 year of per-second data: 31M rows per metric
-- Range query: 30 seconds for 90-day chart
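If migration is not yet feasible, the partitioning mitigation mentioned above can be sketched in MySQL as follows (a hedged example, not a full design; partition names and dates are illustrative). Note that MySQL requires the partition key to appear in every unique key, so the primary key becomes (id, timestamp):

-- Quick fix: range-partition by month so old data can be dropped cheaply
-- and range queries prune to the relevant partitions.
CREATE TABLE metrics_partitioned (
    id BIGINT AUTO_INCREMENT,
    name VARCHAR(100),
    value FLOAT,
    timestamp DATETIME NOT NULL,
    PRIMARY KEY (id, timestamp),
    INDEX idx_name_time (name, timestamp)
)
PARTITION BY RANGE (TO_DAYS(timestamp)) (
    PARTITION p2024_01 VALUES LESS THAN (TO_DAYS('2024-02-01')),
    PARTITION p2024_02 VALUES LESS THAN (TO_DAYS('2024-03-01')),
    PARTITION pmax VALUES LESS THAN MAXVALUE
);
-- Retention becomes a metadata operation:
-- ALTER TABLE metrics_partitioned DROP PARTITION p2024_01;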
-- TimescaleDB — PostgreSQL with time-series superpowers:
CREATE TABLE metrics (
    time TIMESTAMPTZ NOT NULL,
    name TEXT,
    value DOUBLE PRECISION
);
SELECT create_hypertable('metrics', 'time');
-- Automatic compression after 7 days:
ALTER TABLE metrics SET (
    timescaledb.compress,
    timescaledb.compress_segmentby = 'name'
);
SELECT add_compression_policy('metrics', INTERVAL '7 days');
-- Automatic retention after 90 days:
SELECT add_retention_policy('metrics', INTERVAL '90 days');
-- Continuous aggregate — pre-compute hourly averages:
CREATE MATERIALIZED VIEW metrics_hourly
WITH (timescaledb.continuous) AS
SELECT time_bucket('1 hour', time) AS bucket, name, avg(value) AS avg_value
FROM metrics
GROUP BY bucket, name;
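A continuous aggregate only stays current if a refresh policy is attached; a hedged sketch (the offsets and schedule are illustrative, not prescriptive):

-- Refresh the hourly aggregate every hour, covering buckets
-- between 3 hours and 1 hour old:
SELECT add_continuous_aggregate_policy('metrics_hourly',
    start_offset => INTERVAL '3 hours',
    end_offset => INTERVAL '1 hour',
    schedule_interval => INTERVAL '1 hour');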