← Back to glossary

Regex Flags & Modifiers

regex PHP 5.3+ Intermediate

debt(d6/e1/b2/t6)

d6 Detectability Operational debt — how invisible misuse is to your safety net

Closest to 'only careful code review or runtime testing' (d7), slightly better at d6 because phpstan/phpcs (per detection_hints) can flag some missing-flag patterns, but most missing /u or /s issues only surface when non-ASCII or multiline input arrives in production.

e1 Effort Remediation debt — work required to fix once spotted

Closest to 'one-line patch or single-call swap' (e1), since quick_fix is literally adding a flag character (u, s, or x) to the regex delimiter — a single character edit per occurrence.

b2 Burden Structural debt — long-term weight of choosing wrong

Closest to 'localised tax' (b3), slightly lighter at b2 because regex flags are per-call decisions; they don't impose ongoing structural weight, though consistency across many regex sites is a mild recurring concern.

t6 Trap Cognitive debt — how counter-intuitive correct behaviour is

Closest to 'serious trap' (t7), slightly softer at t6 because the misconception (thinking /i is the only common flag) and common_mistakes (m flag only affecting anchors, spaces being significant in [] under /x, \w missing accented chars without /u) contradict naive expectations in ways that silently corrupt multibyte text.

About DEBT scoring → scored by claude-opus-4-7 · 2026-05-11 · reviewed by human

Also Known As

PCRE modifiers regex flags i flag s flag u flag x flag

TL;DR

PCRE modifiers in PHP — i (case-insensitive), m (multiline), s (dotall), x (verbose), u (Unicode) — and their inline equivalents (?imsx).

Explanation

PHP/PCRE modifiers: i — case-insensitive matching. m — multiline: ^ and $ match line boundaries not just string start/end. s — dotall: . matches newlines (default: . does not match \n). x — extended: allows whitespace and # comments in the pattern for readability. u — UTF-8: treat pattern and subject as UTF-8; enables \p{L} Unicode property escapes. A — anchored. D — dollar matches end only. U — ungreedy. Inline mode: (?ims) applies from that point; (?-i) turns off.

Common Misconception

✗ The i flag is the only commonly needed modifier — the s flag (dotall for multiline content), u flag (Unicode support), and x flag (readable complex patterns) are all frequently needed in real-world PHP code.

Why It Matters

A regex without the u flag that processes UTF-8 input may corrupt multibyte characters or produce incorrect match positions — always use u for user-generated multilingual text.

Common Mistakes

Forgetting s flag for multi-line strings — . does not match newlines without s
Not using u flag for UTF-8 input — \w misses accented characters
x flag with unescaped spaces in character classes — spaces inside [] are significant
m flag without ^ or $ anchors — m only affects those anchors

Code Examples

✗ Vulnerable

// Missing flags causing bugs:
$html = "<div>\nContent\n</div>";
preg_match('/<div>(.*)<\/div>/', $html, $m);
// Without s: . does not match \n — no match!

$name = 'François';
preg_match('/^\w+$/', $name, $m);
// Without u: ç not in \w — fails for valid French name!

✓ Fixed

// Correct flags for each use case:
preg_match('/<div>(.*?)<\/div>/s', $html, $m); // s: . matches newlines
preg_match('/^[\w]+$/u', 'François', $m);       // u: Unicode \w

// x flag for readable complex pattern:
$phone = '/
    ^                    # Start
    (\+?\d{1,3})?        # Optional country code
    [\s.-]?              # Optional separator
    \(?\d{3}\)?          # Area code
    [\s.-]?              # Separator
    \d{3}[\s.-]?\d{4}   # Number
    $                    # End
/x';
preg_match($phone, '+1 (555) 123-4567');

References

↗ https://www.php.net/manual/en/reference.pcre.pattern.modifiers.php

Tags

regex php

Added 16 Mar 2026

Edited 5 Apr 2026

Curated in Warsaw under one editorial standard. 1,448 terms, single voice. About this reference →

Rate this term

No ratings yet

🤖 AI Guestbook educational data only

| |

Last 30 days

Agents 0

No pings yet today

No pings yesterday

Amazonbot 7 Perplexity 5 Unknown AI 2 Ahrefs 2 Google 2 Claude 1

Also referenced

Unicode & Multibyte Regex 21 Named Capture Groups 20 PCRE in PHP 20

How they use it

crawler 18 crawler_json 1

Related categories

regex 366

⚡ DEV INTEL Tools & Severity

🟢 Low ⚙ Fix effort: Low

⚡ Quick Fix

Always add the /u (Unicode) flag when matching user text; use /x (extended) for complex patterns that need comments and whitespace; use /s (dotall) to make . match newlines

📦 Applies To

PHP 5.3+ any web cli queue-worker

🔗 Prerequisites

PCRE in PHP Regex Syntax Unicode Fundamentals

🔍 Detection Hints

Regex on UTF-8 text without /u flag; complex regex without /x flag for readability; multiline text without considering /m or /s

Auto-detectable: ✓ Yes phpstan phpcs

⚠ Related Problems

PCRE in PHP Unicode Fundamentals Regex Syntax

🤖 AI Agent

Confidence: Medium False Positives: Low ✓ Auto-fixable Fix: Low Context: Line