Double URL Encoding Bypass

Security PHP 4.0+ Advanced

debt(d7/e5/b5/t9)

d7 Detectability Operational debt — how invisible misuse is to your safety net

Closest to 'only careful code review or runtime testing' (d7). The detection_hints specify semgrep as the tool but mark automated as 'no', meaning even semgrep patterns for $_GET/$_POST/urldecode require manual rule crafting and review. Double encoding issues do not surface as compiler errors, linter warnings, or straightforward static analysis hits — they require understanding the full data-flow from input receipt through filtering to use, and will only reliably manifest during targeted security testing or when an attacker exploits them in production.

e5 Effort Remediation debt — work required to fix once spotted

Closest to 'touches multiple files / significant refactor in one component' (e5). The quick_fix requires multiple coordinated changes: replacing raw $_GET filtering with post-decode validation, adding realpath() normalisation before path checks, ensuring parameterised queries are used, and removing any reliance on encoding-based filtering. This is not a single-line swap — it touches input validation logic, path handling, and potentially WAF integration points across the web-facing layer of the application.

b5 Burden Structural debt — long-term weight of choosing wrong

Closest to 'persistent productivity tax' (b5). applies_to is web context only, which limits scope somewhat, but any PHP web application that handles user-supplied paths or parameters must consistently apply decode-then-validate patterns everywhere. This imposes an ongoing cognitive load on developers — every new input-handling code path must be reviewed for double-encoding susceptibility, and the pattern must be enforced across the codebase rather than in a single location.

t9 Trap Cognitive debt — how counter-intuitive correct behaviour is

Closest to 'catastrophic trap — the obvious way is always wrong' (t9). The misconception field states explicitly: 'URL-decoding input once before filtering is sufficient — attackers double-encode specifically to survive single-decode filters.' This is a perfect t9 scenario — the intuitive, apparently-correct defensive measure (decode once, then filter) is precisely the behaviour that the attack exploits. A competent developer applying reasonable security hygiene will still get this wrong, because the single-decode approach looks correct and passes naive testing.

About DEBT scoring → scored by claude-sonnet-4-6 · 2026-05-11 · reviewed by human

TL;DR

Filters that inspect input after only one round of URL decoding miss double-encoded payloads: %2527 decodes to %27, which decodes to '. Validate only after all decoding is complete.

Explanation

URL encoding represents ' as %27. Double encoding also encodes the % sign (as %25), so ' becomes %2527: the first decode yields %27 and the second yields '. If a WAF or filter decodes once and checks the result, and the application then decodes again, the payload only appears after the check has passed. The technique is common in path traversal (%252e%252e%252f → %2e%2e%2f → ../), XSS (%253Cscript%253E), and SQL injection. In PHP, $_GET and $_REQUEST values are already URL-decoded once, so calling urldecode() on them adds a second decode. Defences: filter only after all decoding is complete, use parameterised queries (values are bound as data, so encoding tricks cannot change the query), validate against an allowlist of permitted characters after normalisation, and use realpath() to resolve paths before checking them.
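The decode chain described above can be reproduced in a few lines. This is a minimal sketch, not a definitive implementation: fully_decode() and its $maxRounds limit are hypothetical names introduced here for illustration, and rawurldecode() is chosen over urldecode() only so that a literal + is left alone.

```php
<?php
// Hypothetical helper: decode repeatedly until the value stops changing.
// Returns null if the value was not confirmed stable within $maxRounds,
// so deeply nested encodings are treated as hostile (an assumption of this sketch).
function fully_decode(string $value, int $maxRounds = 5): ?string
{
    for ($i = 0; $i < $maxRounds; $i++) {
        $decoded = rawurldecode($value);
        if ($decoded === $value) {
            return $value; // stable: nothing left to decode
        }
        $value = $decoded;
    }
    return null;
}

$once  = rawurldecode('%2527'); // "%27": a filter looking for ' finds nothing
$twice = rawurldecode($once);   // "'": the payload appears on the second decode

$path = fully_decode('%252e%252e%252f'); // "../"
```

Decoding until stable also changes legitimate values that happen to contain a literal percent sequence, so in many applications it may be simpler to reject any input that still contains an encoded sequence after PHP's own decode.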

Common Misconception

✗ URL-decoding input once before filtering is sufficient — attackers double-encode specifically to survive single-decode filters.

Why It Matters

Double encoding is a common WAF bypass technique — applications that rely on perimeter filtering without proper input normalisation remain vulnerable.

Common Mistakes

Filtering raw $_GET without checking if values are further encoded.
Path traversal checks that operate before realpath() normalisation.
Trusting WAF filtering without validating at the application layer.
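Application-layer validation, the counterpart to the third mistake above, might look like the following sketch. The function name is_safe_filename() and the exact character set are assumptions for illustration; the point is that an allowlist without %, / or \ rejects any payload that is still encoded as well as one that has already been decoded.

```php
<?php
// Hypothetical validator: accept only simple file names.
// Run it on the value as PHP delivers it ($_GET is already decoded once).
function is_safe_filename(string $name): bool
{
    // First character: letter, digit, underscore or hyphen (no leading dot).
    // Remaining characters may also include dots. "%", "/" and "\" never match,
    // so both "../" and "%2e%2e%2f" are rejected.
    return preg_match('/\A[A-Za-z0-9_-][A-Za-z0-9._-]{0,127}\z/', $name) === 1;
}

$ok      = is_safe_filename('report.pdf');        // true
$encoded = is_safe_filename('%2e%2e%2fpasswd');   // false: contains "%"
$decoded = is_safe_filename('../etc/passwd');     // false: leading dot and "/"
```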

Code Examples

✗ Vulnerable

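// PHP has already decoded $_GET once, so a request for %252e%252e%252f
// arrives here as %2e%2e%2f and passes the check:
if (strpos($_GET['file'], '../') !== false) {
    die('Invalid path');
}
// The extra decode turns %2e%2e%2f into ../ after the check has run:
readfile('/uploads/' . urldecode($_GET['file']));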

✓ Fixed

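$file = $_GET['file'] ?? '';
// Fully normalise before validating: realpath() resolves .. and symlinks,
// and returns false if the target does not exist.
$path = realpath('/uploads/' . $file);
// str_starts_with() requires PHP 8.0+. The trailing slash in '/uploads/'
// prevents sibling directories such as /uploads-old from matching.
if ($path === false || !str_starts_with($path, '/uploads/')) {
    http_response_code(400);
    die('Invalid path');
}
readfile($path);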
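For the SQL injection case, a parameterised query removes the dependence on filtering altogether. The sketch below assumes the pdo_sqlite extension is available; the users table, its rows, and the find_user() function are hypothetical and exist only to make the example self-contained.

```php
<?php
// Hypothetical schema and data in an in-memory SQLite database.
$pdo = new PDO('sqlite::memory:');
$pdo->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);
$pdo->exec('CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)');
$pdo->exec("INSERT INTO users (name) VALUES ('alice'), ('bob')");

function find_user(PDO $pdo, string $name): array
{
    // The value is bound as data; it is never spliced into the SQL text.
    $stmt = $pdo->prepare('SELECT id, name FROM users WHERE name = ?');
    $stmt->execute([$name]);
    return $stmt->fetchAll(PDO::FETCH_ASSOC);
}

// Simulates a value that slipped past a filter and was decoded again:
// the result is the string ' OR '1'='1
$payload = rawurldecode('%27 OR %271%27=%271');

$rows = find_user($pdo, $payload); // compared as a literal name: no match
```

However many times the value was encoded, it can only ever be compared against the name column, so the query structure stays fixed.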

Added 22 Mar 2026

Curated in Warsaw under one editorial standard. 1,506 terms, single voice. About this reference →

⚡ DEV INTEL Tools & Severity

🟠 High ⚙ Fix effort: Medium

⚡ Quick Fix

Use realpath() to normalise all paths before validation. Filter after complete decoding. Use parameterised queries. Never rely on encoding-based filtering alone.

📦 Applies To

PHP 4.0+ web

🔗 Prerequisites

Path Traversal Input Validation vs Output Encoding

🔍 Detection Hints

\$_GET|\$_POST|urldecode

Auto-detectable: ✗ No semgrep

⚠ Related Problems

Path Traversal Local File Inclusion (LFI) Cross-Site Scripting (XSS)

🤖 AI Agent

Confidence: Medium False Positives: Medium ✗ Manual fix Fix: Medium Context: Function Tests: Update

CWE-22 CWE-79 CWE-116

References

https://owasp.org/www-community/Double_Encoding