← Home ← Codex ← DEBT
Browse by Category
+ added · updated 7d
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
AI Fallback Routing 🧠 4
Automatically routing LLM requests to alternative models or providers when the primary fails, times out, or returns unusable output.
2w ago ai_ml intermediate
AI Model Selection Criteria
The systematic factors engineers weigh when choosing an LLM for a task: capability, cost, latency, context window, modality, hosting, and licensing.
2w ago ai_ml intermediate
AI Prompt Versioning
The practice of treating prompts as versioned artifacts — tracking changes, correlating outputs to prompt revisions, and enabling rollback when quality regresses.
3w ago ai_ml intermediate
AI Synthetic Data Generation
Using generative models to produce artificial training, testing, or augmentation data that mimics the statistical properties of real datasets without exposing originals.
3w ago ai_ml intermediate
Prompt Caching
API feature where a static prompt prefix (system instructions, large context) is cached server-side, dramatically reducing cost and latency on repeated calls that share the prefix.
1mo ago ai_ml intermediate
Reasoning Models & Test-Time Compute
A class of LLMs trained to allocate extra inference-time compute to internal reasoning before answering, achieving large gains on math, code, and logic at the cost of latency and tokens.
1mo ago ai_ml intermediate
Diagram: AI Guardrails AI Guardrails 🧠 1
Runtime constraints and safety filters applied around LLM calls to detect, block, or rewrite inputs and outputs that are harmful, off-topic, or policy-violating.
2mo ago ai_ml intermediate
Diagram: AI Observability AI Observability 🧠 2
The practice of monitoring, tracing, and evaluating LLM-powered systems in production — covering latency, token costs, prompt drift, output quality, and failure modes.
2mo ago ai_ml intermediate
Diagram: LLM Temperature & Sampling Strategies LLM Temperature & Sampling Strategies
Parameters that control the randomness and diversity of LLM output — temperature scales token probabilities, while top-p and top-k limit the candidate pool before sampling.
2mo ago ai_ml intermediate
Diagram: Multimodal AI Multimodal AI
AI models that process and generate across multiple input or output modalities — text, images, audio, and video — within a single unified architecture.
2mo ago ai_ml intermediate
LLM Hallucination
When a large language model generates confident-sounding text that is factually incorrect, fabricated, or unsupported by any source — a fundamental property of how language models work.
2mo ago ai_ml intermediate
LLM Streaming Responses PHP 8.0+
Receiving LLM output token-by-token as it is generated rather than waiting for the full response — dramatically improving perceived latency for users and enabling real-time displays of AI-generated content.
2mo ago ai_ml intermediate
RAG — Retrieval-Augmented Generation
An LLM architecture that fetches relevant documents from an external knowledge base before generating a response, grounding answers in retrieved facts rather than training data alone.
2mo ago ai_ml intermediate
Semantic Search
Search that matches by meaning and intent rather than exact keywords — a query for 'how to prevent database attacks' finds SQL injection documentation even if those exact words never appear.
2mo ago ai_ml intermediate
Structured Output from LLMs (JSON Mode) PHP 8.0+
Instructing an LLM to return valid JSON rather than prose — either via a system prompt schema, a JSON mode API flag, or a tool-use response format — so the output can be reliably parsed and used programmatically.
2mo ago ai_ml intermediate
Tokenization in LLMs
The process of splitting text into tokens — subword units that LLMs process — which directly determines context window usage, cost, and model behaviour on non-English and code inputs.
2mo ago ai_ml intermediate
Vector Database
A database optimised for storing and querying high-dimensional vector embeddings, enabling similarity search — finding items semantically close to a query rather than exact-match lookups.
2mo ago ai_ml intermediate
AI API Cost Management 🧠 1
Strategies to reduce LLM API costs — caching identical prompts, batching requests, choosing smaller models for simpler tasks, and minimising context length.
3mo ago ai_ml intermediate
AI-Assisted Code Generation 🧠 5
Using LLMs to generate, complete, or refactor code — powerful for boilerplate and exploration but requiring review for correctness, security, and licence compliance.
3mo ago ai_ml intermediate
LLM Context Window
The maximum amount of text an LLM can process in one call — everything the model sees at once, including system prompt, conversation history, and retrieved context.
3mo ago ai_ml intermediate
✓ schema.org compliant