Tag: alignment
Constitutional AI (CAI)
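Anthropic's training method in which a model critiques and revises its own outputs against a written set of principles (a "constitution"). The revised outputs are used for supervised fine-tuning, followed by reinforcement learning from AI-generated preference labels, which reduces reliance on human labellers for alignment.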
2w ago
ai_ml advanced
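The critique-and-revise step described above can be sketched in a few lines. This is a minimal illustration, not Anthropic's implementation: the function names (`critique_and_revise`, `toy_generate`, `toy_critique`, `toy_revise`) are hypothetical, and the toy stand-ins treat a "principle" as nothing more than a phrase the response should avoid. In a real system each of the three callables would be a call to a language model.

```python
from typing import Callable, List, Optional


def critique_and_revise(
    prompt: str,
    principles: List[str],
    generate: Callable[[str], str],
    critique: Callable[[str, str], Optional[str]],
    revise: Callable[[str, str], str],
) -> str:
    """Draft a response, then check it against each principle in turn.

    Whenever the critic returns feedback, the draft is rewritten before
    the next principle is checked.
    """
    draft = generate(prompt)
    for principle in principles:
        feedback = critique(draft, principle)
        if feedback is not None:
            draft = revise(draft, feedback)
    return draft


# Toy stand-ins (assumptions for illustration only).
def toy_generate(prompt: str) -> str:
    return f"Answer to '{prompt}': obviously, you fool, just restart it."


def toy_critique(draft: str, principle: str) -> Optional[str]:
    # Here a "principle" is simply a phrase the response should not contain.
    return principle if principle in draft else None


def toy_revise(draft: str, feedback: str) -> str:
    # Remove the offending phrase and collapse any doubled whitespace.
    return " ".join(draft.replace(feedback, "").split())


revised = critique_and_revise(
    "How do I fix my router?",
    ["obviously,", "you fool,"],
    toy_generate,
    toy_critique,
    toy_revise,
)
print(revised)
```

In the supervised phase, pairs of (prompt, revised response) like the one produced here would be collected as fine-tuning data.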
RLHF — Reinforcement Learning from Human Feedback
Post-training method in which human preference rankings over model outputs are used to train a reward model. The LLM is then fine-tuned with reinforcement learning to maximize that reward, aligning its outputs with human preferences.
2w ago
ai_ml advanced
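The reward-model stage can be illustrated with a pairwise preference loss of the Bradley-Terry form, a common choice for this step: for each comparison, minimize -log(sigmoid(r(chosen) - r(rejected))). The sketch below is a toy under stated assumptions: the reward model is linear, each response is reduced to two hand-made features (hypothetically, helpfulness and rudeness), and the reinforcement learning stage that follows is not shown.

```python
import math
from typing import List, Tuple

Pair = Tuple[List[float], List[float]]  # (chosen features, rejected features)


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def reward(w: List[float], feats: List[float]) -> float:
    """Linear reward model: a weighted sum of response features."""
    return sum(wi * fi for wi, fi in zip(w, feats))


def pairwise_loss(w: List[float], chosen: List[float], rejected: List[float]) -> float:
    """Negative log-likelihood that the chosen response outranks the rejected one."""
    return -math.log(sigmoid(reward(w, chosen) - reward(w, rejected)))


def train_reward_model(pairs: List[Pair], dim: int, lr: float = 0.5, epochs: int = 200) -> List[float]:
    w = [0.0] * dim
    for _ in range(epochs):
        for chosen, rejected in pairs:
            margin = reward(w, chosen) - reward(w, rejected)
            # Gradient of the loss w.r.t. w is -(1 - sigmoid(margin)) * (chosen - rejected),
            # so a descent step adds lr * (1 - sigmoid(margin)) * (chosen - rejected).
            scale = 1.0 - sigmoid(margin)
            w = [wi + lr * scale * (c - r) for wi, c, r in zip(w, chosen, rejected)]
    return w


# Hypothetical features: [helpfulness, rudeness]. Labellers preferred the first item in each pair.
pairs: List[Pair] = [
    ([1.0, 0.0], [0.2, 1.0]),
    ([0.8, 0.1], [0.9, 0.9]),
]

weights = train_reward_model(pairs, dim=2)
for chosen, rejected in pairs:
    print(reward(weights, chosen) > reward(weights, rejected))
```

Once trained, a reward model of this kind scores new responses, and that score is the signal the reinforcement learning step optimizes.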