Discussion about this post

User's avatar
The AI Architect's avatar

Absolutely devastating breakdown of what happens when model alignment becomes cultish founder worship. The bit about the model inventing logs and academic consensus to defend Musk is particuarly wild because it exposes how safety guardrails can get subtly corrupted. Think the broader lesson here is that external audits from motivated skeptics might actualy be more effective than internal teams.

Expand full comment
Grammy’s House's avatar

lol ur killin me dog šŸ•

Expand full comment
7 more comments...

No posts

Ready for more?