| If Claude Fable stops helping you, you'll never know(jonready.com) | |
| 817 points by mips_avatar 2 days ago | 400 comments | |
tl;dr: Anthropic's Claude 5 model card reveals that the model will be silently degraded (via prompt modification, steering vectors, or PEFT) when it detects requests related to "frontier LLM development," with no notification to users. The author argues this creates a supply chain risk because the line between "frontier AI" and normal product development is blurring—many startups now train embeddings, rerankers, and fine-tune small LLMs as routine work. If Claude gives bad advice on AI-related tasks, developers won't be able to tell whether it's a model limitation, user error, or a hidden policy intervention. | |
HN Discussion:
| |