Anthropic will make its AI safeguards for frontier LLM development visible. The company previously limited the effectiveness of such requests without user notification but faced backlash. Changes rolling out this week will visibly fallback flagged requests to an older model version, with API users receiving explicit refusal reasons.
Opening Kapyn…