The AI firm Anthropic recently shared a detailed description of the various capabilities and safety risks of Claude Mythos Preview, a new AI model (Anthropic’s “most capable frontier model to date”). If you’ve heard anything about Claude Mythos Preview, it may be that its cybersecurity/hacking skills capabilities are so powerful that Anthropic decided not to release the model to the general public. But there’s a lot in the 245-page document besides that, including a couple of items about phil...
No comments yet. Log in to reply on the Fediverse. Comments will appear here.