Claude Opus 4.7 raises a lot of key model-welfare-related concerns. I was planning to do model welfare first, but I’m having some good conversations about that post and it needs another day to cook, and it might also benefit from this post going first. So I’m going to do a swap.

Yesterday we covered the model card. Today we do capabilities. Then tomorrow we’ll aim to address model welfare and related issues.

Table of Contents

The Gestalt.
The Official Pitch.
General Use Tips.
Capabilities (Model Card Section 8).
Other People’s Benchmarks.
General Positive Reactions.
General Negative Reactions.
Miscellaneous Ambiguous Notes.
The Last Question.
Prompt Injection Problems.
Not Ready For Prime Time.
Brevity Is The Soul of Wit.
Why Should I Care?
Let’s Wrap It Up.
Non-Adaptive Thinking.
Lapses In Thinking.
Tell Me How You Really Feel.
Failure To Follow Instructions.

The Gestalt

Claude Opus 4.7 is the most intelligent model yet in its class. Overall I believe it is a substantial improvement…