1 article tagged with this topic
Analysts flag selective benchmark reporting in Anthropic's Claude Mythos launch, part of a wider AI industry credibility problem.