CommerceTracks

Surprise upset: GPT-5.5 beats Claude Fable 5 on brutal new Agents’ Last Exam benchmark

June 10, 2026 Lee 0 Comments

Researchers from the University of California, Berkeley’s Center for Responsible, Decentralized Intelligence (RDI), alongside an advisory committee of over 300 domain experts, have launched Agents’ Last Exam (ALE)—a grueling new benchmark built to measure whether artificial intelligence can actually execute economically valuable, long-horizon professional workflows. In a shocking upset, OpenAI’s GPT-5.5 from April, operating through…

Researchers say they trained a foundation model from scratch for about $1,500

June 10, 2026 Lee 0 Comments

Training a foundation LLM from scratch costs millions and requires internet-scale data — which is why most enterprises don’t bother. Sapient thinks it has a cheaper path. To overcome this brute-force scaling dogma, researchers at Sapient developed HRM-Text, which replaces standard Transformers with a highly sample-efficient Hierarchical Recurrent Model (HRM), an architecture they first introduced…

Anthropic CEO calls for FAA-style regulation of powerful AI models: what enterprises should know

June 10, 2026 Lee 0 Comments

In a sweeping new essay titled “Policy on the AI Exponential,” Anthropic co-founder and CEO Dario Amodei publicly calls for new government regulations governing the release of powerful AI models — specifically comparing AI industry to commercial aviation, which follows regulations enforced by the U.S. Federal Aviation Administration (FAA) — arguing that this is necessary…

MassMutual’s AI strategy: 12-month contracts, 30% productivity gains, zero lock-in

June 10, 2026 Lee 0 Comments

Enterprise AI teams face a dilemma: The best models today might not be the best models a year from now. MassMutual’s answer is to stop making long-term bets — and build infrastructure that can swap models as the market shifts. “The world of AI today is extremely dynamic,” Sears Merritt, MassMutual CIO, explained in a…

Day: June 10, 2026

Surprise upset: GPT-5.5 beats Claude Fable 5 on brutal new Agents’ Last Exam benchmark

Researchers say they trained a foundation model from scratch for about $1,500

Anthropic CEO calls for FAA-style regulation of powerful AI models: what enterprises should know

MassMutual’s AI strategy: 12-month contracts, 30% productivity gains, zero lock-in