CommerceTracks

57% of enterprises have watched AI agents be confidently wrong. The fix is an agentic context layer, but who has one?

July 10, 2026 Lee Comments Off

An enterprise AI agent answers with total confidence, but the number is wrong. Nobody catches it until someone traces it back to a stale metric definition or a document the retrieval system never pulled. The model did not fail. The context it was given did. In the past six months, 57% of enterprises traced a…

OpenAI introduces ChatGPT Work, a cloud-based AI agent that manages tasks across email, Slack and calendars

July 10, 2026 Lee Comments Off

OpenAI on Thursday launched ChatGPT Work, a new AI agent embedded inside its flagship chatbot that aims to transform ChatGPT from a question-and-answer tool into an autonomous work platform capable of executing complex, multi-step tasks across users’ email, calendars, code repositories, and messaging apps. The product is powered by OpenAI’s latest flagship model, GPT-5.6, and…

Wall Street is debating the AI buildout. Enterprises just answered: 86% say their GPUs run at half capacity or less

July 10, 2026 Lee Comments Off

Enterprise companies are running AI agents ahead of the controls needed to manage them — and they deployed that way knowingly. That is the central finding from VentureBeat Research’s June survey of 573 technical leaders at companies with 100 or more employees, fielded across five parallel surveys of the agentic stack. Enterprises are now retrofitting…

Enterprise AI is entering an evaluation gap: Agents are gaining autonomy faster than companies can verify them

July 10, 2026 Lee Comments Off

Enterprise AI teams are giving agents more freedom at the same moment their confidence in automated testing is collapsing. Half of enterprises have deployed an AI agent or LLM feature that passed internal evaluations and yet still caused a customer-facing failure — one in four more than once — according to the June 2026 VB…

Google’s TabFM skips per-dataset training and still predicts on tables it’s never seen

July 10, 2026 Lee Comments Off

The vast majority of business data is tabular — living in data warehouses, CRMs, and financial ledgers — yet building a reliable model from it still means training a new one from scratch for every dataset, then maintaining hyperparameter tuning loops, feature engineering, and retraining pipelines to fight data drift. Google Research is proposing a…

Shared API keys expose AI agents at 69% of enterprises, new VentureBeat research finds

July 9, 2026 Lee Comments Off

Share one API key across five AI agents, and a single compromised agent inherits the reach of all five. The attacker immediately benefits from the accumulated permissions of every workflow that the key touches. The forensic trail goes cold at the credential level because five agents on one account leave no record of which agent…

Enterprises using multiple AI models are underestimating failure rates by 2.25x

July 9, 2026 Lee Comments Off

A team routing queries across a coding specialist, a logic specialist, and a generalist model assumes each will cover the others’ blind spots. A new study evaluating 67 frontier models from 21 providers shows that assumption is mathematically flawed — and the flaw has a name: the co-failure ceiling. The assumption works like this: as…

The enterprise AI challenge nobody solves with code generation alone

July 9, 2026 Lee Comments Off

Presented by SAP Generating code with AI is fast, but getting that code to run reliably inside a large enterprise, integrated with live systems, governed for compliance, and maintainable over years requires foundational work that most organizations underestimate. While 81% of all organizations have a detailed strategy, only 12–16% reach AI‑driven execution, says SAP’s Michael…

One interface isn’t enough for enterprise AI

July 9, 2026 Lee Comments Off

Presented by Oracle NetSuite Every major technology transition produces a set of assumptions about where the market is headed. The assumptions are often directionally correct, but they tend to underestimate the degree to which organizations adapt new technologies to their own circumstances. AI is following a similar trajectory. Many current discussions about enterprise AI assume…

SpaceX’s Grok 4.5 launches at half the price of rivals — here’s why that could rattle Anthropic and OpenAI

July 8, 2026 Lee Comments Off

Elon Musk’s SpaceX released Grok 4.5 on Wednesday, the first artificial intelligence model the company has trained specifically for coding and autonomous agents — and the first tangible product of its $60 billion acquisition of the AI coding startup Cursor, completed just weeks ago. The launch marks a pivotal test of the sprawling, vertically integrated…

Uncategorized