CommerceTracks

TTT-Discover optimizes GPU kernels 2x faster than human experts — by training during inference

February 5, 2026 Lee

Researchers from Stanford, Nvidia, and Together AI have developed a new technique that can discover new solutions to very complex problems. For example, they managed to optimize a critical GPU kernel to run 2x faster than the previous state-of-the-art written by human experts. Their technique, called “Test-Time Training to Discover” (TTT-Discover), challenges the current paradigm…

OpenAI’s GPT-5.3-Codex drops as Anthropic upgrades Claude — AI coding wars heat up ahead of Super Bowl ads

February 5, 2026 Lee

OpenAI on Thursday released GPT-5.3-Codex, which the company calls its most capable coding agent to date, in an announcement timed to land at the exact same moment Anthropic unveiled its own flagship model upgrade, Claude Opus 4.6. The synchronized launches mark the opening salvo in what industry observers are calling the AI coding wars —…

Anthropic’s Claude Opus 4.6 brings 1M token context and ‘agent teams’ to take on OpenAI’s Codex

February 5, 2026 Lee

Anthropic on Thursday released Claude Opus 4.6, a major upgrade to its flagship artificial intelligence model that the company says plans more carefully, sustains longer autonomous workflows, and outperforms competitors including OpenAI’s GPT-5.2 on key enterprise benchmarks — a release that arrives at a tumultuous moment for the AI industry and global software markets. The…

Beyond the lakehouse: Fundamental’s NEXUS bypasses manual ETL with a native foundation model for tabular data

February 5, 2026 Lee

The deep learning revolution has a curious blind spot: the spreadsheet. While Large Language Models (LLMs) have mastered the nuances of human prose and image generators have conquered the digital canvas, the structured, relational data that underpins the global economy — the rows and columns of ERP systems, CRMs, and financial ledgers — has so…

The ‘brownie recipe problem’: why LLMs must have fine-grained context to deliver real-time results

February 4, 2026 Lee

Today’s LLMs excel at reasoning, but can still struggle with context. This is particularly true in real-time ordering systems like Instacart. Instacart CTO Anirban Kundu calls it the “brownie recipe problem.” It’s not as simple as telling an LLM ‘I want to make brownies.’ To be truly assistive when planning the meal, the model must…

Mistral drops Voxtral Transcribe 2, an open-source speech model that runs on-device for pennies

February 4, 2026 Lee

Mistral AI, the Paris-based startup positioning itself as Europe’s answer to OpenAI, released a pair of speech-to-text models on Wednesday that the company says can transcribe audio faster, more accurately, and far more cheaply than anything else on the market — all while running entirely on a smartphone or laptop. The announcement marks the latest…

Kilo CLI 1.0 brings open source vibe coding to your terminal with support for 500+ models

February 4, 2026 Lee

Remote-first AI coding startup Kilo doesn’t think software developers should have to pledge their undying allegiance to any one development environment — and certainly not any one model or harness. This week, the startup — backed by GitLab co-founder Sid Sijbrandij — unveiled Kilo CLI 1.0, a complete rebuild of its command-line tool that offers…

The hidden tax of “Franken-stacks” that sabotages AI strategies

February 4, 2026 Lee

Presented by Certinia

The initial euphoria around Generative and Agentic AI has shifted to a pragmatic, often frustrated, reality. CIOs and technical leaders are asking why their pilot programs, even those designed to automate the simplest of workflows, aren’t delivering the magic promised in demos. When AI fails to answer a basic question or complete…

Qwen3-Coder-Next offers vibe coders a powerful open source, ultra-sparse model with 10x higher throughput for repo tasks

February 3, 2026 Lee

Chinese e-commerce giant Alibaba’s Qwen team of AI researchers has emerged in the last year as one of the global leaders of open source AI development, releasing a host of powerful large language models and specialized multimodal models that approach, and in some cases, surpass the performance of the proprietary U.S. leaders such as OpenAI,…

Apple integrates Anthropic’s Claude and OpenAI’s Codex into Xcode 26.3 in push for ‘agentic coding’

February 3, 2026 Lee

Apple on Tuesday announced a major update to its flagship developer tool that gives artificial intelligence agents unprecedented control over the app-building process, a move that signals the iPhone maker’s aggressive push into an emerging and controversial practice known as “agentic coding.” Xcode 26.3, available immediately as a release candidate, integrates Anthropic’s Claude Agent and…
