Uncategorized

Huawei’s new open source technique shrinks LLMs to make them run on less powerful, less expensive hardware

Huawei’s Computing Systems Lab in Zurich has introduced a new open-source quantization method for large language models (LLMs) aimed at reducing memory demands without sacrificing output quality. The technique, called SINQ (Sinkhorn-Normalized Quantization), is designed to be fast, calibration-free, and easy to integrate into existing model workflows. The code for performing it has been made…

Read More

OpenAI’s DevDay 2025 preview: Will Sam Altman launch the ChatGPT browser?

OpenAI will host more than 1,500 developers at its largest annual conference on Monday, as the company behind ChatGPT seeks to maintain its edge in an increasingly competitive artificial intelligence landscape. The third annual DevDay conference at San Francisco’s Fort Mason represents a critical moment for OpenAI, which has seen its dominance challenged by rapid…

Read More

New AI training method creates powerful software agents with just 78 examples

A new study by Shanghai Jiao Tong University and SII Generative AI Research Lab (GAIR) shows that training large language models (LLMs) for complex, autonomous tasks does not require massive datasets. Their framework, LIMI (Less Is More for Intelligent Agency), builds on similar work in other areas of LLM research and finds that “machine autonomy…

Read More

Google’s Jules coding agent moves beyond chat with new command line and API

Google wants its coding assistant, Jules, to be far more integrated into developers’ terminals than ever. The company wants to make it a more workflow-native tool, hoping that more people will use it beyond the chat interface.  Jules, which the company first announced in December 2024, will gain two new features: a Jules API to…

Read More

Salesforce launches AI ‘trust layer’ to tackle enterprise deployment failures plaguing 80% of projects

Salesforce Inc. is expanding its artificial intelligence platform with new data management and governance capabilities, aiming to address what the company says is a crisis in enterprise AI adoption where more than 80% of projects fail to deliver meaningful business value. The San Francisco-based software giant announced Thursday a suite of new tools designed to…

Read More

Software commands 40% of cybersecurity budgets as gen AI attacks execute in milliseconds

Software spending now makes up 40% of cybersecurity budgets, with investment expected to grow as CISOs prioritize real-time AI defenses.Read More

Read More

How Sakana AI’s new evolutionary algorithm builds powerful AI models without expensive retraining

M2N2 is a model merging technique that creates powerful multi-skilled agents without the high cost and data needs of retraining.Read More

Read More

How Intuit killed the chatbot crutch – and built an agentic AI playbook you can copy

This is the inside story of Intuit’s transformation journey with AI — including a grueling nine-month pivot to “burn the boats” and reinvent how the 40-year-old finance giant builds its products.Read More

Read More

In crowded voice AI market, OpenAI bets on instruction-following and expressive speech to win enterprise adoption

OpenAI’s new speech model, gpt-realtime, hopes that its more naturalistic voices would make enterprises use more AI generated voices in applications.Read More

Read More

Nous Research drops Hermes 4 AI models that outperform ChatGPT without content restrictions

Nous Research launches Hermes 4 open-source AI models that outperform ChatGPT on math benchmarks with uncensored responses and hybrid reasoning capabilities.Read More

Read More