Models And Releases - AI news 2025-07-25

New AI architecture delivers 100x faster reasoning than LLMs with just 1,000 training examples

venturebeat.com • Jul 25, 2025

Hierarchical Reasoning Models (HRM) tackle complex reasoning tasks while being smaller, faster, and more data-efficient than large AI models....

TL;DR

Sapient Intelligence developed an AI architecture called the Hierarchical Reasoning Model (HRM), which can solve complex reasoning tasks more efficiently and effectively than large language models.

Key Takeaways:

The HRM architecture can achieve near-perfect accuracy on complex puzzles like Sudoku and maze-solving tasks with significantly less data and memory compared to current large language models.
HRM's parallel processing approach allows for a 100x speedup in task completion time, making it suitable for latency-sensitive fields like embodied AI and robotics.
The cost savings of using HRM are substantial, with training times measured in GPU hours instead of days or weeks for massive foundation models.

It’s Qwen’s summer: new open source Qwen3-235B-A22B-Thinking-2507 tops OpenAI, Gemini reasoning models on key benchmarks

venturebeat.com • Jul 25, 2025

The new Qwen3-Thinking-2507, as we'll call it for short, now leads or closely trails top-performing models across several major benchmarks....

TL;DR

Alibaba's Qwen Team released four open-source generative AI models, including Qwen3-Thinking-2507, which boasts record-setting benchmarks and superior reasoning capabilities.

Key Takeaways:

Qwen3-Thinking-2507 surpasses top-performing models across several major benchmarks, including AIME25, LiveCodeBench, and GPQA.
The model's separation from hybrid reasoning enables improved consistency, clarity, and benchmark performance, setting a new standard for open-source, reasoning-focused models.
The Qwen series offers permissive licensing, allowing full flexibility and ownership for enterprises, and the team plans to extend this open, performant, and production-ready AI infrastructure.

AI referrals to top websites were up 357% year-over-year in June, reaching 1.13B

techcrunch.com • Jul 25, 2025

AI platforms in June 2025 generated over 1.13 billion referrals to the top 1,000 websites globally, up 357% year-over-year....

TL;DR

AI referrals to top websites globally grew by 357% in June 2025, reaching 1.13 billion, but Google Search still accounts for the majority of traffic.

Key Takeaways:

AI referrals grew by 357% in June 2025, reaching 1.13 billion, with news and media experiencing a 770% increase.
Google Search still accounts for the majority of traffic to top websites globally, at 191 billion referrals in June 2025.
The top 5 news and media websites receiving AI referrals in June 2025 were Yahoo (2.3M), Yahoo Japan (1.9M), Reuters (1.8M), The Guardian (1.7M), and India Times (1.2M)

CoSyn: The open-source tool that’s making GPT-4V-level vision AI accessible to everyone

venturebeat.com • Jul 25, 2025

Researchers at the University of Pennsylvania and the Allen Institute for Artificial Intelligence have developed a groundbreaking tool that allows ope...

TL;DR

Researchers at the University of Pennsylvania and the Allen Institute for Artificial Intelligence have developed CoSyn, a groundbreaking tool generating synthetic training data for AI systems to match or surpass proprietary models' visual understanding capabilities.

Key Takeaways:

CoSyn enables open-source AI systems to match proprietary models' visual understanding capabilities without relying on copyrighted materials
The tool leverages language models' coding abilities to generate high-quality synthetic training data, addressing a critical bottleneck in AI development
CoSyn achieves state-of-the-art performance on benchmark tests, outperforming proprietary models like GPT-4V and Gemini on key tasks

Claude Code introduces specialized sub-agents

docs.anthropic.com • Jul 25, 2025

Article URL: https://docs.anthropic.com/en/docs/claude-code/sub-agents Comments URL: https://news.ycombinator.com/item?id=44686726 Points: 88 # Commen...

TL;DR

Claude Code introduces sub agents, pre-configured AI personalities that can handle specific tasks with customized system prompts, tools, and a separate context window.

Key Takeaways:

Sub agents enable context preservation, specialized expertise, reusability, and flexible permissions.
They can be invoked automatically by Claude Code when matching a task description or explicitly by mentioning the sub agent in a command.
Best practices for using sub agents include starting with Claude-generated agents, designing focused sub agents, writing detailed prompts, limiting tool access, and version controlling them.

Google is testing a vibe-coding app called Opal

techcrunch.com • Jul 25, 2025

Google is testing a new vibe-coding tool called Opal, available in the U.S. through Google Labs, that lets users quickly spin up web apps with just a ...

TL;DR

Google is testing Opal, a vibe-coding tool that lets users create mini web apps using text prompts.

Key Takeaways:

Google's Opal tool uses Google models to create web apps from text prompts, aiming to target a wider audience.
The tool's visual workflow allows users to edit and personalize the app creation process.
Google joins the list of competitors, including Canva, Figma, and Replit, focusing on non-technical app prototyping.

Google spoofed via DKIM replay attack: A technical breakdown

easydmarc.com • Jul 25, 2025

Article URL: https://easydmarc.com/blog/google-spoofed-via-dkim-replay-attack-a-technical-breakdown/ Comments URL: https://news.ycombinator.com/item?i...

TL;DR

Article URL: https://easydmarc.com/blog/google-spoofed-via-dkim-replay-attack-a-technical-breakdown/ Comments URL: https://news.ycombinator.com/item?i

Key Takeaways:

How Anthropic teams use Claude Code

www.anthropic.com • Jul 25, 2025

Article URL: https://www.anthropic.com/news/how-anthropic-teams-use-claude-code Comments URL: https://news.ycombinator.com/item?id=44678535 Points: 15...

TL;DR

Anthropic teams efficiently develop and automate tasks using Claude Code, streamlining their workflows and productivity.

Key Takeaways:

Claude Code accelerates feature implementation and development velocity across various teams, including product development, data infrastructure, and security engineering.
The tool enables non-technical staff to tackle complex projects, automate tasks, and bridge skill gaps, resulting in time savings and improved productivity.
Claude Code's use cases cover various aspects, including codebase navigation, debugging, documentation synthesis, and parallel task management, showcasing its versatility and potential for widespread adoption.