14 July 2025

Topic: Models And Releases

Exclusive with Amazon Bedrock chief Atul Deo: AWS' big bet on cheaper AI — and a new breed of software agent - SiliconANGLE
source siliconangle.com Jul 14, 2025

Atul Deo’s goal is to make artificial-intelligence software both cheaper and smarter at the same time. The up-and-coming executive, head of Amazon …

TL;DR
Amazon Bedrock chief Atul Deo says his goal is to make artificial-intelligence software both cheaper and smarter, and he showcases AWS' new breed of software agents, which can execute multistep tasks and justify their price tag.

Key Takeaways:
  • Amazon Bedrock has added seven headline models, and users can now swap among them with a single API, reducing the complexity of choosing the right model.
  • AWS has introduced new features aimed at reducing inference cost, such as prompt caching, intelligent prompt routing, batch mode, and model distillation, with potential savings of up to 90%.
  • The Model Context Protocol (MCP) is being developed to enable agents to discover data sources, maintain state, enforce security policies, and dynamically interact with each other.
Amazon launches Kiro, its own Claude-powered challenger to Windsurf and Codex
source venturebeat.com Jul 14, 2025

Initial community reactions to Kiro were mixed, but developers were intrigued, praising the emphasis on specs, hooks and structure....

TL;DR
Amazon has released Kiro, a new agentic integrated development environment (IDE) that helps developers bridge the gap between rapid prototyping and secure, maintainable applications.

Key Takeaways:
  • Kiro offers a spec-driven development model, guiding the process from ideation to implementation, with features like automated task management and quality control.
  • The tool is built on open tooling, compatible with Visual Studio Code extensions and settings, and includes features like agentic multi-modal chat and steering rules.
  • Kiro is currently free during its preview period, with plans to offer three subscription tiers after the preview ends, including a free tier with limited features and two paid tiers.
Alibaba-backed Moonshot releases new Kimi AI model that beats ChatGPT, Claude in coding — and it costs less
source cnbc.com Jul 14, 2025

BEIJING — The latest Chinese generative artificial intelligence model to take on OpenAI's ChatGPT is offering coding capabilities — at a lower …...

TL;DR
Moonshot, an Alibaba-backed startup, releases a low-cost, open-source large language model called Kimi K2, which surpasses ChatGPT and Claude Opus 4 in coding capabilities.

Key Takeaways:
  • Moonshot's Kimi K2 model beats OpenAI's GPT-4.1 model in coding performance and costs less, charging 15 cents per 1 million input tokens and $2.50 per 1 million output tokens.
  • Kimi K2 achieves better overall performance than Claude Opus 4 on several industry benchmarks and has lower token costs for large-scale or budget-sensitive deployments.
  • The model is open source, so developers can access its source code for free and use it as they see fit. The terms do require a visible mention of 'Kimi K2' in the user interface of any product with more than 100 million monthly active users or $20 million in monthly revenue.
GPT-5 is coming, but can OpenAI retain its edge?
source livemint.com Jul 14, 2025

Can GPT-5 be a game-changer? OpenAI’s most anticipated model is expected to advance reasoning, memory and adaptability through persistent (human-like ...

TL;DR
OpenAI's GPT-5 is expected to launch soon, but its success will depend on whether it surpasses previous models in artificial general intelligence capabilities.

Key Takeaways:
  • GPT-5 aims to advance reasoning, memory, and adaptability through persistent memory, better navigation, and autonomous behavior.
  • OpenAI faces stiff competition from rivals like Google, Meta, and xAI, which are also developing advanced AI models and poaching OpenAI talent.
  • The company is gearing up for the future with AI-first hardware development and a new browser, but concerns over AGI misuse persist.
Build secure RAG applications with AWS serverless data lakes
source aws.amazon.com Jul 14, 2025

Data is your generative AI differentiator, and successful generative AI implementation depends on a robust data strategy incorporating a …

TL;DR
A secure RAG application can be built using a serverless data lake architecture with AWS services for fine-grained access control, metadata-driven retrieval, and robust security features.

Key Takeaways:
  • Serverless data lakes support scalable and secure access control for RAG applications, with features like fine-grained permissions and cross-functional access controls.
  • Amazon Bedrock Knowledge Bases provide a managed solution for organizing and retrieving unstructured data, with features like metadata filtering and standardization.
  • A modern data strategy with proper governance and serverless architecture is essential for making the most of data assets for generative AI applications, while maintaining security and compliance.
Another Chinese AI model is turning heads
source nbcnews.com Jul 14, 2025

Alibaba-backed startup Moonshot released on late Friday night its Kimi K2 model, touting performance that rivals many U.S. models. BEIJING — The lates...

TL;DR
Moonshot, an Alibaba-backed startup, releases the Kimi K2 model, an open-source large language model with lower costs and improved coding capabilities, challenging OpenAI's dominance in the field.

Key Takeaways:
  • Kimi K2 surpasses Claude Opus 4 and GPT-4.1 models on certain benchmarks and offers lower token costs, making it attractive for large-scale or budget-sensitive deployments.
  • The model charges 15 cents per 1 million input tokens and $2.50 per 1 million output tokens, significantly less than the prices for Claude Opus 4 and GPT-4.1.
  • Kimi K2's lower costs and improved performance could disrupt the market and challenge OpenAI's dominance in the field of large language models.
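
To make the quoted Kimi K2 pricing concrete, here is a minimal sketch of the arithmetic. It uses only the two rates reported above (15 cents per 1 million input tokens, $2.50 per 1 million output tokens). The function name and the example token counts are hypothetical, chosen for illustration, and real bills may differ from this simple linear estimate.

```python
from decimal import Decimal

# Rates quoted in the coverage above, in US dollars per 1 million tokens.
KIMI_K2_INPUT_USD_PER_MILLION = Decimal("0.15")
KIMI_K2_OUTPUT_USD_PER_MILLION = Decimal("2.50")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of a workload at the quoted per-token rates.

    Hypothetical helper: assumes cost scales linearly with token counts.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * KIMI_K2_INPUT_USD_PER_MILLION
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * KIMI_K2_OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Illustrative workload: 2 million input tokens and 500,000 output tokens.
    cost = estimate_cost_usd(2_000_000, 500_000)
    print(f"${cost:.2f}")  # prints "$1.55" (0.30 input + 1.25 output)
```

Because the output rate is far higher than the input rate, the estimate is dominated by output tokens for any workload that generates long responses.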

Community talk