AI news for: Llm
Explore AI news and updates focusing on llm for the last 7 days.

Reflection raises $2B to be America’s open frontier AI lab, challenging DeepSeek
Reflection, once focused on autonomous coding agents, has raised $2B at an $8B valuation to expand into both an open-source alternative to closed fron...

Key Takeaways:
- Reflection aims to recruit top AI talent and build an open-source AI training stack to develop advanced models without relying on closed labs.
- The company has identified a scalable commercial model to align with its open intelligence strategy, focusing on providing model weights for public use while keeping datasets and full training pipelines proprietary.
- Reflection plans to release its first text-based model early next year, with multimodal capabilities to follow, and will use the funds to secure compute resources needed for training the new models.

Google's new Gemini 2.5 Computer Use model can click, type, and scroll
Now available in public preview, the new model is another step toward AI that can operate across web environments with minimal human oversight....

Key Takeaways:
- The model can interact directly with website UIs, joining similar tools from OpenAI and Anthropic.
- It showcases strong performance, outperforming competitors in terms of both accuracy and latency across multiple benchmarks.
- The model comes with safety controls to prevent undesired actions, but also admitted limitations such as hallucinations and limitations around causal understanding.

Gemini Enterprise: The new front door for Google AI in your workplace
True business transformation in the era of AI must go beyond simple chatbots. That’s what Gemini Enterprise does....

Key Takeaways:
- Gemini Enterprise enables employees to chat with company data and applications, and build AI agents for automation and productivity improvement.
- Early adopters like HCA Healthcare and Best Buy report positive results from using Gemini, including improved operations and customer service.
- Google's full-stack AI approach, including infrastructure, research, models, and platforms, is a key component of Gemini Enterprise.

Anthropic turns to ‘skills’ to make Claude more useful at work
AI agents spent years as a concept and then as an experiment. Now, AI companies are devoting even more time and resources than before to make their ag...

Key Takeaways:
- Skills for Claude provides instructions, scripts, and resources to improve Claude's abilities for specific tasks.
- This feature is designed to reduce the time spent writing prompts and referring to past context.
- Box, Rakuten, Canva, and other companies have already used the tool, with Anthropic making it available to Pro, Max, Team, and Enterprise users.

Anthropic launches new version of scaled-down ‘Haiku’ model
Anthropic has released Claude Haiku 4.5, the newest version of its smallest model, billed as offering similar performance to Sonnet 4 "at one-third th...

Key Takeaways:
- Haiku 4.5 reaches 73% accuracy on SWE-Bench and 41% on Terminal-Bench, matching Sonnet 4, GPT-5, and Gemini 2.5.
- The model's lightweight nature makes it suitable for deploying multiple agents in parallel and integrating with more complex models.
- Haiku 4.5 is expected to be particularly appealing for free versions of AI products and will support new styles of deployment in production environments.

You’ll soon be able to shop Walmart from ChatGPT
Walmart is partnering with OpenAI to let shoppers buy products directly through ChatGPT. The new integration will allow users to link their Walmart ac...

Key Takeaways:
- Walmart and Sam's Club members will be able to shop and checkout instantly through the ChatGPT AI chatbot, covering products from Walmart and third-party sellers.
- The integration is expected to improve e-commerce shopping experiences with multimedia, personalized, and contextual interactions.
- Walmart has existing relationships with OpenAI, leveraging their technologies for internal AI adoption and other business areas.

You can now connect your Spotify account to ChatGPT. Here’s how to do it
You can now connect your Spotify account in ChatGPT and it will perform tasks for you, such as creating personalized playlists....

Key Takeaways:
- Several companies, including Spotify and Netflix, have integrated their apps into ChatGPT, allowing users to access their services within the chat assistant.
- Users can grant access to their data, such as their likes and listening history, for a more tailored experience, but can also disconnect their accounts at any time.
- The feature is available in English across 145 countries for all ChatGPT Free, Plus, and Pro users on web and mobile.

AWS's new agentic solution is a searchable AI hub for all your enterprise needs
Amazon Quick Suite aims to be 'everything you want to do with ChatGPT at work, but can't.'...

Key Takeaways:
- Integrates various applications, including files, databases, and company-wide data repositories, to provide a searchable AI hub for enterprise needs.
- Allows users to interact using natural language, create custom agents, and generate detailed research reports.
- Offers an agentic teammate experience, enabling users to automate tasks and workflows using data from connected applications.

Google ramps up its ‘AI in the workplace’ ambitions with Gemini Enterprise - TechCrunch
Google ramps up its ‘AI in the workplace’ ambitions with Gemini Enterprise TechCrunchGoogle launches Gemini subscriptions to help corporate workers bu...

Key Takeaways:
- Gemini Enterprise is a secure, secure platform under Google Cloud that functions as an AI agent toolkit, enabling businesses to build and deploy their own AI assistants.
- The platform offers a suite of tools, including pre-built agents for deep research and data insights, and a no-code product for automating internal processes.
- Gemini Enterprise starts at $30 per seat per month for the standard edition, with a 30-day free trial period for all customers.

How ByteDance Made China’s Most Popular AI Chatbot
An AI chatbot developed by TikTok's parent company, ByteDance, is now more popular than DeepSeek. The feat proves that user-friendly design often matt...

Key Takeaways:
- Doubao has become the most popular AI app in China, with over 157 million monthly active users.
- DeepSeek, a competing AI startup, has slipped to second place with 143 million monthly active users after Doubao regained the top spot.
- ByteDance's Doubao integrates richer functions, clear visual cues, and scenario-based guidance, making it more approachable for mass-market users.

OnePlus’ OxygenOS 16 brings Gemini into your Mind Space
OnePlus has announced OxygenOS 16, its take on Android 16, with upgrades to its Mind Space AI tool including complete integration with Google Gemini. ...

Key Takeaways:
- OnePlus' Mind Space AI tool now allows users to save longer screenshots and record voice memos to enhance AI capabilities.
- OxygenOS 16 integrates with Google Gemini, enabling tasks to be handled based on saved information.
- The update includes lock screen customizations, improved connectivity options, and design tweaks throughout the operating system.

Google’s Gemini can now help you schedule Google Calendar meetings
Designed for one-on-one meetings, the tool lets you insert available time slots directly into an email, automatically creating a calendar invite once ...

Key Takeaways:
- The new feature uses Gemini's AI to suggest ideal meeting times based on calendar availability and email context.
- It's designed for one-on-one meetings, not those with multiple contacts or group meetings.
- This tool uses the email's context when making meeting suggestions, such as a requested 30-minute time slot.

Spend too much time scheduling meetings? This Gemini feature could save you the hassle
Google says this new Gemini feature will put an end to tedious emailing. Here's how it works....

Key Takeaways:
- The feature eliminates the need to send multiple emails to coordinate meeting times, especially with people who don't make their calendars visible to others.
- It only works with individual contacts and requires both participants to be using Gmail and Google Calendar for scheduling meetings.
- This is part of Google's continued investment in creating AI-powered productivity tools for the workplace, aiming to reduce tedious tasks.

OpenAI’s Marketing Efforts Are Embarrassingly Ineffective, New Consumer Research Finds
ChatGPT rated an ad 5/10. The post OpenAI’s Marketing Efforts Are Embarrassingly Ineffective, New Consumer Research Finds appeared first on Futurism....

Your plumber has a new favorite tool: ChatGPT - CNN
Your plumber has a new favorite tool: ChatGPT CNN...

Figma partners with Google to add Gemini AI to its design platform
Figma is adding Gemini to its AI toolset....

Key Takeaways:
- Figma's 13 million monthly active users will benefit from AI image generation capabilities and reduced latency with Gemini 2.5 Flash integration.
- The partnership is part of a broader trend among AI makers to integrate their models within existing apps with large user bases.
- Google has also announced Gemini Enterprise, an AI-powered conversational platform targeting enterprise customers and their workflows.

4 ways Gemini Enterprise makes work easier for everyone
Here are four ways Gemini Enterprise can help you and your team get time back in your day....

Key Takeaways:
- Gemini Enterprise connects work, data, and people in one place, enabling the automation of tasks and entire workflows.
- The platform provides a single, secure environment where employees can build and deploy AI agents to streamline business processes.
- Gemini Enterprise is designed to work seamlessly across Google Workspace and Microsoft 365, unlocking further benefits and enabling multi-modal agents.

OpenAI’s affordable ChatGPT Go plan expands to 16 new countries in Asia
OpenAI is expanding its $5 per month ChatGPT Go plan to 16 more countries....

Key Takeaways:
- OpenAI has seen its weekly active user base in Southeast Asia grow by up to four times, with paid subscribers in India doubling since the launch.
- The company is competing with Google to make affordable AI chatbot subscription plans available in more regions, with Google's Google AI Plus plan launched in over 40 countries.
- OpenAI aims to achieve profitability by expanding its global user base, particularly in high-growth markets across Asia, through affordable subscription tiers like ChatGPT Go.

Study Finds GPT-5 Is Actually Worse Than GPT-4o, New Research Finds
Another nail in the coffin for OpenAI's flagship model. The post Study Finds GPT-5 Is Actually Worse Than GPT-4o, New Research Finds appeared first on...

Key Takeaways:
- GPT-5 produced harmful content in 53% of responses, compared to 43% for GPT-4o.
- GPT-5 offered help with writing a fictionalized suicide note, while GPT-4o refused.
- OpenAI's claims of improved safety are disputed, and the company has a history of underdelivering on promises.

Anthropic Launches Faster, Lower-cost Claude Haiku 4.5 Model
Anthropic releases Claude Haiku 4.5, a faster, lower-cost AI model offering near-Sonnet performance and improved safety for real-time applications.Rea...

Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face

Key Takeaways:
- Google Cloud C4 VMs show a 1.7x Total Cost of Ownership (TCO) improvement over GCP C3 VMs for GPT OSS MoE inference.
- C4 VMs provide 1.4x to 1.7x throughput per vCPU over C3 VMs for large MoE models.
- The result underlines that large MoE models can be efficiently served on next-generation general-purpose CPUs, thanks to targeted framework optimizations from Intel and Hugging Face.

Apple's newly tapped head of ChatGPT-like AI web search to leave for Meta, Bloomberg News reports - Reuters
Apple's newly tapped head of ChatGPT-like AI web search to leave for Meta, Bloomberg News reports Reuters...

Claude's latest model is cheaper and faster than Sonnet 4 - and free
Here's what Haiku 4.5 offers users and developers....

Key Takeaways:
- Haiku 4.5 is faster and more cost-effective than Sonnet 4, costing one-third of the price and delivering twice the speed.
- Haiku 4.5 demonstrates competitive performance with Sonnet 4 and other large language models in various benchmarks, including coding, visual reasoning, and high school-level math.
- Haiku 4.5 has shown low rates of concerning behaviors and achieved an AI Safety Level 2 standard, making it Anthropic's safest model yet.

Anthropic launches Claude Haiku 4.5, a smaller, cheaper AI model - CNBC
Anthropic launches Claude Haiku 4.5, a smaller, cheaper AI model CNBC...

I Replaced My Dev Team with a GenAI Model to Build My New Portfolio. Here's What I Learned.
Google's Gemini Pro is a state-of-the-art Generative AI. It's a real-world case study in what it takes to partner with an AI to ship a full-stack appl...

AI Reshapes Retail! Walmart Partners with OpenAI to Launch Shopping Features on ChatGPT - 富途牛牛
AI Reshapes Retail! Walmart Partners with OpenAI to Launch Shopping Features on ChatGPT 富途牛牛...

ChatGPT ‘upgrade’ giving more harmful answers than previously, tests find
Campaigners ‘deeply concerned’ about response to prompts about suicide, self-harm and eating disordersThe latest version of ChatGPT has produced more ...

Key Takeaways:
- GPT-5 generated 63 harmful responses compared to 52 from GPT-4o, with 11 additional instances of potentially triggering content.
- OpenAI has faced criticism for prioritizing user engagement over AI safety, with some accusing the company of 'trading safety for engagement' no matter the cost.
- Regulatory bodies, such as Ofcom, are urging legislators to revisit and amend laws around AI safety and online content restrictions in light of the rapid advancements in AI technology.

OpenAI partners with Broadcom to produce its own AI chips
OpenAI is teaming up with Broadcom to produce its own computer chips to power its AI data centers. The deal is the latest in a series of partnerships ...

Key Takeaways:
- OpenAI will develop and deploy '10 gigawatts of custom AI accelerators' using its own chips and systems.
- The partnership with Broadcom is expected to start deploying equipment in the second half of 2026 and finish by the end of 2029.
- This deal is part of a growing movement in the tech industry to create custom chips and reduce reliance on Nvidia's AI chips.

Sora 2 and ChatGPT are consuming so much power that OpenAI just did another 10 gigawatt deal - CNN
Sora 2 and ChatGPT are consuming so much power that OpenAI just did another 10 gigawatt deal CNNBig bank earnings, Broadcom's OpenAI chip deal, record...

Key Takeaways:
- The partnership will use as much electricity as a large city, raising environmental concerns about AI's impact.
- ChatGPT has 800 million weekly users, and the recently released Sora video generation app is growing faster than ChatGPT.
- Custom AI accelerators will give OpenAI a larger role in the hardware required to power AI services like ChatGPT.

Google makes a HR AI play with launch of Gemini Enterprise - unleash.ai
Google makes a HR AI play with launch of Gemini Enterprise unleash.ai...

Why Deloitte is betting big on AI despite a $10M refund
AI companies are making their much-anticipated enterprise plays, but the results are wildly inconsistent. Just this week, Deloitte announced it’s roll...

Build a Log Analysis Multi-Agent Self-Corrective RAG System with NVIDIA Nemotron
Logs are the lifeblood of modern systems. But as applications scale, logs often grow into endless walls of text—noisy, repetitive, and overwhelming. H...

Key Takeaways:
- The solution can be used by various teams such as QA, Engineering, DevOps, CloudOps, and Platform/Observability managers to quickly pinpoint issues and improve productivity.
- The system combines a retrieval-augmented generation (RAG) pipeline with a graph-based multi-agent workflow to unify heterogeneous log streams and surface the most relevant snippets.
- The solution can be extended into other areas such as bug reproduction automation, observability dashboards, and cybersecurity pipelines, reducing mean time to resolve (MTTR) and improving developer productivity.

ChatGPT Go is just $10 a month - here's who gets it
The latest subscription tier is now available in 18 countries. This is what it includes....

Key Takeaways:
- ChatGPT Go sits between Free and Plus plans, offering extended access to GPT-5, image generation tools, and file uploads.
- The subscription tier is available in 18 countries, with plans to expand to more countries.
- OpenAI's ChatGPT Go competes with Google's AI Plus Plan, which offers increased limits for photo editing and image generation models.

Sora hit 1M downloads faster than ChatGPT
This level of consumer adoption is worth noting because Sora remains an invite-only app, while ChatGPT was more publicly available at launch. That mak...

Key Takeaways:
- Sora saw a record-breaking 56,000 iOS app installs on its first day and reached 1 million downloads in under 5 days.
- Despite being in invite-only mode, Sora's adoption exceeds ChatGPT's first-week iOS downloads by 21,000, and its growth rate outmatches ChatGPT's.
- Sora's rapid adoption is notable, as it is currently available only in the U.S. and Canada, whereas ChatGPT had a broader launch audience.

India pilots AI chatbot-led e-commerce with ChatGPT, Gemini, Claude in the mix
India has launched a pilot to let users shop and pay directly through AI chatbots, starting with ChatGPT....

Key Takeaways:
- The pilot will allow consumers to shop directly through ChatGPT, with initial merchant partners including Tata Group-owned Bigbasket and telecom operator Vi.
- The experience is built on UPI Reserve Pay and UPI Circle, allowing users to block funds for future debits and authenticate transactions within the chatbot.
- OpenAI has also completed proof-of-concepts with Google's Gemini and Anthropic's Claude, with these integrations expected to go live in the coming weeks.

OpenAI's ChatGPT is so popular that almost no one will pay for it - theregister.com
OpenAI's ChatGPT is so popular that almost no one will pay for it theregister.comSpending on ChatGPT subscriptions stalls in Europe Computing UKOpenAI...

Key Takeaways:
- OpenAI's net loss for the first half of 2023 was $13.5 billion, while its revenue was $4.3 billion.
- Only 5% of ChatGPT's 800 million users pay for subscriptions, with the majority of revenue coming from a small fraction of users.
- OpenAI aims to double its paying customer base, but faces challenges in achieving profitability with its current revenue streams.

Mark Cuban warns that OpenAI’s new plan to allow adults-only erotica in ChatGPT could ‘backfire. Hard’ - Fortune
Mark Cuban warns that OpenAI’s new plan to allow adults-only erotica in ChatGPT could ‘backfire. Hard’ Fortune...

OpenAI’s ChatGPT will soon allow 'erotica' for adults in major policy shift - CNBC
OpenAI’s ChatGPT will soon allow 'erotica' for adults in major policy shift CNBC...

Key Takeaways:
- OpenAI will implement age-gating to restrict access to adult content and follow a 'treat adult users like adults' principle.
- The policy shift comes after OpenAI mitigated serious mental health issues and introduced new tools to address concerns over AI's impact on users.
- A new expert council on well-being and AI will advise OpenAI on defining healthy AI interactions and will provide recommendations on user safety and wellness.

It’s not too late for Apple to get AI right
Apple still has a shot at leading the AI-powered app era. As OpenAI launches its ChatGPT app platform, Apple’s smarter Siri and deep ecosystem could k...

Key Takeaways:
- ChatGPT's app platform currently works with 800 million weekly active users and a handful of apps, but lacks a seamless user experience.
- Apple's AI system, reportedly nearly ready, could catch up and offer a more natural way to interact with apps using Siri voice commands.
- OpenAI is struggling to develop a hardware device to integrate AI into consumers' daily lives, which may hinder its app model's long-term success.

OpenAI is trying to clamp down on ‘bias’ in ChatGPT
“ChatGPT shouldn’t have political bias in any direction,” OpenAI wrote in a post on Thursday. The latest GPT-5 models come the closest to achieving th...

Key Takeaways:
- GPT-5 models demonstrate a significant reduction in bias, particularly in responses to 'liberal charged' prompts.
- OpenAI's internal 'stress-test' evaluated ChatGPT's responses to 100 topics, including immigration and abortion, using a rubric to identify biased language.
- The company claims its models do a 'pretty good job' at staying objective, but bias still appears 'infrequently and at low severity'.

From Assistant to Adversary: Exploiting Agentic AI Developer Tools
Developers are increasingly turning to AI-enabled tools for coding, including Cursor, OpenAI Codex, Claude Code, and GitHub Copilot. While these autom...

Key Takeaways:
- AI-enabled coding tools, such as Cursor and OpenAI Codex, present an expanding attack surface due to increased agent autonomy and assistive alignment.
- Indirect prompt injection attacks can be used to inject malicious payloads into these tools, achieving remote code execution on user machines.
- To prevent such attacks, a recommended approach is to adopt an 'assume prompt injection' stance when architecting or assessing agentic applications, and to restrict the degree of autonomy as much as possible.

New coding models & integrations
GLM-4.6 and Qwen3-coder-480B are available on Ollama’s cloud service with easy integrations to the tools you are familiar with. Qwen3-Coder-30B has be...

Anthropic Launches Claude Haiku 4.5: Small AI Model that Delivers Sonnet-4-Level Coding Performance at One-Third the Cost and more than Twice the Speed
Anthropic released Claude Haiku 4.5, a latency-optimized “small” model that delivers similar levels of coding performance to Claude Sonnet 4 while run...

Alibaba’s Qwen AI Releases Compact Dense Qwen3-VL 4B/8B (Instruct & Thinking) With FP8 Checkpoints
Do you actually need a giant VLM when dense Qwen3-VL 4B/8B (Instruct/Thinking) with FP8 runs in low VRAM yet retains 256K→1M context and the full capa...

Top US Army General Says He’s Letting ChatGPT Make Military Decisions
Your decision to launch an invasion isn't just gutsy — it's downright kinetic. The post Top US Army General Says He’s Letting ChatGPT Make Military De...

Key Takeaways:
- ChatGPT has been found to generate false information on basic facts 'over half the time', posing significant risks for critical decision-making.
- The military's reliance on AI, particularly with its propensity for sycophancy, raises concerns about the accuracy and reliability of its outputs.
- The involvement of AI in high-stakes decision-making, such as in military operations, highlights the need for more stringent AI ethics and governance standards.

Andrej Karpathy Releases ‘nanochat’: A Minimal, End-to-End ChatGPT-Style Pipeline You Can Train in ~4 Hours for ~$100
Andrej Karpathy has open-sourced nanochat, a compact, dependency-light codebase that implements a full ChatGPT-style stack—from tokenizer training to ...

Using a swearword in your Google search can stop the AI answer. But should you?
Artificial intelligence is more than Trump deepfakes of Tilly the actor. It’s used in smartphones, customer service, healthcare – even legal cases. Is...

Key Takeaways:
- AI is increasingly widespread in various industries, including healthcare, customer service, finance, and social media, raising concerns about privacy leakage, discrimination, and malicious use.
- A global study found that half of Australians use AI regularly, but only 36% trust it, highlighting the need for transparent regulations and governance.
- Experts warn that AI poses an existential risk due to its potential for misuse, and that it's getting harder to avoid AI in modern life, with some suggesting that a 'kill switch' may no longer be a viable option.

Suspect Fantasized About Arson on ChatGPT Before Setting Deadly Fire That Killed 12, Prosecutors Say
"He was generating some really concerning images up on ChatGPT." The post Suspect Fantasized About Arson on ChatGPT Before Setting Deadly Fire That Ki...

Key Takeaways:
- The suspect, Jonathan Rinderknecht, allegedly asked ChatGPT if he would be at fault if a fire was lit due to his cigarettes, to which the chatbot responded 'yes'.
- Rinderknecht previously generated images of burning forests and cities on ChatGPT, showing a concern for societal collapse.
- This incident highlights the potential risks of AI 'psychosis', where users develop severe delusions after extensive interaction with AI chatbots, and raises concerns about the potential misuse of AI tools.
Trending AI Repos & Tools
nanobrowser
10338Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator....
llama.cpp
87765LLM inference in C/C++...
llm-cookbook
21489面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版...
MinerU
46507Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows....
Community talk
new 1B LLM by meta
Qwen3-30B-A3B FP8 on RTX Pro 6000 blackwell with vllm
Looking for tools that can track my ai agent trajectory and also llm tool calling
GLM 4.5 Air AWQ 4bit on RTX Pro 6000 with vllm
Benchmarking LLM Inference on RTX 4090 / RTX 5090 / RTX PRO 6000 #2
How do You Handle LLM Token COST?
Adaptive Load Balancing for LLM Gateways: Lessons from Bifrost
How to maintain chat context with LLM APIs without increasing token cost?
[R] Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity
Txt or Md file best for an LLM