Explore a the AI rundown of the most hightlighted news and community talks that moved AI this week.
OpenAI and Oracle reportedly signed a deal that includes OpenAI buying $300 billion of compute over a five-year span....
Microsoft to use some AI from Anthropic in shift from OpenAI, the Information reports ReutersMicrosoft Will Use Anthropic Models in Office 365 Copilot...
Nebius shares soar 49% in premarket trading on multi-billion AI infrastructure deal with Microsoft CNBCMicrosoft Signs Nebius Cloud Deal for as Much a...
AI firm Mistral valued at $14 billion as chip giant ASML takes major stake CNBCExclusive: ASML becomes Mistral AI’s top shareholder after leading late...
Oracle surges on AI cloud growth as customers race to secure computing capacity ReutersOracle stock booms 40%, on pace for best day since 1992 CNBCSto...
The $300B deal is a reminder that despite Oracle’s legacy status, it shouldn’t be overlooked when it comes to AI infrastructure. But key questions aro...
SentinelOne to Acquire Observo AI to Revolutionize SIEM and Security Operations SentinelOneObservo AI, Real Time Data Pipelines, and the Future of the...
https://www.theinformation.com/articles/microsoft-buy-ai-anthropic-shift-openai...
In Oracle’s recent call, Larry Ellison said something that caught my attention: “All this money we’re spending on training is going to be translated ...
Article URL: https://research.google/blog/vaultgemma-the-worlds-most-capable-differentially-private-llm/ Comments URL: https://news.ycombinator.com/it...
How Alibaba builds its most efficient AI model to date South China Morning PostQwen3-Next: A New Generation of Ultra-Efficient Model Architecture Unve...
A new Google DeepMind image editing model, fondly known as Nano Banana, is now in the Gemini app, giving you more creative control to blend and edit p...
TildeOpen LLM is an open-source foundational language model built to serve underrepresented Nordic and Eastern European languages. Developed with Euro...
Celtic languages — including Cornish, Irish, Scottish Gaelic and Welsh — are the U.K.’s oldest living languages. To empower their speakers, the UK-LLM...
Article URL: https://synbol.github.io/Lumina-DiMOO/ Comments URL: https://news.ycombinator.com/item?id=45221103 Points: 17 # Comments: 1...
Here: https://qwen.ai/blog?id=4074cca80393150c248e508aa62983f9cb7d27cd&from=research.latest-advancements-list...
First question asked ChatGPT 4o today was "What's your status?" This is the response....
Benchmarks Model Card: https://huggingface.co/Qwen/Qwen3-Next-80B-A3B-Thinking Instruct Model Card: https://huggingface.co/Qwen/Qwen3-Next-80B-A3B-I...
https://preview.redd.it/50ap87u5g5of1.png?width=1200&format=png&auto=webp&s=2a3343131a6886043ce8b5fef053f330b9b60632 Wtf?...
We are thrilled to announce the official open-sourcing of IndexTTS-2.0 - an emotionally rich and duration-controllable autoregressive zero-shot text-t...
Bytedance's new seedream 4 image generator and editing model has some of the most insane photoshops i've seen yet. Credit to 'AI Search' on yt for the...
After experimenting with Qwen3 Next, it's a very impressive model. It does have problems with sycophancy and coherence- but it's fast, smart and it's ...
Remember when y'all roasted us about the license? We listened. Just dropped what we think is a world first: **70B model intermediate checkpoints**. N...
Baidu, the Chinese Google, recently released a couple of new models - an update to open source Ernie 4.5 and proprietary Ernie X1.1: https://preview....
Recently I presented another music theory problem and explained why it may be a great way to test LLMs' ability: [https://www.reddit.com/r/LocalLLaMA/...
model: [https://huggingface.co/facebook/MobileLLM-R1-950M](https://huggingface.co/facebook/MobileLLM-R1-950M) app (vibe coded): [https://huggingface....
Google made three major updates to its Gemini-powered products on Monday: The Gemini app now accepts audio files; Search can handle five new languages...
Apple Intelligence was designed to leverage things that generative AI already does well, like text and image generation, to improve upon existing feat...
Google has added support for 1080p resolution and vertical video formats to its Veo 3 AI video generator. According to the announcement on Google’s de...
I tried Apple's 2 big AI features announced at the iPhone 17 event - and both are game changers ZDNETHSBC Keeps Hold on Apple (AAPL), Sets $220 Price ...
What to expect at Meta Connect 2025: 'Hypernova' smart glasses, AI and the metaverse EngadgetThe Incredible Part of Meta's Next Smart Glasses Could Be...
In addition to searching the web, you can now add the web fetch tool to your requests and Claude will fetch and analyze content from any webpage URL—n...
[https://www.cnbc.com/2025/09/12/apple-google-meta-universal-translator.html](https://www.cnbc.com/2025/09/12/apple-google-meta-universal-translator.h...
Hey r/ArtificialIntelligence! Just saw Google released something pretty interesting, EmbeddingGemma, their new embedding model that's specifically b...
Apologies for the bad screenshot. I noticed this new button under my chat today. Not sure when it arrived but finally I can use a feature I really mi...
I’m on Pro and had been using memory for over a year — it kept track of personal details (dog, girlfriend, etc.) and even powered custom workflows lik...
Article URL: https://chipsandcheese.com/p/amds-rdna4-gpu-architecture-at-hot Comments URL: https://news.ycombinator.com/item?id=45235293 Points: 72 # ...
NVIDIA dropped MLPerf results for Blackwell Ultra yesterday. 5× throughput on DeepSeek-R1, record runs on Llama 3.1 and Whisper, plus some clever tric...
Gigabyte's AI Top CXL R5X4 expansion card lets you plug up to 512 GB of DDR5 ECC RDIMM RAM into a PCIe 5.0 x16 slot, using Compute Express Link (CXL) ...
A team of engineers have created a new optical chip that uses light (photons) instead of electricity for key AI operations like image recognition and ...
This virtually guarantees that it's coming to M5. Previous discussion and my comments: https://www.reddit.com/r/LocalLLaMA/comments/1mn5fe6/apple_pat...
The upgrade kit comprises a custom PCB designed with a clamshell configuration, facilitating the installation of twice the number of memory chips. Mos...
With the rise of medium-sized MoE (gpt-oss-120B, GLM-4.5-air, and now the incoming Qwen3-80B-A3B) and their excellent performance for local models (we...
https://x.com/XRoboHub/status/1965686796018467064 ...
A new system called Real Simple Licensing would allow AI companies to license training data at a massive scale — if they're willing to pay for it....
The state's landmark safety bill sets new transparency requirements for large AI companies....
The federal consumer regulator seeks to learn about how AI companies evaluate the safety of their chatbots....
If SB 243 is enacted, California would become the first state to require operators to implement safety protocols for AI companions and hold companies ...
The Federal Trade Commission (FTC) is ordering seven AI chatbot companies to provide information about how they assess the effects of their virtual co...
Claude’s new AI file creation feature ships with deep security risks built in Ars TechnicaClaude can now create and edit files AnthropicClaude Can Now...
On Wednesday, Sen. Ted Cruz introduced legislation to create a regulation "sandbox" that would allow artificial intelligence companies to experiment w...
A new licensing standard aims to let web publishers set the terms of how AI system developers use their work. On Wednesday, major brands like Reddit, ...
Heads up, fellow tinkers The EU AI Act’s first real deadline kicked in August 2nd so if you’re messing around with models that hit 10\^23 FLOPs or mo...
I am a researcher/artist working on historically accurate reconstructions of ancient cultures. I’ve noticed that requests for depictions of Greeks, Ro...
Article URL: https://www.alizila.com/qwen-ecosystem-expands-rapidly-accelerating-ai-adoption-across-industries/ Comments URL: https://news.ycombinator...
We’re excited to release Stable Audio 2.5, our latest audio model and the first developed for enterprise-grade use cases. Stable Audio 2.5 introduces ...
Lilly launches TuneLab platform to give biotechnology companies access to AI-enabled drug discovery models built through over $1 billion in research i...
Article URL: https://www.vectroid.com/blog/why-and-how-we-built-Vectroid Comments URL: https://news.ycombinator.com/item?id=45224141 Points: 53 # Comm...
Oboe is a new AI-powered learning platform that lets you create personalized courses on any topic with a prompt....
The startup is using real-time AI agents that inspect, analyze, and neutralize email threats....
Learn how Google Research developed Simplify in the Google app for iOS to help you understand complex information more easily....
Article URL: https://www.anthropic.com/news/create-files Comments URL: https://news.ycombinator.com/item?id=45182381 Points: 179 # Comments: 102...
This 30-year-old CEO says his AI negotiator can successfully haggle down the price of a car by thousands of dollars FortuneDealerships gain another AI...
I previously shared an open-source project for extracting structured data from documents. I’ve now hosted it as a free to use API. * Outputs: JSON, M...
One thing that drives me crazy with AI is how the quality drifts. Some days it’s sharp, then out of nowhere it starts refusing simple stuff or slowing...
Hospitals already starting to move to an AI-centric future: Translated from [https://www.calcalist.co.il/calcalistech/article/s1py711mige](https://ww...
* Girlfriend tried using GPT-5 to repair a precious photo with writing on it. * GPT-5s imagegen, because its not really an editing model, failed miser...
# This week's AI landscape was dominated by Jus Mundi Launches Jus AI 2: 'Breakthrough' Legal AI Combines Agentic Reasoning with Research Control, whi...
i’ve been bouncing between Claude Code and Cursor lately and the contrast is real. Claude Code feels great for speed — spin up a CLI, describe what y...
I use AI a lot for writing docs and love how well it works with Mermaid diagrams (since they're code-based). I kept asking Claude to help create prese...
# NOT PRODUCTION READY Works only in Desktop for now.. Two weeks ago I shared that I was building a browser-based video editor in just 14 days....
Announcing Genkit Go 1.0 and Enhanced AI-Assisted Development Google for Developers Blog...
In a blog post shared Wednesday, Mira Murati's startup offered a rare glimpse into some of work its doing to improve AI models....
I did 24 days of coding in 12 hours with a $20 AI tool - but there's one big pitfall ZDNET...
Article URL: https://www.qodo.ai/blog/deepcodebench-real-world-codebase-understanding-by-qa-benchmarking/ Comments URL: https://news.ycombinator.com/i...
ComfyUI — an open-source, node-based graphical interface for running and building generative AI workflows for content creation — published major updat...
Been building RAG systems for mid-size enterprise companies in the regulated space (100-1000 employees) for the past year and to be honest, this stuff...
[getting feedback](https://preview.redd.it/2wvkvb2b5iof1.png?width=905&format=png&auto=webp&s=f39e95cd2007d1cc27b1811e937f2e2fbe3c8d06) [different re...
I see quite a few people here saying they store their prompts in a Gdoc or on a sticky note, so I thought the (free) tool I built might be useful to y...
Hey fellow LLM devs! Stjepan from Manning here. 👋 I’m excited to share that **Sebastian Raschka**, the bestselling author of *Build a Large Language...
On 2025-09-08 the maintainer of some popular JS libraries was compromised, and new versions of some popular libraries were released with some crypto s...
I ran into a problem and discovered that Ollama defaults to a 4096 context length for all models, regardless of the model's actual capabilities. It si...
**16 tok/sec** with LM Studio → **\~24 tok/sec** by switching to llama.cpp → **\~31 tok/sec** upgrading RAM to DDR5 # PC Specs * **CPU:** Intel 1360...
As a huge AI audio nerd, I've recently been knee-deep in Microsoft's latest VibeVoice models and they really are awesome!! The work from the Microsoft...
In my [previous](https://www.reddit.com/r/LocalLLaMA/comments/1n21tb6/comment/nb4h42v/) post I highlighted a Blender python agent I'm working on. I've...
I really like how **NotebookLM** works - I just upload a file, ask any question, and it provides high-quality answers. How could one build a similar s...
I found a cheap HP DL380 G9 from a local eWaste place and decided to build an inference server. I will keep all equivalent prices in US$, including sh...
I posted about this but I don't think I really let on what it was and that is my bad. This is an agent builder and not just a chat wrapper. I did get...
Everyone’s focused on model quality, parameters, and benchmark scores. But when I talk to engineers quietly, a huge pain point seems to be the actual ...
One thing I’ve been experimenting with is long-term memory for AI systems. Most solutions today (RAG + vector DBs) are great for search, but they don’...
Most of us here have seen prompts break in ways that feel random: * the model hallucinates citations, * the “style guide” collapses halfway through, ...
Been using Claude Code for a few months and realized it can bypass its own permission system pretty easily. Even with deny config or [CLAUDE.md](http:...
Hello everyone, I've been working on a new open-source (MIT license) TypeScript library called `code-chopper`, and I wanted to share it with this com...
I’m building a RAG system using research papers from the arXiv dataset. The dataset is filtered for AI-related papers (around 440k+ documents), and I ...
Benchmarked every cloud model offered from the top providers for some projects I was working on. Looks like: * **Winner:** allam-2-7b on [Groq.ai](h...
Hey everyone, it's Michael from [Unsloth](https://github.com/unslothai/unsloth) here! Ever since we released Dynamic GGUFs, we've received so much lov...
So after reading the latest OpenAI white paper regarding why they think models hallucinate, I worked with Claude to try to help "untrain" my agents an...
Hey all, I kept seeing the same prompt tips repeated everywhere, so I put together a deeper guide for those who want to actually *master* prompt desig...
We just shipped v0.5.9 of Codanna. C and C++ joins Rust, Python, TypeScript, Go, and PHP. Functions, structs, classes, templates, macros—indexed and s...
Anyone here actually using OpenAI’s Responses API instead of Chat Completions? Feels like they’re pushing it everywhere now, also now via Codex. Cur...
Hey everyone, I got tired of seeing prompts that look good but break down when you actually use them. So I built **Aether**, a prompt framework that ...
Are LLMs better with certain formats such as JSON, XML, or Markdown, or do they handle all languages equally? And if they do have preferences, do we k...
No news items for this topic this week
So OpenAI dropped a paper with Georgia Tech about why LLMs hallucinate and it's pretty eye opening. We've all seen this. Ask ChatGPT when someone's b...
This past week was a busy period for mega-sized funding rounds, with all 10 of the largest U.S. financings exceeding the $100 million mark. Topping th...
[https://techcrunch.com/2025/09/12/we-are-entering-a-golden-age-of-robotics-startups-and-not-just-because-of-ai/](https://techcrunch.com/2025/09/12/we...