AI news for: Developer And Technical
Explore AI news and updates on developer and technical topics from the last 7 days.

Skills for Claude will let you customize tasks with pre-set instructions - here's how
Available to Pro, Max, Team, and Enterprise users, the new capabilities shape Claude's output when carrying out routine or specialized tasks....

Key Takeaways:
- Users can create and upload their own custom skills for Claude to follow.
- Skills are essentially digital instruction manuals that make Claude more customizable and specialized.
- The launch of Skills marks a step towards making Claude more agentic, able to carry out complex tasks with minimal user oversight.

Anthropic turns to ‘skills’ to make Claude more useful at work
AI agents spent years as a concept and then as an experiment. Now, AI companies are devoting even more time and resources than before to make their ag...

Key Takeaways:
- Skills for Claude provides instructions, scripts, and resources to improve Claude's abilities for specific tasks.
- This feature is designed to reduce the time spent writing prompts and referring to past context.
- Box, Rakuten, Canva, and other companies have already used the tool, with Anthropic making it available to Pro, Max, Team, and Enterprise users.

You can now connect your Spotify account to ChatGPT. Here’s how to do it
You can now connect your Spotify account in ChatGPT and it will perform tasks for you, such as creating personalized playlists....

Key Takeaways:
- Several companies, including Spotify and Netflix, have integrated their apps into ChatGPT, allowing users to access their services within the chat assistant.
- Users can grant access to their data, such as their likes and listening history, for a more tailored experience, but can also disconnect their accounts at any time.
- The feature is available in English across 145 countries for all ChatGPT Free, Plus, and Pro users on web and mobile.

You can try Google's viral image editing tool right in Search now - here's how
Google's AI image editor, Nano Banana, is coming to a new lineup of applications. I tried it out for myself, and it surprised me....

Key Takeaways:
- Nano Banana uses natural-language prompts to generate edited images in Google Search and NotebookLM.
- The AI-powered image editor offers new styles for illustrations and Briefs in NotebookLM.
- Google has generated over five billion images with Nano Banana since its integration in August.

OpenAI suspends MLK deepfakes on Sora after ‘disrespectful’ videos
OpenAI said on Thursday night that it has “paused” deepfakes of Martin Luther King Jr. on its social app Sora after users created “disrespectful” AI-g...

Key Takeaways:
- OpenAI will pause AI-generated deepfakes of Martin Luther King Jr. on Sora
- Estates and representatives of public figures can now opt out of their likeness being used on Sora
- This move echoes OpenAI's approach to copyright when Sora first launched, which proved controversial

Claude now integrates directly with Microsoft 365
Here's what the new connector lets Anthropic's chatbot do, how it can benefit you, and who gets to access it....

Key Takeaways:
- Claude can access SharePoint, OneDrive, Outlook, and Teams to pull information directly from those apps.
- The new 'enterprise search' feature allows businesses to integrate all critical apps for centralized resource retrieval.
- Admins must curate digital tools for the team-wide accounts, and the new features are available to Claude Team and Enterprise subscribers.

NVIDIA Blackwell Leads on SemiAnalysis InferenceMAX v1 Benchmarks
SemiAnalysis recently launched InferenceMAX v1, a new open source initiative that provides a comprehensive methodology to evaluate inference hardware....

Key Takeaways:
- 15x performance gain over the Hopper generation
- 15x revenue opportunity for AI factories
- Continuous software optimizations deliver boost in performance over time

Pinterest adds controls to let you limit the amount of ‘AI slop’ in your feed
Pinterest is rolling out new controls that let users limit how much AI-generated content appears in their feeds. The company is also making its AI con...

Key Takeaways:
- GenAI content now makes up 57% of all online material.
- Users can now personalize their feeds to restrict GenAI imagery in select categories.
- Pinterest will introduce more AI content labels and make them more noticeable soon.

OnePlus’ OxygenOS 16 brings Gemini into your Mind Space
OnePlus has announced OxygenOS 16, its take on Android 16, with upgrades to its Mind Space AI tool including complete integration with Google Gemini. ...

Key Takeaways:
- OnePlus' Mind Space AI tool now allows users to save longer screenshots and record voice memos to enhance AI capabilities.
- OxygenOS 16 integrates with Google Gemini, enabling tasks to be handled based on saved information.
- The update includes lock screen customizations, improved connectivity options, and design tweaks throughout the operating system.

Introducing a new open source benchmarking tool to measure AI for cybersecurity, grounded in real world scenarios. This is important work as we evaluate how well AI systems can reason to protect against cyberattacks. See the blog and the GitHub repo:

Key Takeaways:
- Measures AI's performance in goal decomposition, tool use, and evidence synthesis.
- Sets a higher standard for evaluating AI's effectiveness in defending against evolving threats.
- This initiative promotes transparency and collaboration in the AI for cybersecurity space, leading to more resilient and adaptable defense-ready AI.

I've tested free vs. paid AI coding tools - here's which one I'd actually use
Some developers pay big for AI coding tools. Others stick with free. Here's how to know when to spend - and when to save....

Key Takeaways:
- Free AI chatbots can handle small projects effectively, while paid AI tools provide serious productivity boosts for professional coders.
- The cost of AI tools is based on resource constraints, including session and token limits, rate and usage limits, and model access differences.
- AI tool cost economics should be considered when deciding whether to use a free or paid plan, depending on the size and scope of the project.

Researchers Find It’s Shockingly Easy to Cause AI to Lose Its Mind by Posting Poisoned Documents Online
"Poisoning attacks may be more feasible than previously believed."

Key Takeaways:
- A small number of poisoned documents can effectively compromise large AI models regardless of their size or training data.
- Attack success depends on the absolute number of poisoned documents, not the percentage of training data, making attacks potentially more feasible than previously believed.
- The study highlights significant risks to AI security and may limit its adoption in sensitive applications without improved defenses against poisoning attacks.

Scality Launches AI Certifications for 20+ Key Tools
Validates the full AI lifecycle, from ingestion to deployment, on a cyber-resilient foundation. Scality, a global leader in cyber-resilient storage s...

Key Takeaways:
- Scality's certification program covers the full AI development lifecycle, from data collection to inferencing, on a cyber-resilient foundation.
- The certification addresses the AI integration challenge by providing interoperability, documentation, and security, reducing the AI development timeline by 40-60%.
- Scality will expand the certification program as the AI ecosystem evolves, adding new tools and frameworks for enterprises and AI startups to navigate effectively.

Accelerate Qubit Research with NVIDIA cuQuantum Integrations in QuTiP and scQubits
NVIDIA cuQuantum is an SDK of libraries for accelerating quantum simulations at the circuit (digital) and device (analog) level. It is now integrated ...

Key Takeaways:
- Achieves a 4000x speedup from CPU to an 8x GPU node for transmon-resonator systems with the new qutip-cuquantum plugin.
- Supports scaling of simulations to much larger Hilbert spaces with multi-GPU and multi-node capabilities, enabling study of more complex quantum systems.
- Enables researchers to explore more complex composite qubit systems and develop new quantum devices with improved coherence times and performance.

This new 'Pixnapping' exploit can steal everything on your Android screen - even 2FA codes
The attack begins when a victim unknowingly installs a malicious app on their Google or Samsung phone....

Key Takeaways:
- Pixnapping can steal private data, including 2FA codes, from Android devices by exploiting existing APIs and a hardware side channel.
- The attack, which involves three stages, is partially patched, but a more complete fix is due in December's Android security bulletin.
- Experiments have shown successful 2FA code theft on Google Pixel phones, but not on a Samsung Galaxy S25 due to 'significant noise'.

Microsoft wants you to talk to your PC and let AI control it
As Microsoft bids farewell to Windows 10 and gets ready to mark the 40-year milestone of its operating system, it’s looking forward to what’s next for...

Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face

Key Takeaways:
- Google Cloud C4 VMs show a 1.7x Total Cost of Ownership (TCO) improvement over GCP C3 VMs for GPT OSS MoE inference.
- C4 VMs provide 1.4x to 1.7x throughput per vCPU over C3 VMs for large MoE models.
- The result underlines that large MoE models can be efficiently served on next-generation general-purpose CPUs, thanks to targeted framework optimizations from Intel and Hugging Face.
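
The headline's 70% figure and the 1.7x ratio in the takeaways appear to be the same number in two forms. A minimal sketch of the conversion, assuming "improvement" means the relative gain over the C3 baseline (the function name is illustrative and does not come from the article):

```python
def ratio_to_percent_gain(ratio: float) -> float:
    """Convert an improvement ratio (e.g. 1.7x) into a percent gain over the baseline."""
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    return (ratio - 1.0) * 100.0


# The two ratios quoted in the takeaways above.
for ratio in (1.4, 1.7):
    print(f"{ratio}x -> {ratio_to_percent_gain(ratio):.0f}% improvement")
```

Read this way, the 1.4x to 1.7x throughput range corresponds to roughly a 40% to 70% gain per vCPU.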

Google’s AI video generator is getting better editing and more audio
Google is making videos created with the AI filmmaking tool Flow even more realistic — and harder to identify as AI-generated at first glance. The com...

Key Takeaways:
- Users can now add and change shadows and lighting in AI-generated videos using Flow.
- Flow's new features allow for video generation with audio using multiple reference images or starting and ending images.
- Google's Veo 3.1 update, tied to the Flow features, does a better job of making a video based on submitted images and costs the same as Veo 3.

I unleashed Copilot on my Microsoft and Google accounts - here's what happened
With a new update rolling out to Windows Insiders, Copilot can access your Microsoft and Google accounts to work with your email, calendar, and contac...

Key Takeaways:
- Copilot can now connect to external services like OneDrive, Outlook, Gmail, Google Calendar, and Google Drive, expanding its capabilities beyond Microsoft services.
- The integration makes it possible for users to access and interact with content across different services, such as files, emails, and calendar appointments.
- The feature is currently rolling out to Windows 11 Insiders and may take a few months to reach all Windows 11 users.

Even the best AI agents are thwarted by this protocol - what can be done
The increasingly popular Model Context Protocol lets AI models access applications, but studies show that the best generative AI bots struggle with pl...

Key Takeaways:
- Bigger AI models tend to perform better than smaller models on MCP-related challenges, but all models struggle with increasing complexity and multi-server interactions.
- Fine-tuning AI models specifically for MCP can improve their performance and adaptability, but may not address all challenges, particularly with non-public or non-standard resources.
- The development of new benchmarks, datasets, and training methods is necessary to push the boundaries of what's possible with MCP-enabled AI models.

Video Overviews on NotebookLM get a major upgrade with Nano Banana
NotebookLM's Video Overviews get an upgrade with visuals powered by Nano Banana and a new "Brief" format for quick summaries....

Key Takeaways:
- New video overviews feature six customizable visual styles, including Watercolor, Papercraft, and Anime.
- Two formats are now available: Explainer (structured, comprehensive videos) and Brief (bite-sized, quick-grasp videos).
- The update is expected to roll out to all NotebookLM users in supported languages in the upcoming weeks.

Slack is turning Slackbot into an AI assistant
Slack is testing an update for Slackbot that transforms it into an AI assistant. Presently, it operates as a tool for delivering reminders and notific...

Key Takeaways:
- The updated Slackbot will draw from users' conversations, files, and workspace to provide personalized assistance.
- Individuals in a workspace will not be able to opt out of using the AI Slackbot, but companies can choose to do so.
- Slack plans to roll out the feature to all users by the end of the year, initially available for 70,000 employees at Salesforce and other customers in a pilot.

It's trivially easy to poison LLMs into spitting out gibberish, says Anthropic

Key Takeaways:
- Only 250 malicious documents, amounting to 0.00016% of the model's total training data, are needed to compromise a model with 13 billion parameters.
- Cleansing and safeguarding the training input becomes a critical step, requiring constant scrutiny and attention.
- Current LLM-based AIs lack self-guided correction processes and rely on human testing and curation, implying the need for significant investment and expertise.
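
The point about a fixed count versus a share of the data can be made concrete with a little arithmetic. The sketch below holds the poisoned count at the 250 documents quoted above and shows how small that share becomes as a corpus grows. The corpus sizes are hypothetical, and counting in documents is an assumption, since the summary does not say which unit the 0.00016% figure is measured in.

```python
def poisoned_share_percent(poisoned: int, corpus_size: int) -> float:
    """Return the poisoned items as a percentage of the whole corpus."""
    if corpus_size <= 0:
        raise ValueError("corpus_size must be positive")
    return poisoned / corpus_size * 100.0


POISONED_DOCS = 250  # figure quoted in the takeaways above

# Hypothetical corpus sizes, chosen only to illustrate the trend.
for corpus_size in (1_000_000, 100_000_000, 10_000_000_000):
    share = poisoned_share_percent(POISONED_DOCS, corpus_size)
    print(f"{corpus_size:>14,} items: {share:.7f}% poisoned")
```

If the attack depends on the absolute count, the shrinking percentage offers no protection, which is why larger training sets do not dilute the risk.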

Build a Log Analysis Multi-Agent Self-Corrective RAG System with NVIDIA Nemotron
Logs are the lifeblood of modern systems. But as applications scale, logs often grow into endless walls of text—noisy, repetitive, and overwhelming. H...

Key Takeaways:
- The solution can be used by various teams such as QA, Engineering, DevOps, CloudOps, and Platform/Observability managers to quickly pinpoint issues and improve productivity.
- The system combines a retrieval-augmented generation (RAG) pipeline with a graph-based multi-agent workflow to unify heterogeneous log streams and surface the most relevant snippets.
- The solution can be extended into other areas such as bug reproduction automation, observability dashboards, and cybersecurity pipelines, reducing mean time to resolve (MTTR) and improving developer productivity.

AI ‘workslop’ is creating unnecessary extra work. Here’s how we can stop it
When applied to the right tasks, with appropriate human oversight, AI can enhance performance. Here are three simple steps to get the most out of the ...

Key Takeaways:
- 66% of employees rely on AI output without evaluation, leading to 'workslop'
- Workslop can result in lost productivity, reputational hits, and corrode collaboration and trust
- AI literacy and selective use of AI can reduce 'workslop' and improve collaboration

Gemini 3.0 Pro is already referenced in Gemini's source code
If you are still skeptical or think the screenshot is fake, here is a direct link to a gstatic JS source: [https://www.gstatic.com/\_/mss/boq-bard-web/\_/...

Restb.ai Bridges Data Gap for Smarter AI-Powered Home Search
450+ RESO data points can now be completed in seconds with computer vision. A major step forward in AI-powered home search is underway, as Restb.ai’s n...

Key Takeaways:
- Restb.ai's AI-powered computer vision can complete over 450 RESO-standardized data points in seconds.
- MLSs can now extract and tag property features directly from photos, reducing manual input errors.
- The technology aims to provide more complete and accurate listing data, streamlining the home search process.

Inside the web infrastructure revolt over Google’s AI Overviews - Ars Technica

Key Takeaways:
- Google's AI Overviews have been cutting referrals by nearly 50% for many websites, according to studies from Pew Research Center and The Wall Street Journal.
- Cloudflare's Content Signals Policy lets website operators opt in to or out of specific use cases, including search, ai-input, and ai-train.
- Cloudflare's policy may force Google to change its bundling of traditional search crawlers and AI Overviews, potentially setting a new standard for the web.
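
As a rough illustration of how a crawler might read these signals, here is a minimal sketch. It assumes the policy is published as a `Content-Signal:` line in robots.txt with comma-separated `name=yes` or `name=no` pairs. That line syntax and the `parse_content_signals` helper are assumptions made for illustration; only the three signal names come from the summary above.

```python
def parse_content_signals(robots_txt: str) -> dict[str, bool]:
    """Collect yes/no signals from every Content-Signal line (assumed syntax)."""
    signals: dict[str, bool] = {}
    for raw_line in robots_txt.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw_line.split("#", 1)[0].strip()
        key, sep, value = line.partition(":")
        if not sep or key.strip().lower() != "content-signal":
            continue
        for pair in value.split(","):
            name, eq, setting = pair.partition("=")
            setting = setting.strip().lower()
            if eq and setting in ("yes", "no"):
                signals[name.strip().lower()] = setting == "yes"
    return signals


# Hypothetical robots.txt that allows search but declines both AI uses.
EXAMPLE = """\
User-agent: *
Content-Signal: search=yes, ai-input=no, ai-train=no
Allow: /
"""

print(parse_content_signals(EXAMPLE))
```

A signal that is absent from the result is simply unstated, so a caller should treat a missing key differently from an explicit `False`.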

Agentic AI Unleashed: Join the AWS & NVIDIA Hackathon
Build the next generation of intelligent, autonomous applications. This isn't just a hackathon—it's your chance to unleash the power of agentic AI and...

Improve Variant Calling Accuracy with NVIDIA Parabricks
Built for data scientists and bioinformaticians, NVIDIA Parabricks is a scalable genomics software suite for secondary analysis. Providing GPU-acceler...

Key Takeaways:
- Parabricks v4.6 offers over 8x speedup in STAR quantification compared to CPU-only solutions on two NVIDIA RTX PRO 6000 GPUs.
- DeepVariant with pangenome-aware mode reduces errors by up to 25.5% across all settings compared to linear-reference-based DeepVariant.
- Giraffe and DeepVariant combination provides a 14x speedup in runtime compared to CPU-only Giraffe and DeepVariant with pangenome-aware mode on four NVIDIA RTX PRO 6000 GPUs.

I tried a Linux distro that promises free, built-in AI - and things got weird
Gnoppix is for those who are into AI and want to try Linux. But it might test your patience....

Key Takeaways:
- Gnoppix includes a large collection of apps, making it suitable for general users.
- The distribution runs smoothly with the KDE Plasma desktop environment.
- However, the AI package (gnoppix-ai) is currently broken and should be avoided.

Google AI Releases C2S-Scale 27B Model that Translates Complex Single-Cell Gene Expression Data into ‘cell sentences’ that LLMs can Understand
A team of researchers from Google Research, Google DeepMind, and Yale released C2S-Scale 27B, a 27-billion-parameter foundation model for single-cell ...

Qualifire AI Releases Rogue: An End-to-End Agentic AI Testing Framework, Evaluating the Performance of AI Agents
Agentic systems are stochastic, context-dependent, and policy-bounded. Conventional QA—unit tests, static prompts, or scalar “LLM-as-a-judge” scores—f...

New coding models & integrations
GLM-4.6 and Qwen3-coder-480B are available on Ollama’s cloud service with easy integrations to the tools you are familiar with. Qwen3-Coder-30B has be...

My New Developer Workstation: NVIDIA DGX Spark
When NVIDIA asked if we wanted to test the new DGX Spark as a daily driver, I said yes immediately....

Anthropic Launches Claude Haiku 4.5: Small AI Model that Delivers Sonnet-4-Level Coding Performance at One-Third the Cost and more than Twice the Speed
Anthropic released Claude Haiku 4.5, a latency-optimized “small” model that delivers similar levels of coding performance to Claude Sonnet 4 while run...

Alibaba’s Qwen AI Releases Compact Dense Qwen3-VL 4B/8B (Instruct & Thinking) With FP8 Checkpoints
Do you actually need a giant VLM when dense Qwen3-VL 4B/8B (Instruct/Thinking) with FP8 runs in low VRAM yet retains 256K→1M context and the full capa...

Andrej Karpathy Releases ‘nanochat’: A Minimal, End-to-End ChatGPT-Style Pipeline You Can Train in ~4 Hours for ~$100
Andrej Karpathy has open-sourced nanochat, a compact, dependency-light codebase that implements a full ChatGPT-style stack—from tokenizer training to ...

NVIDIA Researchers Propose Reinforcement Learning Pretraining (RLP): Reinforcement as a Pretraining Objective for Building Reasoning During Pretraining
NVIDIA AI has introduced Reinforcement Learning Pretraining (RLP), a training objective that injects reinforcement learning into the pretraining stage...

ServiceNow AI Research Releases DRBench, a Realistic Enterprise Deep-Research Benchmark
ServiceNow Research has released DRBench, a benchmark and runnable environment to evaluate “deep research” agents on open-ended enterprise tasks that ...

Meta’s ARE + Gaia2 Set a New Bar for AI Agent Evaluation under Asynchronous, Event-Driven Conditions
Meta AI has introduced Agents Research Environments (ARE), a modular simulation stack for creating and running agent tasks, and Gaia2, a follow-up ben...

Salesforce launches Agentforce 360 AI platform to boost software products - Reuters

Resistant AI Raises $25 Million in Series B Funding to Empower AI Agents to Fight Fraud and Fincrime - FF News | Fintech Finance

Tabby Invests in NVIDIA HGX Systems to Power Advanced AI Infrastructure - FF News | Fintech Finance

Whales Are Betting on Ozak AI: Could Retail Investors Become the Next Millionaires? - livebitcoinnews.com

Lloyds Banking Group Pioneers AI Leadership Training With Cambridge Partnership - FF News | Fintech Finance

Did AI-Powered Cybersecurity Launch and Strong Results Just Shift NetScout Systems' (NTCT) Investment Narrative? - simplywall.st

Naver likely to face massive copyright suits over use of news for AI - The Korea Herald

How AI is transforming New Jersey’s accounting firms - NJBIZ

It’s AI Introduces Word-Level AI Detection for Greater Transparency in Text Analysis - FinancialContent

Google Introduces Speech-to-Retrieval (S2R) Approach that Maps a Spoken Query Directly to an Embedding and Retrieves Information without First Converting Speech to Text
Google AI Research team has brought a production shift in Voice Search by introducing Speech-to-Retrieval (S2R). S2R maps a spoken query directly to a...

From roadside to research: NRMA’s in-house legal team embraces AI - The Guardian

Ant Group Unveils Trillion-Parameter AI Model Ling-1T - FinTech Weekly

IBM Unveils Real-Time Monitoring for AI Agents to Boost Productivity - Small Business Trends

The 40 jobs 'most at risk of AI' - and 40 it can't touch - Sky News

Sentient AI Releases ROMA: An Open-Source and AGI Focused Meta-Agent Framework for Building AI Agents with Hierarchical Task Execution
Sentient AI has released ROMA (Recursive Open Meta-Agent), an open-source meta-agent framework for building high-performance multi-agent systems. ROMA...

Cursor Levels Up With 1.0 Release, Adding MCP Support and Persistent Memory
Cursor 1.0 is here, featuring BugBot for seamless GitHub code reviews, asynchronous Background Agents, collaborative Jupyter support, and more! Read Al...

AI Systems Can Be Fooled by Fake Dates, Giving Newer Content Unfair Visibility - Digital Information World

What FIS's New AI Partnership Means for Shareholders and the Future of Digital Banking - simplywall.st

Using a swearword in your Google search can stop the AI answer. But should you?
Artificial intelligence is more than Trump deepfakes or Tilly the actor. It’s used in smartphones, customer service, healthcare – even legal cases. Is...

Key Takeaways:
- AI is increasingly widespread in various industries, including healthcare, customer service, finance, and social media, raising concerns about privacy leakage, discrimination, and malicious use.
- A global study found that half of Australians use AI regularly, but only 36% trust it, highlighting the need for transparent regulations and governance.
- Experts warn that AI poses an existential risk due to its potential for misuse, and that it's getting harder to avoid AI in modern life, with some suggesting that a 'kill switch' may no longer be a viable option.

Trending AI Repos & Tools
The fastest, most affordable coding model
Ultra‑native Windows editor with WSL, extensions, and AI
nanobrowser (10338 stars): Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator....
Sora2Web is an online tool that harnesses the power of OpenAI's Sora 2 model to transform your text and images into stunning, high-definition videos...
claude-code (38735 stars): Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, ex...
Build AI automations & agents using natural language
coze-studio (17830 stars): An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way ...
PrompTessor is a comprehensive platform for analyzing, improving, and optimizing AI prompts to unlock the full potential of LLMs...
PaddleOCR (57842 stars): Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs...
minimind (28872 stars): 🚀🚀 Train a 26M-parameter GPT from scratch in just 2h!...
java-sdk (2610 stars): The official Java SDK for Model Context Protocol servers and clients. Maintained in collaboration with Spring AI...
Qwen3-VL (14947 stars): Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud....
Anthropic's Interactive Prompt Engineering Tutorial...
coder/blink (21 stars): Blink is a tool for building and sharing AI agents....
spring-ai-alibaba (6426 stars): Agentic AI Framework for Java Developers...
n8n-mcp (8679 stars): An MCP for Claude Desktop / Claude Code / Windsurf / Cursor to build n8n workflows for you...
Evaluate AI workflows and reach 99% AI quality.
system_prompts_leaks (22761 stars): Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini...
MineContext (1836 stars): MineContext is your proactive context-aware AI partner (Context-Engineering+ChatGPT Pulse)...
Dottle AI: Create websites with intuitive design prompts, iterative refinement, and seamless code export. Generate and customize design concepts for ef...
CLI tool for configuring and monitoring Claude Code...
llama.cpp (87765 stars): LLM inference in C/C++...
lobe-chat (66801 stars): 🤯 Lobe Chat - an open-source, modern design AI chat framework. Supports multiple AI providers (OpenAI / Claude 4 / Gemini / DeepSeek / Ollama / Qwen),...
AI Humanizer is a free AI-to-human text converter that transforms robotic AI writing into natural, readable, and undetectable content...
Repository-level Repair Agent Based on SWE-Bench: JoyCode Agent...
stagehand (18265 stars): The AI Browser Automation Framework...
RD-Agent (8424 stars): Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are ...
Flowise (45297 stars): Build AI Agents, Visually...
llm-cookbook (21661 stars): An introductory LLM tutorial for developers; the Chinese edition of Andrew Ng's large language model course series...
Prompt-Engineering-Guide (64118 stars): 🐙 Guides, papers, lecture, notebooks and resources for prompt engineering...
MinerU (46507 stars): Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows....

Community talk
Nvidia breakthrough gives 4-bit pretraining technique the accuracy of FP8
LlamaBarn — A macOS menu bar app for running local LLMs (open source)
Claude Code asking clarifying questions with a new UI
It's not just "Skills" - Claude now has a full Linux development environment built-in
Meta just dropped MobileLLM-Pro, a new 1B foundational language model on Huggingface
new 1B LLM by meta
Real-time study buddy that sees your screen and talks back
Qwen3-VL-4B and 8B Instruct & Thinking are here
Claude Code users, with new limits: what would you love to see next?
Fully functional native FP4 training finally released
Open-source RAG routes are splintering — MiniRAG, Agent-UniRAG, SymbioticRAG… which one are you actually using?
[P] Control your house heating system with RL
If I share information with ChatGPT in a chat (while asking a question), can that data be used to answer someone else’s question?
[R] Plain English outperforms JSON for LLM tool calling: +18pp accuracy, -70% variance
Best Open Source TTS That Sounds Most Natural Voice For Storytelling? That You Can Run With 12GB Vram?
Since DGX Spark is a disappointment... What is the best value for money hardware today?
Looking for tools that can track my ai agent trajectory and also llm tool calling
I put Sora 2 directly inside Premiere Pro
Qwen3-30B-A3B FP8 on RTX Pro 6000 blackwell with vllm
I fine-tuned Qwen3-VL (4B & 8B) on a free Colab instance using TRL (SFT and GRPO)!
GLM 4.5 Air AWQ 4bit on RTX Pro 6000 with vllm
Going from the Claude app to Claude Code and my mind is blown!
Poor GPU Club : 8GB VRAM - MOE models' t/s with llama.cpp
My first 15 days with GLM-4.6 — honest thoughts after using Opus and Sonnet
[P] Nanonets-OCR2: An Open-Source Image-to-Markdown Model with LaTeX, Tables, flowcharts, handwritten docs, checkboxes & More
[Update] Qwen3-VL cookbooks coming — recognition, localization, doc parsing, video
Quick Guide: Running Qwen3-Next-80B-A3B-Instruct-Q4_K_M Locally with FastLLM (Windows)
[Open Source] We built a production-ready GenAI framework after deploying 50+ agents. Here's what we learned 🍕
What’s the point of a DGX Spark for inference if a Mac Studio M1 Ultra beats it at TG and equals it at PP at half the price?
I stopped asking my AI for "answers" and started demanding "proof," it's producing insane results with these simple tricks.
I tested if tiny LLMs can self-improve through memory: Qwen3-1.7B gained +8% accuracy on MATH problems
[D] TEE GPU inference overhead way lower than expected - production numbers
Significant speedup for local models
4x4090 build running gpt-oss:20b locally - full specs
Multi-modal RAG at scale: Processing 200K+ documents (pharma/finance/aerospace). What works with tables/Excel/charts, what breaks, and why it costs way more than you think
Dolphin X1 8B (Llama3.1 8B decensor) live on HF
Open source streaming STT (Parakeet + Silero + Pipecat Smart Turn)
GLM 4.6 UD-Q6_K_XL running llama.cpp RPC across two nodes and 12 AMD MI50 32GB
Stop writing prompts. Start building systems.
Claude Performance and Bug Report with Workarounds - October 5 to October 12
PSA: Ollama no longer supports the Mi50 or Mi60
Optimize my environment for GLM 4.5 Air
Choosing a code completion (FIM) model
Poor GPU Club : Anyone use Q3/Q2 quants of 20-40B Dense models? How's it?
How do I compare cost per token for serverless vs provisioned hardware?
Quality degradation of fp8 quantization?
What exactly is happening in AI?
My evolving AI dev stack: combining spec planning + coding + reviews - inspired by a16z's "The Trillion Dollar AI Software Development Stack"
Building a multi-agent financial bot using Agno, Maxim, and YFinance
Benchmarking LLM Inference on RTX 4090 / RTX 5090 / RTX PRO 6000 #2
[AutoBE] achieved 100% compilation success of backend generation with "qwen3-next-80b-a3b-instruct"
How do You Handle LLM Token COST?
We can now run wan or any heavy models even on a 6GB NVIDIA laptop GPU | Thanks to upcoming GDS integration in comfy
I made a multimodal local RAG system with LM Studio
Is there anything faster or smaller with equal quality to Qwen 30B A3B?
Ethical prompting challenge: How to protect user anonymity when their biometric identity is easily traceable.
Hot Take: Sonnet 4 on launch was better than Sonnet 4.5 now
windows 11 is starting to listen to you. literally.
Improving low VRAM performance for dense models using MoE offload technique
Reviewing Claude Code changes is easier on an infinite canvas
Merry Christmas ya gooners
File editing success rate per model
Perplexity is fabricating medical reviews and their subreddit is burying anyone who calls it out
Cameo for AI characters / subjects that you can create yourself and can later recall across videos, are coming soon to Sora app. How exciting!
Sora prompting thread 🧵
Just had a session this morning and Haiku 4.5 session limits feel significantly better, possibly 2x to 2.5x Sonnet 4.5 by my estimates
LLama.cpp GPU Support on Android Device
Heads-up: Poorly designed MCPs can silently drain your token quota
[R]: Create a family of pre-trained LLMs of intermediate sizes from a single student-teacher pair
Sonnet is very good at watching videos
I tested 1,000 ChatGPT prompts in 2025. Here's the exact formula that consistently beats everything else (with examples)
Claude’s file upload limit dropped from 6% to 4% — now I can’t work. Any workarounds?
Which Format is Best for Passing Nested Data to LLMs?
A guide to the best agentic tools and the best way to use them on the cheap, locally or free
I built sub-agents that actually keep my context clean
GPT-OSS-20b TAKE THE WHEEL!
gpt-oss20/120b AMD Strix Halo vs NVIDIA DGX Spark benchmark
Build Lovable for Claude Code users. No Costs.
Claude Code taking a coffee break 🤔
Daily install trends of AI coding tools in Visual Studio Code (including Claude Code)
Understanding Claude Code's 3 system prompt methods (Output Styles, --append-system-prompt, --system-prompt)
How OpenAI's Apps SDK works
I'm building a hotkey tool to make ChatGPT Plus actually fast. Roast my idea.
How do teams handle using multiple AI APIs? and is there a better way?
[D] Why are Monte Carlo methods more popular than Polynomial Chaos Expansion for solving stochastic problems?
Looking to connect with AI teams actively sourcing consent-based location & demographic datasets
Greg Brockman on AI-designed chips and the future of compute
Has anyone here hooked up Claude Desktop with a local MCP server so it can fully understand and operate on your project (files, terminal, codebase)? How stable is that setup in real use?
Get ChatKit to ask a series of predefined questions
Used Sora 2 to create this short Sci-fi anime story about life with around 16 clips, along with elevenlabs for voiceover and Suno for music. Hollywood and animation houses are in for a big surprise and so is the general public. Would you watch more videos like this?
Who is approving these Claude Code updates? (It's broken, downgrade immediately)
Building highly accurate RAG -- listing the techniques that helped me and why
Benchmarking small models at 4bit quants on Apple Silicon with mlx-lm
How do you create modern UIs?
"Steerable Scene Generation with Post Training and Inference-Time Search"
🚀 I’ve been documenting everything I learned about Claude Code
[P] Adapting Karpathy’s baby GPT into a character-level discrete diffusion model
Asking Claude Code to self-reflect is a nice unlock
Claude CLI, Codex CLI, and Gemini CLI: Beasts Together Using Zen MCP
Claude Agent SDK
It’s not the model, it’s the prompt: Why ChatGPT UI feels different from API
Prompts I keep reusing because they work.
LM Studio + Open-WebUI - no reasoning
I've Built 10+ AI Agent Networks with OpenAgents. Here's What Everyone Misses.
Adding search to open models
🚀 Struggling to Write Effective Prompts? Try This AI Prompt Enhancement Framework (PEEF)
Kwaipilot/KAT-Dev-72B-Exp seems to be a great coding model?
built a tool to let Claude, Codex, Q, and Gemini share context instead of working in silos
Txt or Md file best for an LLM
Looks like our automated overlords have arrived.
microsoft/UserLM-8b - Unlike typical LLMs that are 'assistant', they trained UserLM-8b to be the 'user' role
KRAR
💡 6 ChatGPT Prompt Frameworks for Writing the Perfect Prompts (Copy + Paste)
Turn off your MCPs
Session Limit Hit Prematurely [75%]
Why is there still no simple way to just save and reuse our own AI prompts?
Has Claude changed? It used to feel natural — now it’s stiff and overcomplicated
This guy literally explains how to build your own ChatGPT (for free)
Why does it switch models
Made with open source software, what will it be like in a year?
Gemini 3.0 Pro: Retro Nintendo Sim one shot – with proof & prompt
I started using Claude Code to hunt bots on my subreddit. I now find one every three days.
So I realized that /clear and /compact are not ideal. We need a /shift option to slice off the beginning of the conversation.
Learning the ai language across models
If you even slightly know what you're doing, Claude's subagents are its real magic
Claude and I made a tool to save our conversations
High quality code - demands high quality input
How I stopped killing side projects and shipped my first one in 10 years with the help of Claude 4.5
Generative Engine Optimization (GEO) and Answer Engine Optimization (AEO) – anyone optimizing for this yet?
I tried to build a prompt that opened the black box. Here's what actually happened
Detect over-compressed images in a dataset? [P]
How to see REAL usage impacts (new workaround). like why does it jump from 3% to 6% for small commands etc.
Tokenized
The real bottleneck in Artificial Intelligence is going to be in how they're implemented, and not the Artificial Intelligence Model itself.
Sora 2 video generation stuck preventing more generations
[FREE] Nano Canvas: Generate Images on a canvas
Every single COT terms score.
Did Anthropic just do something stupid? All MCP servers stopped working in Claude Desktop because of settings folder name change.
I preferred the model selection from top to be honest. NEW UPDATE
The only choice in claude code is sonnet 4.5?
‘This app can’t run on your PC’ — getting this error when trying to run Claude from VS Code terminal
Continuing conversations
Has anyone found a working workaround for the filters
Someone convince me mobile dev isn’t just theater - Happy, SSH, whatever - what’s the actual point?
Historical Events as Kid's Toys
Claude Code Context Window Issue
Changing a single apostrophe in prompt causes radically different output
Anyone interested in 1 Billion Parameters context management tool?
So now this is gone too?
At what point does prompt engineering stop being “engineering” and start being “communication”?