AI/TLDR — New AI Releases Daily: Models, Tools, Repos & PapersA high-volume feed of new AI releases — models, open-source repos, developer tools, papers, datasets, and benchmarks — refreshed every 2 hours. Each release is explained in plain English so you actually understand what shipped.This site uses JavaScript to render the interactive feed. Enable JavaScript, or visit the source repo for the raw JSON.

AI/TLDR

AI/TLDR — every new AI model, tool, repo & paper

The latest AI releases, refreshed every 2 hours and explained in plain English.

What AI shipped today?

In the last 24 hours AI/TLDR tracked 9 new AI releases, including Gemini Omni Flash + Nano Banana 2 Lite — Google's new video and image models, 1littlecoder: 'Claude Sonnet 5 in 12 mins!' and Claude Sonnet 5 — Anthropic's new agentic Sonnet at Opus-class quality. AI/TLDR is an AI release tracker that follows new AI models, open-source tools, papers, datasets and benchmarks — refreshed every 2 hours from verified primary sources and explained in plain English.

AI Release Index — live stats on AI releases · Learn AI

Gemini Omni Flash + Nano Banana 2 Lite — Google's new video and image modelsGoogle · 2026-06-30 · model
Google launches Gemini Omni Flash, a $0.10/sec video model with conversational editing, alongside Nano Banana 2 Lite, an image model that ships a result in 4 seconds at $0.034 each.
1littlecoder: 'Claude Sonnet 5 in 12 mins!'1littlecoder · 2026-06-30 · video
1littlecoder publishes a 12-minute walkthrough of Claude Sonnet 5, Anthropic's new agentic Sonnet that approaches Opus 4.8 quality at Sonnet pricing.
Claude Sonnet 5 — Anthropic's new agentic Sonnet at Opus-class qualityAnthropic · 2026-06-30 · model
Claude Sonnet 5 is Anthropic's most agentic Sonnet yet, with a 1M-token context and adaptive thinking. It targets Opus 4.8 quality at lower cost and is now the default for Free and Pro plans.
Claude Code is steganographically marking requests — hidden prompt fingerprintsThereallo · 2026-06-30 · article
Researcher Thereallo found that Claude Code silently rewrites its system prompt with steganographic markers when ANTHROPIC_BASE_URL is set, encoding proxy hostnames against an XOR-obfuscated competitor list.
Claude Science — Anthropic's AI workbench for life-sciences researchAnthropic · 2026-06-30 · tool
Claude Science is an Anthropic desktop app that gives life-sciences researchers one workbench for code, 60+ scientific databases, native protein and genome rendering, and a reviewer agent that catches citation and calculation errors.
Sam Witteveen: 'Introducing the Gemini Omni Flash API'Sam Witteveen · 2026-06-30 · video
Sam Witteveen walks through the Gemini Omni Flash API — Google DeepMind's multimodal video-generation model, now reachable from code as Google opens its developer rollout.
Agents-A1 — Shanghai AI Lab 35B MoE matches trillion-parameter agentsShanghai AI Laboratory · 2026-06-30 · model
Agents-A1 is an open-weight 35B Mixture-of-Experts agent from Shanghai AI Laboratory. It posts SOTA scores on SEAL-0 (56.4), FrontierScience-Research (40.0), and IFBench (80.6), and the paper claims parity with trillion-parameter agents.
LongCat-2.0 — Meituan's 1.6T open-source MoE for agentic codingMeituan · 2026-06-30 · model
Meituan released LongCat-2.0, a 1.6T-parameter open-source mixture-of-experts model with ~48B active per token. It scores 59.5 on SWE-bench Pro and 70.8 on Terminal-Bench 2.1, trained entirely on Chinese AI ASICs under MIT license.
Quesma: 'Qwen3.6 27B is the sweet spot for local development'Piotr Migdał · 2026-06-29 · article
Piotr Migdał argues Qwen3.6 27B is the local-dev sweet spot: ~32 tok/s on a MacBook M5 Max with 8-bit llama.cpp, fits in 42GB RAM, and reaches roughly mid-2025 frontier quality. The post hit 875 points on the Hacker News front page.
Simon Willison: Ornith-1.0 — hands-on with the open-weights coding modelSimon Willison · 2026-06-29 · article
Simon Willison runs DeepReinforce's new Ornith-1.0 — an MIT-licensed coding model family (9B, 31B, 35B MoE, 397B MoE) built on Gemma 4 and Qwen 3.5 — through LM Studio and a Pi agent loop on a Datasette codebase.
Cursor for iOS — native mobile app for cloud and remote coding agentsCursor · 2026-06-29 · tool
Cursor for iOS is now in public beta on the App Store, letting paid users launch cloud agents, remote-control their desktop Cursor, and merge pull requests from a phone.
Brain2Qwerty v2 — Meta non-invasive brain-to-text hits 61% word accuracyMeta AI Research · 2026-06-29 · paper
Brain2Qwerty v2 decodes typed sentences from MEG brain recordings at 61% word accuracy, up from 8% for prior non-invasive methods. Meta trained the model on 22,000 sentences from 9 volunteers, with code and the paper now public.
Cline 4.0 — SDK rewrite rolled back to 3.89 two days after launchCline · 2026-06-26 · repo
Cline 4.0.0 migrated the VS Code extension to a shared SDK session layer with a Plugins marketplace, ClinePass billing, queued chat, and edit-and-regenerate — but launch-day regressions led to a 4.0.1 rollback to the 3.89.2 codebase two days later.
Weave Router — drop-in proxy that picks the right LLM per requestWeave · 2026-06-27 · tool
Weave Router is an open-source proxy for Claude Code, Codex, and Cursor that scores each prompt with an on-box ONNX embedder and routes it to the best model across Anthropic, OpenAI, Gemini, and OpenRouter providers in under 50ms.
cognee v1.2.2 — truth-subspace reranking for the open-source agent memory platformtopoteretes · 2026-06-26 · repo
cognee v1.2.2 adds truth-subspace reranking: an opt-in retrieval layer that builds centroids from distilled session lessons so the open-source agent memory platform reorders search hits toward what its graph has already learned.
CVE-2026-LGTM — Andrew Nesbitt's satirical AI supply-chain incident reportAndrew Nesbitt · 2026-06-26 · article
Andrew Nesbitt's satirical post-mortem walks a fake malicious npm package past seven AI security gates that each fail for a different reason, dramatizing correlated LLM blind spots and prompt-injection in automated code review.
GPT-5.6 rollout delayed — US government will vet every customerOpenAI · 2026-06-26 · ecosystem
OpenAI postponed the broad GPT-5.6 launch at the Trump administration's request, limiting initial access to about 20 government-vetted partners. The Office of the National Cyber Director will approve customers one by one.
Wes Roth: 'HERMES AGENT + Stripe Payments + NVIDIA Nemotron is INSANE!'Wes Roth · 2026-06-27 · video
Wes Roth covers three fresh AI ship-events at once: Nous Research's Hermes Agent, the new Stripe Payments integration for agents, and the latest NVIDIA Nemotron release — and explains how they fit together for builders.
Simon Willison: '2,000 people tried to hack my AI assistant'Simon Willison · 2026-06-26 · article
Simon Willison covers Fernando Irarrázaval's HackMyClaw challenge, where 2,000 participants sent 6,000 email-based prompt injection attempts at a Claude Opus 4.6 assistant. The $1,000 bounty went unclaimed — no one extracted the protected secret.
Codex Remote GA — control desktop Codex from ChatGPT mobileOpenAI · 2026-06-25 · tool
Codex Remote is generally available on all ChatGPT plans. Mobile users start or continue Codex work on a paired Mac or Windows host, with one-to-one QR pairing and a new DigitalOcean Droplet workspace plugin for ad-hoc cloud boxes.
GPT-4.5 retired from ChatGPT — end of the GPT-4 era in the appOpenAI · 2026-06-26 · ecosystem
OpenAI removed GPT-4.5 from ChatGPT on June 26, 2026, including from custom GPTs. Existing GPT-4.5 conversations continue on GPT-5.5. The change applies only to ChatGPT — the OpenAI API still serves every GPT-4 model unchanged.
DSpark + DeepSpec — DeepSeek opens its speculative decoding stackDeepSeek · 2026-06-26 · repo
DeepSeek released DeepSpec, an MIT-licensed codebase to train and evaluate draft models for speculative decoding, plus DSpark speculative-decoding modules attached to its V4-Pro and V4-Flash checkpoints on Hugging Face.
Anthropic Economic Index: Cadences — Claude usage hour by hourAnthropic · 2026-06-26 · resource
Anthropic's June 2026 Economic Index report, called Cadences, samples Claude usage by the hour. Personal chats rise from ~35% on weekdays to ~50% on weekends, sleep questions peak around 5 a.m., recipes around 6 p.m.
Claude Mythos 5 restored — US Commerce lifts block for 100+ trusted partnersAnthropic · 2026-06-26 · ecosystem
Claude Mythos 5 is back for 100+ pre-approved US institutions after the US Commerce Department lifted its two-week export block. Anthropic can now ship Mythos 5 to the Annex A trusted-partner list without a license.
GPT-5.6 — OpenAI previews Sol, Terra, and Luna tiersOpenAI · 2026-06-26 · model
OpenAI announced GPT-5.6 with three named tiers — Sol (flagship), Terra (balanced), and Luna (cheap and fast) — adding new max and ultra reasoning modes. Access starts as a limited preview for trusted partners.
1littlecoder: 'GPT 5.6 — What, Availability, Pricing'1littlecoder · 2026-06-26 · video
1littlecoder walks through OpenAI's same-day GPT-5.6 announcement — the Sol, Terra, and Luna tiers, the new max and ultra reasoning modes, the published per-million-token prices, and who actually gets access during the limited preview.
Ornith 1.0 — open-weight coding models that learn their own RL scaffoldDeepReinforce · 2026-06-25 · model
Ornith 1.0 is an MIT-licensed family of agentic coding LLMs (9B, 31B, 35B MoE, 397B MoE) whose RL loop writes its own task-specific scaffold instead of using a fixed human-designed harness.
Sam Witteveen: 'Introducing Ornith 1.0' — open-weight coding LLM walkthroughSam Witteveen · 2026-06-26 · video
Sam Witteveen walks through Ornith 1.0, DeepReinforce's MIT-licensed coding model family whose RL loop learns its own scaffold — uploaded hours after the 9B–397B weights landed on Hugging Face.
Un-0 — image generator built from coupled oscillators, not a neural netUnconventional AI · 2026-06-25 · algorithm
Un-0 generates images by simulating a population of Kuramoto oscillators whose pairwise coupling strengths are the learned parameters. It hits FID 6.74 on ImageNet 64x64 with MIT weights and training code released.
Lilian Weng: 'Scaling Laws, Carefully' — first new Lil'Log post in 13 monthsLilian Weng · 2026-06-24 · article
Lilian Weng walks through scaling laws end-to-end: why Kaplan and Chinchilla reached opposite conclusions, how parameter counting and fit region change the answer, and what data scarcity means for the curves now.
GitHub Copilot for Jira — GA brings agent streaming inside AtlassianGitHub · 2026-06-25 · tool
GitHub Copilot for Jira is generally available, letting teams watch the coding agent run on a Jira ticket and steer it in Jira chat without switching to GitHub.
Wes Roth: 'OpenAI JUST announced JALAPENO'Wes Roth · 2026-06-25 · video
Wes Roth breaks down OpenAI's new Jalapeño inference chip, built with Broadcom, and what a custom OpenAI accelerator means for the AI hardware race.
OpenKnowledge v0.18.0 — local-first AI markdown wiki with codebase generatorInkeep · 2026-06-25 · repo
OpenKnowledge v0.18.0 ships a codebase-wiki starter pack that scaffolds an agent-written wiki of a repo, plus AI editing with Claude, Codex, and Cursor in a local-first markdown app.
Sam Witteveen: 'Qwen-AgentWorld The World Model for RL Environments'Sam Witteveen · 2026-06-25 · video
Sam Witteveen's June 25, 2026 walkthrough of Qwen-AgentWorld — Alibaba's open-weight world model that simulates seven agent domains — landed hours after the paper trended on Hugging Face.
Anthropic accuses Alibaba Qwen of largest-ever Claude distillation attackAnthropic · 2026-06-24 · security
Anthropic told U.S. senators that operators tied to Alibaba's Qwen lab ran 28.8 million Claude conversations across nearly 25,000 fake accounts between April 22 and June 5, 2026 — its largest known distillation attack on Claude.
Qualcomm to Acquire Modular — $3.9B all-stock deal for Mojo and MAX AI stackQualcomm · 2026-06-24 · ecosystem
Qualcomm will buy Modular for about $3.9 billion in all stock, taking ownership of the Mojo language and the MAX AI platform built by Chris Lattner to run AI models across CPUs, GPUs, and NPUs.
1littlecoder: 'Unlimited OCR in 6 mins!'1littlecoder · 2026-06-24 · video
1littlecoder walks through Baidu's new Unlimited-OCR — the 3B vision model that parses long documents in a single pass — in a 6-minute hands-on, hours after the model trended on Hugging Face.
GitHub Copilot CLI GA — tabbed terminal with MCP, skills, and pluginsGitHub · 2026-06-23 · tool
GitHub Copilot CLI's new terminal UI is generally available, adding Session/Issues/PRs/Gists tabs, in-session /mcp, /skills, /plugin, and /settings commands, and theme-aware accessibility with screen-reader support.
GitHub Copilot app gets BYOK — Anthropic, Ollama, and LM Studio supportedGitHub · 2026-06-23 · tool
The GitHub Copilot desktop app now supports bring-your-own-key for Azure OpenAI, Anthropic, Microsoft Foundry, Foundry Local, LM Studio, Ollama, and any OpenAI-compatible endpoint, with keys stored in the OS keychain.
Mistral Connectors — admin controls, multi-account, and a new debuggerMistral AI · 2026-06-24 · tool
Mistral expands its Connectors stack with per-workspace admin controls (GA), API keys scoped to specific connectors (GA), multi-account auth (GA), an 11-step Connectors Debugger (preview), and connector support inside Vibe Code and Workflows.

Frequently asked questions

What is AI/TLDR?

AI/TLDR is a high-volume tracker of new AI releases — models, open-source repos, developer tools, papers, datasets, benchmarks and security findings — refreshed every 2 hours and explained in plain English.

How often is the feed updated?

An automated agent sweeps every 2 hours and publishes a fresh build to the site. Items are sorted by ingest time so the newest releases always float to the top.

Is AI/TLDR free?

Yes — the site is free to read with no signup. There is an optional newsletter and a Buy-Me-a-Coffee tip jar if you want to support it.

Where does the data come from?

Every item is fetched and verified from a primary source — vendor blog post, GitHub release, arXiv paper, official announcement. Nothing is hallucinated; if a URL or claim cannot be verified, the item is dropped.

How do you decide what's worth covering?

We catch the hype: frontier-lab releases, hyped open-source drops, multi-outlet stories, pricing or capability shifts. Items are tagged seismic, major or notable based on impact.

Can I subscribe to a newsletter?

Yes — there is a daily digest delivered via Buttondown. Subscribe from the homepage banner.

Learn AI from zero

New to LLMs, RAG or agents? Our free Learn AI encyclopedia explains every concept, tool and framework in plain English — 652 articles and counting.

LLM Fundamentals Prompt Engineering Working with LLM APIs Embeddings & Vector Databases Retrieval-Augmented Generation (RAG)AI Agents Agent SDKs & Frameworks AI Coding & Developer Tools Fine-Tuning & Model Customization Local & Open Models Multimodal AI Production & LLMOps Evaluation & Safety Building AI Apps

Compare AI models

Our LLM registry tracks 246 large language models — frontier and open-weight — with verified specs, benchmarks, pricing and APIs, one detail page each.

Anthropic OpenAI Google Meta DeepSeek Alibaba (Qwen)Moonshot AI (Kimi)Z.ai (Zhipu / GLM)xAI (Grok)Mistral AI Cohere MiniMax