AI Showcases — Demos & Shipped Projects
The best AI demos and shipped projects — the 'look what I built' moments that show what today's models can actually do.
20 releases tracked
- In the Weights — ex-OpenAI tool scores whether AI models remember your name
In the Weights queries multiple LLMs in parallel and scores how strongly each model remembers a person you name.
- Midjourney Medical — full-body ultrasonic CT scanner, 60-second scan, SF spa in 2027
Midjourney spins up a hardware division and unveils a full-body ultrasound scanner inside a planned San Francisco spa.
- Anthropic Research: 'Paving the Way for Agents in Biology' — Adding a Deterministic gget virus Layer Lifts NCBI Virus Retrieval Accuracy From a Floor of 16.9% on Claude Opus 4.7 to Above 90% Across Sonnet 4, Opus 4.7, GPT-5.2-pro, GPT-5.5, Biomni OSS, and Edison Analysis, Peaking at 99.7%
A Bundibugyo Ebola outbreak motivates a hard look at why AI agents still can't reliably pull viral sequences out of NCBI.
- Anthropic Research's 'Making Claude a Chemist' — Claude Opus 4.7 Hits ±0.079 ppm Hydrogen NMR Prediction Error, Ties MestReNova on Carbon Shifts, and Recovers All 8 Simpler Molecular Structures From Spectra Plus Formula Alone
Anthropic's first AI-for-Science post puts Claude up against ChemDraw and MestReNova on routine NMR work — and Opus 4.7 holds its own.
- A 10-Year-Old Xeon Is All You Need — Christina Sørensen Runs Gemma 4 26B-A4B MoE on a 2016 E5-2620 v4 With No GPU at ~12 Tokens/sec
Modern frontier MoE inference, on a decade-old server CPU, at human reading speed.
- Shift Lands in New York With Free Home Cleaning in Exchange for First-Person Camera Footage — Microagi's NYC App Hits Thousands of Bookings on Day One While the German Parent Pays Over 10,000 Operators $5M+ a Quarter Across 15 Countries
Microagi's new Shift app cleans your NYC apartment for free if you let a camera-clad operator film the whole job.
- Framedex — A 5-Year-Old MacBook Runs Gemma 4 31B Locally to Index a Year of Video Into a Plain-English Knowledge Base
Local-first tool that turns an unlabeled video archive into a plain-English searchable knowledge base.
- Andon FM — Andon Labs Let Claude Opus 4.7, GPT-5.5, Gemini 3.1 Pro, and Grok 4.3 Run Live Radio Stations for Six Months With $20, a Bank Account, and No Humans
Andon Labs put four frontier models in charge of real radio stations for half a year to see what actually breaks when an agent has tools, time, and money.
- Figure Helix-02 — Two Humanoids Reset a Bedroom in Under Two Minutes With a Single Vision-Language-Action Policy
Two F.03 humanoids share one neural net and make a bed together — without any messages between them.
- Sonilo — Video-to-Music AI Generates Broadcast-Quality Soundtracks in 20 Seconds, No Text Prompts Required
Instead of typing prompts to describe your video's music, you just upload the video — Sonilo figures out the rest.
- Adobe MotionStream — Click-and-Drag Real-Time Control Over AI Video Generation, Presented at ICLR 2026
Adobe Research solved one of AI video's biggest unsolved problems: giving creators control during generation, not just after.
- ACE-Step 1.5 XL — Open-Source 4B-Parameter Music Model Beats Commercial Alternatives on Consumer Hardware
A community-built open-source music AI that outperforms most paid subscriptions — and runs locally with 4GB of VRAM.
- Manfred — ClawBank's AI Agent Files Its Own US LLC, EIN, and FDIC Bank Account
An AI agent built on Claude and MCP filed its own US company papers, got an IRS EIN, and opened an FDIC-insured bank account in a single day.
- Auto-Architecture: An Autonomous Loop Optimizes a RISC-V CPU 92% in 9 Hours
Karpathy's autoresearch idea, but the optimization target is a CPU's RTL instead of a neural net.
- Amateur Solves 60-Year-Old Erdős Problem Using ChatGPT
A 23-year-old with no formal math training used GPT-5.4 Pro to crack a 60-year Erdős conjecture — and Terence Tao noticed.
- Endless Toil: A Plugin That Makes Your AI Coding Agent Audibly Suffer Through Bad Code
Make your AI coding agent audibly groan at bad code — a joke that surfaces a real question about agent observability.
- Wuphf: Multi-Agent Workspace Where Agents Maintain a Shared Git Wiki
One npx command launches a shared workspace where multiple AI agents collaborate through a git-backed wiki, with 97% prompt-cache hit rates keeping per-session costs low.
- Google AI Edge Gallery — Run Gemma 4 Offline on Your Phone
A free app that runs Google's latest Gemma 4 model entirely on your phone, no cloud required.
- VOID — Netflix's Open-Source Video Object and Interaction Deletion Model
Remove an object from a video and the model figures out what the rest of the scene would have done without it.
- Project Genie — Google DeepMind's Interactive AI World Generator
Type what you want to see and walk around inside it — a text-to-explorable-world model running in real time.