AI/TLDR

Sam Witteveen · 2026-06-26 · notable

Sam Witteveen: 'Introducing Ornith 1.0' — open-weight coding LLM walkthrough

Sam Witteveen walks through Ornith 1.0, DeepReinforce's MIT-licensed coding model family whose RL loop learns its own scaffold — uploaded hours after the 9B–397B weights landed on Hugging Face.

Sam Witteveen YouTube thumbnail for the Ornith 1.0 walkthrough

A first hands-on look at Ornith 1.0 — DeepReinforce's open-weight coding LLM family that trains its own RL scaffold.

What is it?

Sam Witteveen breaks down Ornith 1.0, the MIT-licensed agentic coding model family DeepReinforce released on June 25, 2026. The lineup spans a 9B dense, a 31B dense, a 35B mixture-of-experts, and a 397B MoE flagship, all built on Gemma 4 and Qwen 3.5 pretrained checkpoints.

How does it work?

The walkthrough explains the self-scaffolding RL loop at the core of Ornith 1.0: the model first proposes a task-specific scaffold, then uses that scaffold to generate a solution rollout, with both targets co-evolving during training. Sam Witteveen is one of the eight AI YouTubers AI/TLDR tracks for fast, hands-on coverage of open frontier releases.

Why does it matter?

Coding-model news tends to land as a paper plus weights and little else. Ornith 1.0 is open-source and hits 82.4% on SWE-Bench Verified, but most engineers will first meet it through Sam Witteveen's video — a same-day demo of the variants, the RL recipe, and what the scaffold-co-evolution buys over a fixed harness.

Try it

https://www.youtube.com/watch?v=uD4-uy0GmHE

Sources · 3 outlets

Tags

  • video
  • sam-witteveen
  • ornith
  • deepreinforce
  • open-weights
  • agentic-coding
  • reinforcement-learning
  • self-scaffolding

← All releases · Learn AI