AllTopicsTodayAllTopicsToday
Notification
Font ResizerAa
  • Home
  • Tech
  • Investing & Finance
  • AI
  • Entertainment
  • Wellness
  • Gaming
  • Movies
Reading: DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds
Share
Font ResizerAa
AllTopicsTodayAllTopicsToday
  • Home
  • Blog
  • About Us
  • Contact
Search
  • Home
  • Tech
  • Investing & Finance
  • AI
  • Entertainment
  • Wellness
  • Gaming
  • Movies
Have an existing account? Sign In
Follow US
©AllTopicsToday 2026. All Rights Reserved.
AllTopicsToday > Blog > AI > DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds
Blog1913 23.png
AI

DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds

AllTopicsToday
Last updated: June 25, 2026 10:48 pm
AllTopicsToday
Published: June 25, 2026
Share
SHARE




DeepReinforce has launched Ornith-1.0, an open supply mannequin household constructed for agent coding. Out there in 4 sizes, from the 9B high-density mannequin to the 397B knowledgeable blended flagship. All Checkpoints are shipped below Hugging Face’s MIT License. The mannequin is post-trained primarily based on pre-trained Gemma 4 and Qwen 3.5.

Most coding brokers mix fashions with human-designed fixation harnesses. Ornith-1.0 as a substitute learns its personal description. The DeepReinforce analysis group studies state-of-the-art outcomes on open fashions of comparable measurement.

TL;DR

Ornith-1.0 ships below MIT in 9B, 31B, 35B-MoE, and 397B-MoE sizes and is constructed on Gemma 4 and Qwen 3.5. The mannequin learns its personal scaffolding throughout RL and collectively optimizes the harness and resolution. Ornith-1.0-397B outperforms Claude Opus 4.7 in each headline benchmarks, however falls wanting Opus 4.8 and the bigger GLM-5.2-744B. Three layers (fastened belief boundaries, deterministic displays, and frozen LLM judges) forestall rewards from being hacked.

What’s Ornis-1.0?

Ornith-1.0 is a set of inference fashions tailor-made for coding brokers. The variants are 9B Dense, 31B Dense, 35B MoE, and 397B MoE. The 35B mannequin is a mixture of consultants and prompts roughly 3B parameters per token. FP8 and GGUF builds are additionally printed to hurry up native supply.

Every mannequin is an inference mannequin. Replies begin with a block earlier than the ultimate reply. The supplied recipe permits the reasoning parser, so the hint is returned in a separate reasoning_content area. This mannequin additionally points well-formed instrument requires agent loops.

Set up is simple. The 9B mannequin is about 19GB in bf16 and runs on a single 80GB GPU. The supplied recipes goal vLLM, SGLang, and Transformers. Every mannequin exposes an OpenAI-compatible endpoint. Subsequently, the usual agent framework works with none code adjustments.

interactive explainer

Meta AI Releases SAM Audio: A State-of-the-Art Unified Model that Uses Intuitive and Multimodal Prompts for Audio Separation
Security Concerns With AI Trading Bots (And How to Stay Safe)
CBS blocks James Talarico interview by Stephen Colbert
South Korea’s third-quarter GDP grows at fastest pace in over a year
Apple iPhone 17 goes on sale as questions remain over China market, AI strategy
TAGGED:CodingDeepReinforcefamilylearnsmodelopensourceOrnith1.0ReleasesScaffolds
Share This Article
Facebook Email Print
Leave a Comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Follow US

Find US on Social Medias
FacebookLike
XFollow
YoutubeSubscribe
TelegramFollow

Weekly Newsletter

Subscribe to our newsletter to get our newest articles instantly!
Popular News
Cloud 1200x675.jpg
Gaming

The Opening Still Hits So Hard

AllTopicsToday
AllTopicsToday
September 2, 2025
Building a ‘Human-in-the-Loop’ Approval Gate for Autonomous Agents
Deep Agents Tutorial: LangGraph for Smarter AI
10 Best Foods to Reduce Insulin Resistance Naturally
GFN Thursday: Flight Controls on GeForce NOW
- Advertisement -
Ad space (1)

Categories

  • Tech
  • Investing & Finance
  • AI
  • Entertainment
  • Wellness
  • Gaming
  • Movies

About US

We believe in the power of information to empower decisions, fuel curiosity, and spark innovation.
Quick Links
  • Home
  • Blog
  • About Us
  • Contact
Important Links
  • About Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
  • Contact

Subscribe US

Subscribe to our newsletter to get our newest articles instantly!

©AllTopicsToday 2026. All Rights Reserved.
1 2
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?