DeNovoSWE Scales Long-Horizon Environments for Repository Generation

By Harsh Desai10 June 2026

TL;DR

DeNovoSWE scales long-horizon environments for training LLM agents to generate complete software repositories from high-level specifications.

What changed

DeNovoSWE introduces scaled long-horizon environments for training LLM-based code agents to generate entire repositories from high-level specifications. This expands agent roles beyond localized edits in existing codebases. Developers and Vibe Builders gain access to setups focused on complete software creation.

Why it matters

Developers can train agents for full repository tasks in use cases such as end-to-end project builds where prior methods stayed limited to short sequences. Basic Users see potential for agents that handle complete implementations similar to how tools like standard code agents manage incremental changes.

What to watch for

Compare results against existing short-horizon code agents when testing repository outputs. Verify by checking the paper environments on the source platform for scaling details and completion metrics.

Who this matters for

Vibe Builders: Use DeNovoSWE to move from single-file prompts to generating entire functional project structures.
Developers: Integrate long-horizon training environments to improve agent performance on complex repo-wide tasks.

Harsh’s take

DeNovoSWE signals a shift from the 'copilot' era of small patches to the 'architect' era of full repository generation. Most current agents fail because they lose context over long horizons: this framework provides the necessary environment to train for persistence. It is a direct response to the limitations of SWE-bench, which focuses on fixes rather than creation.

Operators should stop thinking about AI as a snippet generator. If you are building internal tools, you need to move toward these long-horizon benchmarks to ensure your agents can handle dependency management and multi-file logic. The ability to generate a clean, working repo from a single spec is the new baseline for high-end dev productivity.

by Harsh Desai

Source:huggingface.co

More AI news

Feature10 June 2026
Package Firewall: Blocking 8,000+ malicious packages daily
Replit launched Package Firewall in partnership with Socket. The feature blocks over 8,000 malicious packages daily at installation time.