Skip to content

Commit 6291b25

Browse files
authored
Merge pull request #995 from massgen/dev/v0.1.63
feat: v0.1.63
2 parents 37b6f84 + 560dd14 commit 6291b25

59 files changed

Lines changed: 2361 additions & 498 deletions

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

.github/workflows/tests.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -37,5 +37,5 @@ jobs:
3737
run: >
3838
uv run pytest massgen/tests
3939
-m "not live_api and not docker and not expensive"
40-
-k "not test_timeline_snapshot and not test_final_lock_option and not test_web_quickstart_reasoning_sync"
40+
-k "not test_timeline_snapshot and not test_final_lock_option and not test_web_quickstart_reasoning_sync and not test_subagent_input_bar_snapshot_matches_main_input"
4141
-q --tb=no

CHANGELOG.md

Lines changed: 23 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,9 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
99

1010
## Recent Releases
1111

12+
**v0.1.63 (March 13, 2026)** - Ensemble & Contracts
13+
Subagent ensemble pattern with `disable_injection` and `defer_voting_until_all_answered` as defaults. Round evaluator transformation pressure and success contracts. Lighter refinement for subagents. Killed agent handling.
14+
1215
**v0.1.62 (March 11, 2026)** - MassGen Skill & Viewer
1316
New general-purpose MassGen Skill with 4 modes (general, evaluate, plan, spec) for use from Claude Code and other AI agents. Session viewer for real-time observation. Backend improvements for Claude Code, Codex, and Copilot. Headless and web quickstart modes.
1417

@@ -23,6 +26,26 @@ Planning improvements with auto-added improvements to task plan and plan review
2326

2427
---
2528

29+
## [0.1.63] - 2026-03-13
30+
31+
### Added
32+
- **Ensemble Pattern Defaults** ([#996](https://github.com/massgen/MassGen/pull/996)): `disable_injection` and `defer_voting_until_all_answered` now default to true for ensemble-style subagent orchestration
33+
- **Transformation Pressure** ([#996](https://github.com/massgen/MassGen/pull/996)): Round evaluator applies transformation pressure to push agents toward meaningful structural changes
34+
- **Success Contracts** ([#996](https://github.com/massgen/MassGen/pull/996)): Explicit quality gates that agents must satisfy before the round evaluator allows convergence
35+
36+
### Changed
37+
- **Lighter Refinement** ([#996](https://github.com/massgen/MassGen/pull/996)): Subagents use lighter refinement prompts to reduce token overhead and latency
38+
- **Killed Agent Handling** ([#996](https://github.com/massgen/MassGen/pull/996)): Graceful management of agents that time out or fail mid-round
39+
- **Verification Replay** ([#996](https://github.com/massgen/MassGen/pull/996)): Evaluation consistency across rounds via replayed verification context
40+
41+
### Fixed
42+
- **Timeout Fallback** ([#996](https://github.com/massgen/MassGen/pull/996)): More robust coordination when agents hit timeout boundaries
43+
44+
### Technical Details
45+
- **Major Focus**: Ensemble & Contracts — ensemble pattern defaults, transformation pressure, success contracts, lighter refinement
46+
- **PRs Merged**: [#996](https://github.com/massgen/MassGen/pull/996) (dev/v0.1.62-p1)
47+
- **Contributors**: @ncrispino, @HenryQi and the MassGen team
48+
2649
## [0.1.62] - 2026-03-11
2750

2851
### Added

CONTRIBUTING.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -359,7 +359,7 @@ Create a `.env` file in the `massgen` directory as described in [README](README.
359359

360360
## 🔧 Development Workflow
361361

362-
> **Important**: Our next version is v0.1.63. If you want to contribute, please contribute to the `dev/v0.1.63` branch (or `main` if dev/v0.1.63 doesn't exist yet).
362+
> **Important**: Our next version is v0.1.64. If you want to contribute, please contribute to the `dev/v0.1.64` branch (or `main` if dev/v0.1.64 doesn't exist yet).
363363
364364
### 1. Create Feature Branch
365365

@@ -368,7 +368,7 @@ Create a `.env` file in the `massgen` directory as described in [README](README.
368368
git fetch upstream
369369

370370
# Create feature branch from dev/v0.1.60 (or main if dev branch doesn't exist yet)
371-
git checkout -b feature/your-feature-name upstream/dev/v0.1.63
371+
git checkout -b feature/your-feature-name upstream/dev/v0.1.64
372372
```
373373

374374
### 2. Make Your Changes
@@ -507,7 +507,7 @@ git push origin feature/your-feature-name
507507
```
508508

509509
Then create a pull request on GitHub:
510-
- Base branch: `dev/v0.1.63` (or `main` if dev branch doesn't exist yet)
510+
- Base branch: `dev/v0.1.64` (or `main` if dev branch doesn't exist yet)
511511
- Compare branch: `feature/your-feature-name`
512512
- Add clear description of changes
513513
- Link any related issues
@@ -617,7 +617,7 @@ Have a significant feature idea not covered by existing tracks?
617617
- [ ] Tests pass locally
618618
- [ ] Documentation is updated if needed
619619
- [ ] Commit messages follow convention
620-
- [ ] PR targets `dev/v0.1.63` branch (or `main` if dev branch doesn't exist yet)
620+
- [ ] PR targets `dev/v0.1.64` branch (or `main` if dev branch doesn't exist yet)
621621

622622
### PR Description Should Include
623623

README.md

Lines changed: 32 additions & 35 deletions
Original file line numberDiff line numberDiff line change
@@ -69,7 +69,7 @@ This project started with the "threads of thought" and "iterative refinement" id
6969
<details open>
7070
<summary><h3>🆕 Latest Features</h3></summary>
7171

72-
- [v0.1.62 Features](#-latest-features-v0162)
72+
- [v0.1.63 Features](#-latest-features-v0163)
7373
</details>
7474

7575
<details open>
@@ -122,15 +122,15 @@ This project started with the "threads of thought" and "iterative refinement" id
122122
<details open>
123123
<summary><h3>🗺️ Roadmap</h3></summary>
124124

125-
- [Recent Achievements (v0.1.62)](#recent-achievements-v0162)
126-
- [Previous Achievements (v0.0.3 - v0.1.61)](#previous-achievements-v003---v0161)
125+
- [Recent Achievements (v0.1.63)](#recent-achievements-v0163)
126+
- [Previous Achievements (v0.0.3 - v0.1.62)](#previous-achievements-v003---v0162)
127127
- [Key Future Enhancements](#key-future-enhancements)
128128
- Bug Fixes & Backend Improvements
129129
- Advanced Agent Collaboration
130130
- Expanded Model, Tool & Agent Integrations
131131
- Improved Performance & Scalability
132132
- Enhanced Developer Experience
133-
- [v0.1.63 Roadmap](#v0163-roadmap)
133+
- [v0.1.64 Roadmap](#v0164-roadmap)
134134
</details>
135135

136136
<details open>
@@ -155,24 +155,20 @@ This project started with the "threads of thought" and "iterative refinement" id
155155

156156
---
157157

158-
## 🆕 Latest Features (v0.1.62)
158+
## 🆕 Latest Features (v0.1.63)
159159

160-
**🎉 Released: March 11, 2026**
160+
**🎉 Released: March 13, 2026**
161161

162-
**What's New in v0.1.62:**
163-
- **🧩 MassGen Skill** - New general-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents.
164-
- **👁️ Session Viewer** - New `massgen viewer` command for real-time observation of automation sessions with interactive picker and web mode.
165-
- **Backend & Quickstart** - Claude Code/Codex/Copilot backend improvements, headless and web quickstart modes.
162+
**What's New in v0.1.63:**
163+
- **🎯 Ensemble Pattern** - `disable_injection` and `defer_voting_until_all_answered` now default to true for ensemble-style subagent orchestration.
164+
- **🔄 Round Evaluator Improvements** - Transformation pressure and success contracts for deeper quality assessment.
165+
- **Lighter Refinement** - Reduced token overhead for subagent workflows with killed agent handling.
166166

167-
**Try v0.1.62 Features:**
167+
**Try v0.1.63 Features:**
168168
```bash
169-
# Install the MassGen Skill for your AI agent
170-
npx skills add massgen/skills --all
171-
# Then in Claude Code, Cursor, Copilot, etc.:
172-
# /massgen "Your complex task"
173-
174-
# Try the Session Viewer
175-
uv run massgen viewer --pick
169+
pip install massgen==0.1.63
170+
# Try the round evaluator with ensemble defaults
171+
uv run massgen --config @examples/features/round_evaluator_example.yaml "Create a polished landing page for an AI product"
176172
```
177173

178174
[See full release history and examples](massgen/configs/README.md#release-history--examples)
@@ -1244,25 +1240,27 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
12441240

12451241
⚠️ **Early Stage Notice:** As MassGen is in active development, please expect upcoming breaking architecture changes as we continue to refine and improve the system.
12461242

1247-
### Recent Achievements (v0.1.62)
1243+
### Recent Achievements (v0.1.63)
1244+
1245+
**🎉 Released: March 13, 2026**
12481246

1249-
**🎉 Released: March 11, 2026**
1247+
#### Ensemble Pattern
1248+
- **Ensemble Defaults** ([#996](https://github.com/massgen/MassGen/pull/996)): `disable_injection` and `defer_voting_until_all_answered` now default to true for ensemble-style subagent orchestration
1249+
- **Automatic Ensemble**: Defaults apply when spawning subagent orchestrators without explicit override
12501250

1251-
#### MassGen Skill
1252-
- **General-Purpose Skill** ([#992](https://github.com/massgen/MassGen/pull/992)): New multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents
1253-
- **Auto-Sync**: GitHub Actions workflow to auto-sync skill to separate repository for easy installation
1254-
- **Reference Docs**: Comprehensive workflow guides and prompt templates for each mode
1251+
#### Round Evaluator Improvements
1252+
- **Transformation Pressure** ([#996](https://github.com/massgen/MassGen/pull/996)): Evaluator pushes agents toward meaningful structural changes rather than surface-level edits
1253+
- **Success Contracts** ([#996](https://github.com/massgen/MassGen/pull/996)): Explicit quality gates agents must satisfy before convergence
1254+
- **Verification Replay**: Evaluation consistency across rounds via replayed verification context
12551255

1256-
#### Session Viewer
1257-
- **Viewer Command** ([#992](https://github.com/massgen/MassGen/pull/992)): New `massgen viewer` for real-time observation of automation sessions
1258-
- **Interactive Picker**: `--pick` flag for session selection, `--web` for browser-based viewing
1256+
#### Lighter Refinement & Fixes
1257+
- **Lighter Refinement** ([#996](https://github.com/massgen/MassGen/pull/996)): Subagents use lighter refinement prompts to reduce token overhead and latency
1258+
- **Killed Agent Handling**: Graceful management of agents that time out or fail mid-round
1259+
- **Timeout Fallback**: More robust coordination when agents hit timeout boundaries
12591260

1260-
#### Backend & Quickstart
1261-
- **Backend Improvements** ([#992](https://github.com/massgen/MassGen/pull/992)): Claude Code background task execution, Codex native filesystem and MCP support, Copilot runtime model discovery
1262-
- **Quickstart Modes**: Headless quickstart (`--quickstart --headless`) for CI/CD, web quickstart (`--web-quickstart`) for browser-based setup
1263-
- **Evaluation & Planning**: Better planning prompts with thoroughness support, removed should/could criteria
1261+
### Previous Achievements (v0.0.3 - v0.1.62)
12641262

1265-
### Previous Achievements (v0.0.3 - v0.1.61)
1263+
**MassGen Skill & Viewer (v0.1.62)**: General-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents. Session viewer for real-time observation. Backend improvements for Claude Code, Codex, and Copilot. Headless and web quickstart modes.
12661264

12671265
**Round Evaluator Paradigm (v0.1.61)**: New round evaluator subagent type that automatically spawns evaluator subagents after each new answer to provide detailed feedback as input to the next round. Major orchestrator refactoring with improved evaluation prompts, task plan injection, and subagent fixes.
12681266

@@ -1531,13 +1529,12 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
15311529

15321530
We welcome community contributions to achieve these goals.
15331531

1534-
### v0.1.63 Roadmap
1532+
### v0.1.64 Roadmap
15351533

1536-
Version 0.1.63 focuses on adding a Gemini CLI backend and image/video editing capabilities:
1534+
Version 0.1.64 focuses on adding a Gemini CLI backend:
15371535

15381536
#### Planned Features
15391537
- **Gemini CLI Backend** ([#952](https://github.com/massgen/MassGen/issues/952)): Gemini CLI as a first-class backend option
1540-
- **Image/Video Edit Capabilities** ([#959](https://github.com/massgen/MassGen/issues/959)): Check and support image/video editing capabilities across providers
15411538

15421539
---
15431540

README_PYPI.md

Lines changed: 32 additions & 35 deletions
Original file line numberDiff line numberDiff line change
@@ -68,7 +68,7 @@ This project started with the "threads of thought" and "iterative refinement" id
6868
<details open>
6969
<summary><h3>🆕 Latest Features</h3></summary>
7070

71-
- [v0.1.62 Features](#-latest-features-v0162)
71+
- [v0.1.63 Features](#-latest-features-v0163)
7272
</details>
7373

7474
<details open>
@@ -121,15 +121,15 @@ This project started with the "threads of thought" and "iterative refinement" id
121121
<details open>
122122
<summary><h3>🗺️ Roadmap</h3></summary>
123123

124-
- [Recent Achievements (v0.1.62)](#recent-achievements-v0162)
125-
- [Previous Achievements (v0.0.3 - v0.1.61)](#previous-achievements-v003---v0161)
124+
- [Recent Achievements (v0.1.63)](#recent-achievements-v0163)
125+
- [Previous Achievements (v0.0.3 - v0.1.62)](#previous-achievements-v003---v0162)
126126
- [Key Future Enhancements](#key-future-enhancements)
127127
- Bug Fixes & Backend Improvements
128128
- Advanced Agent Collaboration
129129
- Expanded Model, Tool & Agent Integrations
130130
- Improved Performance & Scalability
131131
- Enhanced Developer Experience
132-
- [v0.1.63 Roadmap](#v0163-roadmap)
132+
- [v0.1.64 Roadmap](#v0164-roadmap)
133133
</details>
134134

135135
<details open>
@@ -154,24 +154,20 @@ This project started with the "threads of thought" and "iterative refinement" id
154154

155155
---
156156

157-
## 🆕 Latest Features (v0.1.62)
157+
## 🆕 Latest Features (v0.1.63)
158158

159-
**🎉 Released: March 11, 2026**
159+
**🎉 Released: March 13, 2026**
160160

161-
**What's New in v0.1.62:**
162-
- **🧩 MassGen Skill** - New general-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents.
163-
- **👁️ Session Viewer** - New `massgen viewer` command for real-time observation of automation sessions with interactive picker and web mode.
164-
- **Backend & Quickstart** - Claude Code/Codex/Copilot backend improvements, headless and web quickstart modes.
161+
**What's New in v0.1.63:**
162+
- **🎯 Ensemble Pattern** - `disable_injection` and `defer_voting_until_all_answered` now default to true for ensemble-style subagent orchestration.
163+
- **🔄 Round Evaluator Improvements** - Transformation pressure and success contracts for deeper quality assessment.
164+
- **Lighter Refinement** - Reduced token overhead for subagent workflows with killed agent handling.
165165

166-
**Try v0.1.62 Features:**
166+
**Try v0.1.63 Features:**
167167
```bash
168-
# Install the MassGen Skill for your AI agent
169-
npx skills add massgen/skills --all
170-
# Then in Claude Code, Cursor, Copilot, etc.:
171-
# /massgen "Your complex task"
172-
173-
# Try the Session Viewer
174-
uv run massgen viewer --pick
168+
pip install massgen==0.1.63
169+
# Try the round evaluator with ensemble defaults
170+
uv run massgen --config @examples/features/round_evaluator_example.yaml "Create a polished landing page for an AI product"
175171
```
176172

177173
[See full release history and examples](massgen/configs/README.md#release-history--examples)
@@ -1243,25 +1239,27 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
12431239

12441240
⚠️ **Early Stage Notice:** As MassGen is in active development, please expect upcoming breaking architecture changes as we continue to refine and improve the system.
12451241

1246-
### Recent Achievements (v0.1.62)
1242+
### Recent Achievements (v0.1.63)
1243+
1244+
**🎉 Released: March 13, 2026**
12471245

1248-
**🎉 Released: March 11, 2026**
1246+
#### Ensemble Pattern
1247+
- **Ensemble Defaults** ([#996](https://github.com/massgen/MassGen/pull/996)): `disable_injection` and `defer_voting_until_all_answered` now default to true for ensemble-style subagent orchestration
1248+
- **Automatic Ensemble**: Defaults apply when spawning subagent orchestrators without explicit override
12491249

1250-
#### MassGen Skill
1251-
- **General-Purpose Skill** ([#992](https://github.com/massgen/MassGen/pull/992)): New multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents
1252-
- **Auto-Sync**: GitHub Actions workflow to auto-sync skill to separate repository for easy installation
1253-
- **Reference Docs**: Comprehensive workflow guides and prompt templates for each mode
1250+
#### Round Evaluator Improvements
1251+
- **Transformation Pressure** ([#996](https://github.com/massgen/MassGen/pull/996)): Evaluator pushes agents toward meaningful structural changes rather than surface-level edits
1252+
- **Success Contracts** ([#996](https://github.com/massgen/MassGen/pull/996)): Explicit quality gates agents must satisfy before convergence
1253+
- **Verification Replay**: Evaluation consistency across rounds via replayed verification context
12541254

1255-
#### Session Viewer
1256-
- **Viewer Command** ([#992](https://github.com/massgen/MassGen/pull/992)): New `massgen viewer` for real-time observation of automation sessions
1257-
- **Interactive Picker**: `--pick` flag for session selection, `--web` for browser-based viewing
1255+
#### Lighter Refinement & Fixes
1256+
- **Lighter Refinement** ([#996](https://github.com/massgen/MassGen/pull/996)): Subagents use lighter refinement prompts to reduce token overhead and latency
1257+
- **Killed Agent Handling**: Graceful management of agents that time out or fail mid-round
1258+
- **Timeout Fallback**: More robust coordination when agents hit timeout boundaries
12581259

1259-
#### Backend & Quickstart
1260-
- **Backend Improvements** ([#992](https://github.com/massgen/MassGen/pull/992)): Claude Code background task execution, Codex native filesystem and MCP support, Copilot runtime model discovery
1261-
- **Quickstart Modes**: Headless quickstart (`--quickstart --headless`) for CI/CD, web quickstart (`--web-quickstart`) for browser-based setup
1262-
- **Evaluation & Planning**: Better planning prompts with thoroughness support, removed should/could criteria
1260+
### Previous Achievements (v0.0.3 - v0.1.62)
12631261

1264-
### Previous Achievements (v0.0.3 - v0.1.61)
1262+
**MassGen Skill & Viewer (v0.1.62)**: General-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents. Session viewer for real-time observation. Backend improvements for Claude Code, Codex, and Copilot. Headless and web quickstart modes.
12651263

12661264
**Round Evaluator Paradigm (v0.1.61)**: New round evaluator subagent type that automatically spawns evaluator subagents after each new answer to provide detailed feedback as input to the next round. Major orchestrator refactoring with improved evaluation prompts, task plan injection, and subagent fixes.
12671265

@@ -1530,13 +1528,12 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
15301528

15311529
We welcome community contributions to achieve these goals.
15321530

1533-
### v0.1.63 Roadmap
1531+
### v0.1.64 Roadmap
15341532

1535-
Version 0.1.63 focuses on adding a Gemini CLI backend and image/video editing capabilities:
1533+
Version 0.1.64 focuses on adding a Gemini CLI backend:
15361534

15371535
#### Planned Features
15381536
- **Gemini CLI Backend** ([#952](https://github.com/massgen/MassGen/issues/952)): Gemini CLI as a first-class backend option
1539-
- **Image/Video Edit Capabilities** ([#959](https://github.com/massgen/MassGen/issues/959)): Check and support image/video editing capabilities across providers
15401537

15411538
---
15421539

0 commit comments

Comments
 (0)