You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: .github/workflows/tests.yml
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -37,5 +37,5 @@ jobs:
37
37
run: >
38
38
uv run pytest massgen/tests
39
39
-m "not live_api and not docker and not expensive"
40
-
-k "not test_timeline_snapshot and not test_final_lock_option and not test_web_quickstart_reasoning_sync"
40
+
-k "not test_timeline_snapshot and not test_final_lock_option and not test_web_quickstart_reasoning_sync and not test_subagent_input_bar_snapshot_matches_main_input"
Subagent ensemble pattern with `disable_injection` and `defer_voting_until_all_answered` as defaults. Round evaluator transformation pressure and success contracts. Lighter refinement for subagents. Killed agent handling.
New general-purpose MassGen Skill with 4 modes (general, evaluate, plan, spec) for use from Claude Code and other AI agents. Session viewer for real-time observation. Backend improvements for Claude Code, Codex, and Copilot. Headless and web quickstart modes.
14
17
@@ -23,6 +26,26 @@ Planning improvements with auto-added improvements to task plan and plan review
23
26
24
27
---
25
28
29
+
## [0.1.63] - 2026-03-13
30
+
31
+
### Added
32
+
-**Ensemble Pattern Defaults** ([#996](https://github.com/massgen/MassGen/pull/996)): `disable_injection` and `defer_voting_until_all_answered` now default to true for ensemble-style subagent orchestration
-**Success Contracts** ([#996](https://github.com/massgen/MassGen/pull/996)): Explicit quality gates that agents must satisfy before the round evaluator allows convergence
35
+
36
+
### Changed
37
+
-**Lighter Refinement** ([#996](https://github.com/massgen/MassGen/pull/996)): Subagents use lighter refinement prompts to reduce token overhead and latency
38
+
-**Killed Agent Handling** ([#996](https://github.com/massgen/MassGen/pull/996)): Graceful management of agents that time out or fail mid-round
39
+
-**Verification Replay** ([#996](https://github.com/massgen/MassGen/pull/996)): Evaluation consistency across rounds via replayed verification context
40
+
41
+
### Fixed
42
+
-**Timeout Fallback** ([#996](https://github.com/massgen/MassGen/pull/996)): More robust coordination when agents hit timeout boundaries
Copy file name to clipboardExpand all lines: CONTRIBUTING.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -359,7 +359,7 @@ Create a `.env` file in the `massgen` directory as described in [README](README.
359
359
360
360
## 🔧 Development Workflow
361
361
362
-
> **Important**: Our next version is v0.1.63. If you want to contribute, please contribute to the `dev/v0.1.63` branch (or `main` if dev/v0.1.63 doesn't exist yet).
362
+
> **Important**: Our next version is v0.1.64. If you want to contribute, please contribute to the `dev/v0.1.64` branch (or `main` if dev/v0.1.64 doesn't exist yet).
363
363
364
364
### 1. Create Feature Branch
365
365
@@ -368,7 +368,7 @@ Create a `.env` file in the `massgen` directory as described in [README](README.
368
368
git fetch upstream
369
369
370
370
# Create feature branch from dev/v0.1.60 (or main if dev branch doesn't exist yet)
@@ -155,24 +155,20 @@ This project started with the "threads of thought" and "iterative refinement" id
155
155
156
156
---
157
157
158
-
## 🆕 Latest Features (v0.1.62)
158
+
## 🆕 Latest Features (v0.1.63)
159
159
160
-
**🎉 Released: March 11, 2026**
160
+
**🎉 Released: March 13, 2026**
161
161
162
-
**What's New in v0.1.62:**
163
-
-**🧩 MassGen Skill** - New general-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents.
164
-
-**👁️ Session Viewer** - New `massgen viewer` command for real-time observation of automation sessions with interactive picker and web mode.
165
-
-**⚡ Backend & Quickstart** - Claude Code/Codex/Copilot backend improvements, headless and web quickstart modes.
162
+
**What's New in v0.1.63:**
163
+
-**🎯 Ensemble Pattern** - `disable_injection` and `defer_voting_until_all_answered` now default to true for ensemble-style subagent orchestration.
164
+
-**🔄 Round Evaluator Improvements** - Transformation pressure and success contracts for deeper quality assessment.
165
+
-**⚡ Lighter Refinement** - Reduced token overhead for subagent workflows with killed agent handling.
166
166
167
-
**Try v0.1.62 Features:**
167
+
**Try v0.1.63 Features:**
168
168
```bash
169
-
# Install the MassGen Skill for your AI agent
170
-
npx skills add massgen/skills --all
171
-
# Then in Claude Code, Cursor, Copilot, etc.:
172
-
# /massgen "Your complex task"
173
-
174
-
# Try the Session Viewer
175
-
uv run massgen viewer --pick
169
+
pip install massgen==0.1.63
170
+
# Try the round evaluator with ensemble defaults
171
+
uv run massgen --config @examples/features/round_evaluator_example.yaml "Create a polished landing page for an AI product"
176
172
```
177
173
178
174
→ [See full release history and examples](massgen/configs/README.md#release-history--examples)
@@ -1244,25 +1240,27 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
1244
1240
1245
1241
⚠️ **Early Stage Notice:** As MassGen is in active development, please expect upcoming breaking architecture changes as we continue to refine and improve the system.
1246
1242
1247
-
### Recent Achievements (v0.1.62)
1243
+
### Recent Achievements (v0.1.63)
1244
+
1245
+
**🎉 Released: March 13, 2026**
1248
1246
1249
-
**🎉 Released: March 11, 2026**
1247
+
#### Ensemble Pattern
1248
+
-**Ensemble Defaults** ([#996](https://github.com/massgen/MassGen/pull/996)): `disable_injection` and `defer_voting_until_all_answered` now default to true for ensemble-style subagent orchestration
1249
+
-**Automatic Ensemble**: Defaults apply when spawning subagent orchestrators without explicit override
1250
1250
1251
-
#### MassGen Skill
1252
-
-**General-Purpose Skill** ([#992](https://github.com/massgen/MassGen/pull/992)): New multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents
1253
-
-**Auto-Sync**: GitHub Actions workflow to auto-sync skill to separate repository for easy installation
1254
-
-**Reference Docs**: Comprehensive workflow guides and prompt templates for each mode
1251
+
#### Round Evaluator Improvements
1252
+
-**Transformation Pressure** ([#996](https://github.com/massgen/MassGen/pull/996)): Evaluator pushes agents toward meaningful structural changes rather than surface-level edits
1253
+
-**Success Contracts** ([#996](https://github.com/massgen/MassGen/pull/996)): Explicit quality gates agents must satisfy before convergence
1254
+
-**Verification Replay**: Evaluation consistency across rounds via replayed verification context
1255
1255
1256
-
#### Session Viewer
1257
-
-**Viewer Command** ([#992](https://github.com/massgen/MassGen/pull/992)): New `massgen viewer` for real-time observation of automation sessions
1258
-
-**Interactive Picker**: `--pick` flag for session selection, `--web` for browser-based viewing
1256
+
#### Lighter Refinement & Fixes
1257
+
-**Lighter Refinement** ([#996](https://github.com/massgen/MassGen/pull/996)): Subagents use lighter refinement prompts to reduce token overhead and latency
1258
+
-**Killed Agent Handling**: Graceful management of agents that time out or fail mid-round
1259
+
-**Timeout Fallback**: More robust coordination when agents hit timeout boundaries
1259
1260
1260
-
#### Backend & Quickstart
1261
-
-**Backend Improvements** ([#992](https://github.com/massgen/MassGen/pull/992)): Claude Code background task execution, Codex native filesystem and MCP support, Copilot runtime model discovery
1262
-
-**Quickstart Modes**: Headless quickstart (`--quickstart --headless`) for CI/CD, web quickstart (`--web-quickstart`) for browser-based setup
✅ **MassGen Skill & Viewer (v0.1.62)**: General-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents. Session viewer for real-time observation. Backend improvements for Claude Code, Codex, and Copilot. Headless and web quickstart modes.
1266
1264
1267
1265
✅ **Round Evaluator Paradigm (v0.1.61)**: New round evaluator subagent type that automatically spawns evaluator subagents after each new answer to provide detailed feedback as input to the next round. Major orchestrator refactoring with improved evaluation prompts, task plan injection, and subagent fixes.
1268
1266
@@ -1531,13 +1529,12 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
1531
1529
1532
1530
We welcome community contributions to achieve these goals.
1533
1531
1534
-
### v0.1.63 Roadmap
1532
+
### v0.1.64 Roadmap
1535
1533
1536
-
Version 0.1.63 focuses on adding a Gemini CLI backend and image/video editing capabilities:
1534
+
Version 0.1.64 focuses on adding a Gemini CLI backend:
1537
1535
1538
1536
#### Planned Features
1539
1537
-**Gemini CLI Backend** ([#952](https://github.com/massgen/MassGen/issues/952)): Gemini CLI as a first-class backend option
1540
-
-**Image/Video Edit Capabilities** ([#959](https://github.com/massgen/MassGen/issues/959)): Check and support image/video editing capabilities across providers
@@ -154,24 +154,20 @@ This project started with the "threads of thought" and "iterative refinement" id
154
154
155
155
---
156
156
157
-
## 🆕 Latest Features (v0.1.62)
157
+
## 🆕 Latest Features (v0.1.63)
158
158
159
-
**🎉 Released: March 11, 2026**
159
+
**🎉 Released: March 13, 2026**
160
160
161
-
**What's New in v0.1.62:**
162
-
-**🧩 MassGen Skill** - New general-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents.
163
-
-**👁️ Session Viewer** - New `massgen viewer` command for real-time observation of automation sessions with interactive picker and web mode.
164
-
-**⚡ Backend & Quickstart** - Claude Code/Codex/Copilot backend improvements, headless and web quickstart modes.
161
+
**What's New in v0.1.63:**
162
+
-**🎯 Ensemble Pattern** - `disable_injection` and `defer_voting_until_all_answered` now default to true for ensemble-style subagent orchestration.
163
+
-**🔄 Round Evaluator Improvements** - Transformation pressure and success contracts for deeper quality assessment.
164
+
-**⚡ Lighter Refinement** - Reduced token overhead for subagent workflows with killed agent handling.
165
165
166
-
**Try v0.1.62 Features:**
166
+
**Try v0.1.63 Features:**
167
167
```bash
168
-
# Install the MassGen Skill for your AI agent
169
-
npx skills add massgen/skills --all
170
-
# Then in Claude Code, Cursor, Copilot, etc.:
171
-
# /massgen "Your complex task"
172
-
173
-
# Try the Session Viewer
174
-
uv run massgen viewer --pick
168
+
pip install massgen==0.1.63
169
+
# Try the round evaluator with ensemble defaults
170
+
uv run massgen --config @examples/features/round_evaluator_example.yaml "Create a polished landing page for an AI product"
175
171
```
176
172
177
173
→ [See full release history and examples](massgen/configs/README.md#release-history--examples)
@@ -1243,25 +1239,27 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
1243
1239
1244
1240
⚠️ **Early Stage Notice:** As MassGen is in active development, please expect upcoming breaking architecture changes as we continue to refine and improve the system.
1245
1241
1246
-
### Recent Achievements (v0.1.62)
1242
+
### Recent Achievements (v0.1.63)
1243
+
1244
+
**🎉 Released: March 13, 2026**
1247
1245
1248
-
**🎉 Released: March 11, 2026**
1246
+
#### Ensemble Pattern
1247
+
-**Ensemble Defaults** ([#996](https://github.com/massgen/MassGen/pull/996)): `disable_injection` and `defer_voting_until_all_answered` now default to true for ensemble-style subagent orchestration
1248
+
-**Automatic Ensemble**: Defaults apply when spawning subagent orchestrators without explicit override
1249
1249
1250
-
#### MassGen Skill
1251
-
-**General-Purpose Skill** ([#992](https://github.com/massgen/MassGen/pull/992)): New multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents
1252
-
-**Auto-Sync**: GitHub Actions workflow to auto-sync skill to separate repository for easy installation
1253
-
-**Reference Docs**: Comprehensive workflow guides and prompt templates for each mode
1250
+
#### Round Evaluator Improvements
1251
+
-**Transformation Pressure** ([#996](https://github.com/massgen/MassGen/pull/996)): Evaluator pushes agents toward meaningful structural changes rather than surface-level edits
1252
+
-**Success Contracts** ([#996](https://github.com/massgen/MassGen/pull/996)): Explicit quality gates agents must satisfy before convergence
1253
+
-**Verification Replay**: Evaluation consistency across rounds via replayed verification context
1254
1254
1255
-
#### Session Viewer
1256
-
-**Viewer Command** ([#992](https://github.com/massgen/MassGen/pull/992)): New `massgen viewer` for real-time observation of automation sessions
1257
-
-**Interactive Picker**: `--pick` flag for session selection, `--web` for browser-based viewing
1255
+
#### Lighter Refinement & Fixes
1256
+
-**Lighter Refinement** ([#996](https://github.com/massgen/MassGen/pull/996)): Subagents use lighter refinement prompts to reduce token overhead and latency
1257
+
-**Killed Agent Handling**: Graceful management of agents that time out or fail mid-round
1258
+
-**Timeout Fallback**: More robust coordination when agents hit timeout boundaries
1258
1259
1259
-
#### Backend & Quickstart
1260
-
-**Backend Improvements** ([#992](https://github.com/massgen/MassGen/pull/992)): Claude Code background task execution, Codex native filesystem and MCP support, Copilot runtime model discovery
1261
-
-**Quickstart Modes**: Headless quickstart (`--quickstart --headless`) for CI/CD, web quickstart (`--web-quickstart`) for browser-based setup
✅ **MassGen Skill & Viewer (v0.1.62)**: General-purpose multi-agent skill with 4 modes (general, evaluate, plan, spec) for Claude Code and other AI agents. Session viewer for real-time observation. Backend improvements for Claude Code, Codex, and Copilot. Headless and web quickstart modes.
1265
1263
1266
1264
✅ **Round Evaluator Paradigm (v0.1.61)**: New round evaluator subagent type that automatically spawns evaluator subagents after each new answer to provide detailed feedback as input to the next round. Major orchestrator refactoring with improved evaluation prompts, task plan injection, and subagent fixes.
1267
1265
@@ -1530,13 +1528,12 @@ MassGen is currently in its foundational stage, with a focus on parallel, asynch
1530
1528
1531
1529
We welcome community contributions to achieve these goals.
1532
1530
1533
-
### v0.1.63 Roadmap
1531
+
### v0.1.64 Roadmap
1534
1532
1535
-
Version 0.1.63 focuses on adding a Gemini CLI backend and image/video editing capabilities:
1533
+
Version 0.1.64 focuses on adding a Gemini CLI backend:
1536
1534
1537
1535
#### Planned Features
1538
1536
-**Gemini CLI Backend** ([#952](https://github.com/massgen/MassGen/issues/952)): Gemini CLI as a first-class backend option
1539
-
-**Image/Video Edit Capabilities** ([#959](https://github.com/massgen/MassGen/issues/959)): Check and support image/video editing capabilities across providers
0 commit comments