Fix unreliable capture of standard/error outputs by rudolf-adamkovic · Pull Request #184 · nuprl/MultiPL-E

rudolf-adamkovic · 2026-03-17T20:06:54Z

Fix potentially incomplete output capture when running unit tests, as seen with e.g. GNU Guile. Changes:

Drain standard output and error output pipes
Fix triply hard-coded stdout/stderr truncation
Increase stdout/stderr truncation from 2kB to 16kB

Fix potentially incomplete output capture when running unit tests, as seen with e.g. GNU Guile. Changes: - Drain standard output and error output pipes - Fix triply hard-coded stdout/stderr truncation - Increase stdout/stderr truncation from 2kB to 16kB

arjunguha

The draining fix is right and we have that in other places.

Capturing the entire output can cause a lot of problems, especially when a program produces unbounded output.

rudolf-adamkovic · 2026-04-16T13:36:19Z

Capturing the entire output can cause a lot of problems, especially when a program produces unbounded output.

Yes, a limit is necessary. I found 4kB insufficient and anything above 16kB problematic. In my measurements, across 15 languages (3 million completions generated), the 4kB limit caused minimal changes in pass@1 (Instruct & Base), but it made the output files less useful for deeper analyses. More specifically (Instruct, t = 0.2), the 4kB limit reclassified 759 (2.32%) out of 32,654 failing completions (χ2 = 47, p < 0.001, ϕ = 0.04, n = 32,654), 93.41% becoming identifier errors that truncation had obscured. Scheme (GNU Guile) was affected the most, as its errors include stack traces.

rudolf-adamkovic marked this pull request as ready for review March 26, 2026 14:16

arjunguha reviewed Apr 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix unreliable capture of standard/error outputs#184

Fix unreliable capture of standard/error outputs#184
rudolf-adamkovic wants to merge 1 commit intonuprl:mainfrom
rudolf-adamkovic:fix-output-capture

rudolf-adamkovic commented Mar 17, 2026

Uh oh!

arjunguha left a comment

Uh oh!

rudolf-adamkovic commented Apr 16, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rudolf-adamkovic commented Mar 17, 2026

Uh oh!

arjunguha left a comment

Choose a reason for hiding this comment

Uh oh!

rudolf-adamkovic commented Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rudolf-adamkovic commented Apr 16, 2026 •

edited

Loading