Robustness Testing Reproducing #21206

pontaberglund · 2026-01-27T10:54:25Z

Hello! I need some help with the robustness testing. For example, I am trying to reproduce issue #17529 using the command from the README.md. I might not understand everything correctly, but should the issue always be reproducible, or is it flaky? Currently, I get "Failed to reproduce" at the end, even though the test passes 100 times. Is it possible to reproduce it consistently?

TLDR: Can I somehow reproduce the issues documented in README.md consistently?

Answered by serathius

serathius · 2026-01-27T11:27:17Z

Maintainer

It's very hard to make issues 100% reproducible; some reproduce about 10% of the time, while others reproduce less than 1% of the time. That's why we run 100 tries per reproduction. The overall goal is to keep the reproduction rate as high as possible, to ensure we are not losing previous functionality while developing robustness. In reality, however, it's very hard to maintain even a soft guarantee over time, as this is pretty hard to automate and maintain.
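To put rough numbers on this, here is a back-of-the-envelope sketch. It assumes every try is independent and has the same fixed chance of reproducing the bug, which is a simplification, and `chance_of_reproducing` is just an illustrative name, not part of the robustness test tooling.

```python
def chance_of_reproducing(per_try_rate: float, tries: int = 100) -> float:
    """Probability that at least one of `tries` independent runs reproduces the bug.

    Assumption: each try reproduces the bug with the same probability
    `per_try_rate`, independently of the other tries.
    """
    if not 0.0 <= per_try_rate <= 1.0:
        raise ValueError("per_try_rate must be between 0 and 1")
    if tries < 0:
        raise ValueError("tries must be non-negative")
    # P(at least one success) = 1 - P(every try misses)
    return 1.0 - (1.0 - per_try_rate) ** tries


if __name__ == "__main__":
    for rate in (0.10, 0.01, 0.001):
        overall = chance_of_reproducing(rate)
        print(f"{rate:.1%} per try -> {overall:.1%} over 100 tries")
```

Under these assumptions, a bug that reproduces 10% of the time is almost certain to show up within 100 tries, while one that reproduces 1% of the time shows up in only about 63% of 100-try runs. So an occasional "Failed to reproduce" is expected for the rarer issues, even on a commit where the bug is still present.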

My latest attempt to improve this was documenting the process. As part of that, I tracked and documented the last commit on which I was able to reproduce each particular bug. I did that by running the reproduction script and, if it didn't work, bisecting the commit history to find the last commit where it did. My goal is that we can use this to track down which change broke each reproduction, and fix all reproductions over time.
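The bisection step could be sketched roughly like this. It is only an illustration, not the actual reproduction script: `last_reproducing_commit` and the `reproduces` callback are hypothetical names, and the sketch assumes the commits are ordered oldest to newest and that once a reproduction breaks, it stays broken on every later commit. Since reproductions are flaky, a real `reproduces` check would need many tries per commit, and a false "did not reproduce" can still send the search in the wrong direction.

```python
from typing import Callable, Optional, Sequence


def last_reproducing_commit(
    commits: Sequence[str],
    reproduces: Callable[[str], bool],
) -> Optional[str]:
    """Binary-search for the newest commit on which the bug still reproduces.

    Assumptions: `commits` is ordered oldest to newest, and every commit
    after the first non-reproducing one also fails to reproduce.
    Returns None if no commit reproduces the bug.
    """
    lo, hi = 0, len(commits) - 1
    found: Optional[str] = None
    while lo <= hi:
        mid = (lo + hi) // 2
        if reproduces(commits[mid]):
            found = commits[mid]  # reproduces here, so try newer commits
            lo = mid + 1
        else:
            hi = mid - 1  # does not reproduce here, so try older commits
    return found


if __name__ == "__main__":
    history = ["a1", "b2", "c3", "d4", "e5"]  # made-up commit IDs
    still_reproduces = {"a1", "b2", "c3"}     # made-up results
    print(last_reproducing_commit(history, still_reproduces.__contains__))  # prints "c3"
```

The commit right after the one returned is then the first candidate for the change that broke the reproduction.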

For #17529 specifically, I have not been able to reproduce it on the main branch for some time. The most recent successful reproduction was on c272ade from May 30, 2025.

1 reply

pontaberglund Jan 27, 2026
Author

Okay, thank you for the quick reply!
