From 57857e0a4eac8da074314c441f6d6f00f5145ad4 Mon Sep 17 00:00:00 2001
From: w1zdun <19057733+w1zdun@users.noreply.github.com>
Date: Sun, 7 Jun 2026 17:18:18 +0200
Subject: [PATCH] fix(tests): widen waitForever CPU test window to eliminate
 parallel-suite flakiness
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

The regression test for #27 measures process-wide user CPU
(getrusage(RUSAGE_SELF)) over a 200ms window with a 50,000 µs threshold.
swift-testing runs all other suites in parallel in the same process, and
their overhead alone was observed at ~70,000 µs — tripping the threshold
with no busy-loop present.

The bug being guarded against pins a full core (~1 µs CPU per µs of
wall-clock), so the threshold must scale with the window while suite
noise stays roughly constant (the rest of the static suite finishes
within the first ~100ms). Widen the window to 500ms and set the
threshold to half a core (250,000 µs): ~3x above observed noise, ~2x
below a pinned core.

Verified: 3 consecutive full static-suite runs pass; with the original
busy-loop temporarily reintroduced the test fails (528,314 µs observed).
---
 .../WaitForeverCpuTests.swift                 | 22 +++++++++++--------
 1 file changed, 13 insertions(+), 9 deletions(-)
diff --git a/Tests/Container-Compose-StaticTests/WaitForeverCpuTests.swift b/Tests/Container-Compose-StaticTests/WaitForeverCpuTests.swift
index ce99641e..bba9ef3f 100644
--- a/Tests/Container-Compose-StaticTests/WaitForeverCpuTests.swift
+++ b/Tests/Container-Compose-StaticTests/WaitForeverCpuTests.swift
@@ -32,18 +32,22 @@ struct WaitForeverCpuTests {
     /// Regression for #27: `waitForever()` must suspend, not busy-loop.
     ///
     /// Method: take a `getrusage(RUSAGE_SELF)` snapshot, spawn a child Task
-    /// that calls `waitForever()`, sleep for 200ms wall-clock, take a second
+    /// that calls `waitForever()`, sleep for 500ms wall-clock, take a second
     /// snapshot, and compare user-CPU consumed.
     ///
     /// On the bug (`for await _ in AsyncStream<Void>(unfolding: {})`), the
-    /// child task pins one core, so user CPU consumed during the 200ms window
-    /// is around 200,000 µs (one full core). With the fix
+    /// child task pins one core, so user CPU consumed during the 500ms window
+    /// is around 500,000 µs (one full core). With the fix
     /// (`withUnsafeContinuation { _ in }`), the child task suspends and
     /// consumes essentially nothing.
     ///
-    /// Threshold of 50,000 µs gives ~4× headroom over a noisy CI baseline
-    /// (test-runner overhead, parallel tasks) while still reliably catching
-    /// a single core's worth of busy-loop work.
+    /// The measurement is process-wide, and swift-testing runs other suites
+    /// in parallel in the same process, so the baseline includes their CPU
+    /// (observed up to ~70,000 µs for the full static suite). That noise is
+    /// front-loaded — the rest of the suite finishes within the first
+    /// ~100ms — while a busy-loop scales with the window. The threshold of
+    /// half a core (250,000 µs over 500ms) therefore sits ~3× above suite
+    /// noise and ~2× below a pinned core.
     ///
     /// Side effect: this test leaks one suspended task per invocation
     /// (`waitForever` is `-> Never` and the suspended task can't be cancelled
@@ -61,12 +65,12 @@ struct WaitForeverCpuTests {
             await composeUp.waitForever()
         }
 
-        try await Task.sleep(nanoseconds: 200_000_000)  // 200ms
+        try await Task.sleep(nanoseconds: 500_000_000)  // 500ms
 
         let after = userCpuMicroseconds()
         let consumed = after - before
 
-        #expect(consumed < 50_000,
-                "waitForever consumed \(consumed) µs of user CPU in 200ms — likely busy-looping (regression for #27)")
+        #expect(consumed < 250_000,
+                "waitForever consumed \(consumed) µs of user CPU in 500ms — likely busy-looping (regression for #27)")
     }
 }