[22:02:09Z] === predator-a3b-bench run_id=5fb2913d-6500-4ecf-9e97-d43f7dd61145 === [22:02:09Z] Output: /Users/slobodan/projects/WeeyugaWeb/docs/BENCHMARKS/runs/5fb2913d-6500-4ecf-9e97-d43f7dd61145.jsonl [22:02:09Z] Creating run-mirror dir on Predator: D:\WeeyugaBench\runs\5fb2913d-6500-4ecf-9e97-d43f7dd61145 [22:02:17Z] Polling for Qwen3-30B-A3B-UD-IQ2_M.gguf on Predator (download in progress)... [22:02:20Z] download progress: 7.23 GB (poll 1/60) [22:02:53Z] download progress: 7.39 GB (poll 2/60) [22:03:26Z] download progress: 7.54 GB (poll 3/60) [22:03:58Z] download progress: 7.69 GB (poll 4/60) [22:04:31Z] download progress: 7.86 GB (poll 5/60) [22:05:04Z] download progress: 8 GB (poll 6/60) [22:05:38Z] download progress: 8.09 GB (poll 7/60) [22:06:11Z] download progress: 8.23 GB (poll 8/60) [22:06:46Z] download progress: 8.39 GB (poll 9/60) [22:07:18Z] download progress: 8.55 GB (poll 10/60) [22:07:51Z] download progress: 8.69 GB (poll 11/60) [22:08:23Z] download progress: 8.88 GB (poll 12/60) [22:08:55Z] download progress: 9.04 GB (poll 13/60) [22:09:28Z] download progress: 9.13 GB (poll 14/60) [22:10:01Z] download progress: 9.3 GB (poll 15/60) [22:10:34Z] download progress: 9.49 GB (poll 16/60) [22:11:07Z] download progress: 9.64 GB (poll 17/60) [22:11:40Z] download progress: 9.81 GB (poll 18/60) [22:12:13Z] download progress: 9.99 GB (poll 19/60) [22:12:44Z] GGUF ready: 10.12 GB [22:12:44Z] Stopping llama-server on Predator... [22:12:47Z] Starting llama-server with Qwen3-30B-A3B-UD-IQ2_M.gguf (this is the slow one — 17.7 GB to mmap, lots of layer probes)... [22:12:50Z] Waiting for llama-server to come up (give it up to 6 min — 30B mmap from disk is slow)... [22:15:00Z] llama-server up after 108s [22:15:01Z] Running llama-bench (this also takes much longer for 30B; budget ~10-15 min)... [22:19:38Z] Running prompts (cold + 3 warm per prompt; 30B + thinking will be slow on P-HARD)... [22:19:38Z] prompt=hello cold [22:20:04Z] prompt=hello warm 1 [22:20:20Z] prompt=hello warm 2 [22:20:36Z] prompt=hello warm 3 [22:20:52Z] prompt=P-MEDIUM cold [22:22:29Z] prompt=P-MEDIUM warm 1 [22:23:56Z] prompt=P-MEDIUM warm 2 [22:25:42Z] prompt=P-MEDIUM warm 3 [22:27:51Z] prompt=P-HARD cold [22:31:17Z] prompt=P-HARD warm 1 [22:34:25Z] prompt=P-HARD warm 2 [22:38:04Z] prompt=P-HARD warm 3 [22:41:07Z] Stopping llama-server on Predator... [22:41:10Z] === bench complete === [22:41:10Z] Synthesizing report... [22:41:10Z] Report: /Users/slobodan/projects/WeeyugaWeb/docs/BENCHMARKS/runs/5fb2913d-6500-4ecf-9e97-d43f7dd61145.md [22:41:10Z] Mirroring run-dir to Predator...