[22:11:39Z] === pavilion-a3b-bench run_id=23066b38-ea9c-4dd3-b2f5-32912a67fce4 === [22:11:39Z] Output: /Users/slobodan/projects/WeeyugaWeb/docs/BENCHMARKS/runs/23066b38-ea9c-4dd3-b2f5-32912a67fce4.jsonl [22:11:39Z] Creating run-mirror dir on Pavilion: D:\WeeyugaBench\runs\23066b38-ea9c-4dd3-b2f5-32912a67fce4 [22:11:43Z] Polling for Qwen3-30B-A3B-UD-IQ2_XXS.gguf on Pavilion... [22:11:45Z] GGUF ready: 9.65 GB [22:11:45Z] Stopping any llama-server on Pavilion :11437... [22:11:49Z] Starting llama-server with Qwen3-30B-A3B-UD-IQ2_XXS.gguf on :11437... [22:11:54Z] Waiting for llama-server (up to 12 min — Pavilion mmap of 9.65 GB IQ2_XXS is slow)... [22:13:48Z] llama-server up after 95s [22:13:50Z] Running llama-bench (Pavilion + 30B + heavy offload — ~15-25 min budget)... [22:24:05Z] Running prompts (cold + 3 warm per prompt)... [22:24:05Z] prompt=hello cold [22:27:52Z] prompt=hello warm 1 [22:28:08Z] prompt=hello warm 2 [22:28:25Z] prompt=hello warm 3 [22:28:37Z] prompt=P-MEDIUM cold [22:30:49Z] prompt=P-MEDIUM warm 1 [22:31:51Z] prompt=P-MEDIUM warm 2 [22:32:41Z] prompt=P-MEDIUM warm 3 [22:33:26Z] prompt=P-HARD cold [22:35:31Z] prompt=P-HARD warm 1 [22:36:40Z] prompt=P-HARD warm 2 [22:37:37Z] prompt=P-HARD warm 3 [22:38:36Z] Stopping any llama-server on Pavilion :11437... [22:38:39Z] === bench complete === [22:38:39Z] Synthesizing report... [22:38:39Z] Report: /Users/slobodan/projects/WeeyugaWeb/docs/BENCHMARKS/runs/23066b38-ea9c-4dd3-b2f5-32912a67fce4.md [22:38:39Z] Mirroring run-dir to Pavilion...