[16:01:49Z] === predator-trio-bench run_id=09d8fbde-0008-49bb-99da-03eeaca72be1 === [16:01:49Z] Output: /Users/slobodan/projects/WeeyugaWeb/docs/BENCHMARKS/runs/09d8fbde-0008-49bb-99da-03eeaca72be1.jsonl [16:01:49Z] On-device mirror: predator:D:\WeeyugaBench\runs\09d8fbde-0008-49bb-99da-03eeaca72be1\ [16:01:49Z] Creating run-mirror dir on Predator: D:\WeeyugaBench\runs\09d8fbde-0008-49bb-99da-03eeaca72be1 [16:01:52Z] >>> MODEL: granite [16:01:52Z] Stopping any running llama-server on Predator... [16:01:55Z] Starting llama-server for granite with granite-4.1-8b-Q4_K_M.gguf... [16:01:59Z] Waiting for llama-server (granite) to come up... [16:03:09Z] llama-server up after 59s [16:03:11Z] Running llama-bench for granite... [16:04:52Z] Running prompts for granite (cold + 3 warm per prompt)... [16:04:52Z] prompt=hello cold [16:04:53Z] prompt=hello warm 1 [16:04:54Z] prompt=hello warm 2 [16:04:56Z] prompt=hello warm 3 [16:04:56Z] prompt=P-MEDIUM cold [16:05:03Z] prompt=P-MEDIUM warm 1 [16:05:09Z] prompt=P-MEDIUM warm 2 [16:05:16Z] prompt=P-MEDIUM warm 3 [16:05:23Z] prompt=P-HARD cold [16:05:43Z] prompt=P-HARD warm 1 [16:06:02Z] prompt=P-HARD warm 2 [16:06:21Z] prompt=P-HARD warm 3 [16:06:39Z] >>> MODEL: gemma [16:06:39Z] Stopping any running llama-server on Predator... [16:06:42Z] Starting llama-server for gemma with gemma-4-E4B-it-Q4_K_M.gguf... [16:06:45Z] Waiting for llama-server (gemma) to come up... [16:07:30Z] llama-server up after 37s [16:07:31Z] Running llama-bench for gemma... [16:08:12Z] Running prompts for gemma (cold + 3 warm per prompt)... [16:08:12Z] prompt=hello cold [16:08:16Z] prompt=hello warm 1 [16:08:17Z] prompt=hello warm 2 [16:08:20Z] prompt=hello warm 3 [16:08:23Z] prompt=P-MEDIUM cold [16:08:27Z] prompt=P-MEDIUM warm 1 [16:08:31Z] prompt=P-MEDIUM warm 2 [16:08:44Z] prompt=P-MEDIUM warm 3 [16:08:57Z] prompt=P-HARD cold [16:09:25Z] prompt=P-HARD warm 1 [16:09:42Z] prompt=P-HARD warm 2 [16:09:58Z] prompt=P-HARD warm 3 [16:10:12Z] >>> MODEL: qwen [16:10:12Z] Stopping any running llama-server on Predator... [16:10:16Z] Starting llama-server for qwen with Qwen3.5-9B-Q4_K_M.gguf... [16:10:19Z] Waiting for llama-server (qwen) to come up... [16:11:16Z] llama-server up after 49s [16:11:18Z] Running llama-bench for qwen... [16:13:48Z] Running prompts for qwen (cold + 3 warm per prompt)... [16:13:48Z] prompt=hello cold [16:13:53Z] prompt=hello warm 1 [16:13:58Z] prompt=hello warm 2 [16:14:03Z] prompt=hello warm 3 [16:14:08Z] prompt=P-MEDIUM cold [16:14:44Z] prompt=P-MEDIUM warm 1 [16:15:19Z] prompt=P-MEDIUM warm 2 [16:15:55Z] prompt=P-MEDIUM warm 3 [16:16:30Z] prompt=P-HARD cold [16:17:44Z] prompt=P-HARD warm 1 [16:18:54Z] prompt=P-HARD warm 2 [16:20:05Z] prompt=P-HARD warm 3 [16:21:15Z] Stopping any running llama-server on Predator... [16:21:18Z] === bench complete === [16:21:18Z] Synthesizing report... [16:21:18Z] Report: /Users/slobodan/projects/WeeyugaWeb/docs/BENCHMARKS/runs/09d8fbde-0008-49bb-99da-03eeaca72be1.md [16:21:19Z] Mirroring run-dir to Predator... [16:21:25Z] On-device mirror complete: predator:D:\WeeyugaBench\runs\09d8fbde-0008-49bb-99da-03eeaca72be1\