Summary of changes:
┌──────┬──────────────────────────────────────────────────────────────────┬──────────┐
│ Step │ Change │ Result │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 1 │ Removed duplicate llm-performance-metrics.test.ts │ 14 tests │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 2 │ Added rawTimestamps assertions to toLogEntry test │ 14 tests │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 3 │ Added rawTimestamps assertions to single-turn aggregate test │ 14 tests │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 4 │ Added rawTimestamps assertions to multi-turn aggregate test │ 14 tests │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 5 │ Added negative TTFT filtering test │ 15 tests │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 6 │ Added "first turn missing TTFT, later turns have it" test │ 16 tests │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 7 │ Added sanity check tests (warn on >500 tok/s, no warn otherwise) │ 18 tests │
└──────┴──────────────────────────────────────────────────────────────────┴──────────┘
This is what it looks like now when I run `pi`
📊 Performance: llama.cpp/Qwen3.6-35B-A3B-MXFP4_MOE.gguf
Prefill: 15,460 tokens @ 20104.0 tok/s
Generation: 12,179 tokens @ 52.6 tok/s
Combined: 27,639 tokens @ 118.9 tok/s (3.9m total)
TTFT: 769ms
Turns: 36
32 lines
783 B
JSON
32 lines
783 B
JSON
{
|
|
"version": "5",
|
|
"specifiers": {
|
|
"jsr:@std/assert@*": "1.0.19",
|
|
"jsr:@std/assert@^1.0.19": "1.0.19",
|
|
"jsr:@std/internal@^1.0.12": "1.0.12",
|
|
"jsr:@std/testing@*": "1.0.18"
|
|
},
|
|
"jsr": {
|
|
"@std/assert@1.0.19": {
|
|
"integrity": "eaada96ee120cb980bc47e040f82814d786fe8162ecc53c91d8df60b8755991e",
|
|
"dependencies": [
|
|
"jsr:@std/internal"
|
|
]
|
|
},
|
|
"@std/internal@1.0.12": {
|
|
"integrity": "972a634fd5bc34b242024402972cd5143eac68d8dffaca5eaa4dba30ce17b027"
|
|
},
|
|
"@std/testing@1.0.18": {
|
|
"integrity": "d3152f57b11666bf6358d0e127c7e3488e91178b0c2d8fbf0793e1c53cd13cb1",
|
|
"dependencies": [
|
|
"jsr:@std/assert@^1.0.19"
|
|
]
|
|
}
|
|
},
|
|
"workspace": {
|
|
"dependencies": [
|
|
"jsr:@std/assert@1"
|
|
]
|
|
}
|
|
}
|