93a5675f06
chore(pi-notifications): remove remaining console.log
2026-04-28 13:06:44 +01:00
113878e83f
chore(pi-notifications): remove debug mode and console.log noise
...
- Remove PI_NOTIFICATION_DEBUG env var and steer signal logic
- Remove console.log on extension load
- Keep only: session_start notification + agent_end beep
- Clean README without debug references
2026-04-28 13:04:33 +01:00
7faabcb038
chore: update llm-metrics log
2026-04-28 13:03:38 +01:00
d7eabfffbb
docs(pi-notifications): update README for audio-based alerts
2026-04-28 13:03:11 +01:00
823af3c486
feat(pi-notifications): switch to afplay audio instead of desktop notifications
...
- Uses afplay to play an audio file (default: Glass.aiff)
- Configurable via PI_NOTIFICATION_AUDIO env var
- Works from sandboxed context — no osascript needed
- test-notify.ts verifies audio playback standalone
- Synced to auto-discovery extension path
2026-04-28 13:02:18 +01:00
040513e1d6
feat(pi-notifications): use 'tell application' for notifications to suppress Show button
...
- Tell target app (default: Ghostty) to display notification instead of raw osascript
- This attributes notification to the app, avoiding the 'Show' button that opens Script Editor
- Configurable via PI_NOTIFICATION_APP env var
- test-notify.ts falls back to plain display notification if target app isn't running
- Synced to auto-discovery extension path
2026-04-28 12:56:02 +01:00
45a13fd08c
feat(pi-notifications): add PI_NOTIFICATION_DEBUG mode with visible steer signal
...
- Add PI_NOTIFICATION_DEBUG=true env var
- When enabled, calls ctx.ui.steer() instead of desktop notification
- Lets you verify trigger logic in the agent loop without actual notifications
- Synced to both monorepo and auto-discovery extension paths
2026-04-28 12:09:06 +01:00
98e18643c5
pi-performance: Make Time to first token more accurate.
...
Summary of changes:
┌──────┬──────────────────────────────────────────────────────────────────┬──────────┐
│ Step │ Change │ Result │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 1 │ Removed duplicate llm-performance-metrics.test.ts │ 14 tests │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 2 │ Added rawTimestamps assertions to toLogEntry test │ 14 tests │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 3 │ Added rawTimestamps assertions to single-turn aggregate test │ 14 tests │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 4 │ Added rawTimestamps assertions to multi-turn aggregate test │ 14 tests │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 5 │ Added negative TTFT filtering test │ 15 tests │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 6 │ Added "first turn missing TTFT, later turns have it" test │ 16 tests │
├──────┼──────────────────────────────────────────────────────────────────┼──────────┤
│ 7 │ Added sanity check tests (warn on >500 tok/s, no warn otherwise) │ 18 tests │
└──────┴──────────────────────────────────────────────────────────────────┴──────────┘
This is what it looks like now when I run `pi`
📊 Performance: llama.cpp/Qwen3.6-35B-A3B-MXFP4_MOE.gguf
Prefill: 15,460 tokens @ 20104.0 tok/s
Generation: 12,179 tokens @ 52.6 tok/s
Combined: 27,639 tokens @ 118.9 tok/s (3.9m total)
TTFT: 769ms
Turns: 36
2026-04-28 10:52:00 +01:00
a38c76c65e
move pi-llm-performance to monorepo, update README and add deno.json
2026-04-28 10:06:03 +01:00