-
Notifications
You must be signed in to change notification settings - Fork 131
Description
I have a few iterations for per response context tracking n_tokens via API. Also, for persistent context monitor in chat. I'd like to measure you interest ahead of PR.
How do you feel about this in light of the existing /usage metrics estimation (and compression) system?
For API usage, I don't presume you are open to a complete replacement of existing integrations, or would you prefer parallell development (or perhaps coexistence as a fallback system when API metrics are unavailable)?
For persistent usage monitor in chat, rendering has some issues for scrolling and window resizing - these would be most easily overcome via sidebar - something you are comfortable with?
A perfect compliment to compression, accurate context would open the doors to enhanced user awareness on the near side, and agent self-awareness on the far side. Autonomous looping would be an excellent place to go from there.
It would also be fairly easy to create var for user setting max context (per config server) in env.
Thoughts? Thanks