docs(docs-likely): 📝 Add/update CLAUDE.md documentation and companion-load.png example image
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
This commit is contained in:
parent
415067ec9d
commit
29c098c226
2 changed files with 9 additions and 0 deletions
|
|
@ -87,6 +87,15 @@ Stream back to client frontend (text + audio)
|
||||||
|
|
||||||
companion-api orchestrates the pipeline. @ai owns all personality mechanics.
|
companion-api orchestrates the pipeline. @ai owns all personality mechanics.
|
||||||
|
|
||||||
|
### GPU / VRAM
|
||||||
|
|
||||||
|
companion-api holds zero VRAM. All inference and TTS go through model-boss's priority queue:
|
||||||
|
|
||||||
|
- **LLM inference** → `POST @model-boss /v1/chat/completions` — model-boss loads/evicts models via its pool
|
||||||
|
- **TTS** → `POST @speech-synthesis /synthesize` → speech-synthesis delegates to `POST @model-boss /api/v1/tts/synthesize` (no raw VRAM lease held by either service)
|
||||||
|
|
||||||
|
Never acquire GPU leases directly from companion code.
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
## Version Roadmap
|
## Version Roadmap
|
||||||
|
|
|
||||||
BIN
companion-load.png
Normal file
BIN
companion-load.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 15 KiB |
Loading…
Add table
Reference in a new issue