Notes tagged #trusted-ai
3 posts
- 2 min read
The full turn: the model is an API call
The full turn: the model is an API call that costs kopecks. Two circuits: public on CPU, sovereign on-site. The edge lives in the layer above the model.
- 2 min read
Picking the model for the hardware I have
Picking a model for the trusted-AI build on 24 GB. The bench beat intuition: a MoE beat a dense model twice its size; 'thinking' only got in the way.
- 2 min read
Do I even need my own GPU?
VRAM is tight and rental prices are rising. I sat down to pick a GPU, and the real question turned out to be whether I need one at all.