For API users, the promise is simple.
The first words show up quickly, the answer keeps streaming, and a traffic spike from someone else does not turn your app into a waiting room.
Your AI feature should feel the same at 2 PM as it does at 2 AM. Perchy gives each active app a clear temporary lane, then releases it when the app goes quiet.
Perchy keeps your app in a clear lane on machines that are never filled past the point where users start to feel a slowdown.
A normal shared API sells the same capacity again and again, so every user can feel slower when traffic rises. Perchy sells positions only up to the calm operating range of each group of machines.
When your app sends requests, the position is yours. If it stops sending for longer than the timeout you choose, the position returns to the market.
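One way to picture the release rule is a small timer model. The sketch below is illustrative only: `LanePosition`, `touch`, and `isHeld` are hypothetical names, not part of Perchy's API, and it assumes the idle timeout is measured from the most recent request.

```javascript
// Hypothetical model of a lane position; not Perchy's actual API.
class LanePosition {
  constructor(idleTimeoutMs) {
    this.idleTimeoutMs = idleTimeoutMs; // timeout chosen by the app
    this.lastRequestAt = null;          // null means no position is held yet
  }

  // Record a request at time `nowMs`; this claims or refreshes the position.
  touch(nowMs) {
    this.lastRequestAt = nowMs;
  }

  // Assumed rule: the position is held until the idle timeout
  // elapses with no new request.
  isHeld(nowMs) {
    if (this.lastRequestAt === null) return false;
    return nowMs - this.lastRequestAt < this.idleTimeoutMs;
  }
}
```

With a 5-second timeout, a request at t = 1 s keeps the position held at t = 3 s, and the position is back on the market by t = 6 s unless another request arrives first.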
You pay while your app occupies a position. The minimum is one second. A longer idle hold costs more, but keeps your lane ready between bursts.
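The trade-off can be worked through with some rough arithmetic. This is a sketch under stated assumptions, not Perchy's published pricing: it assumes the position stays occupied through the idle timeout after the last request, that idle seconds are billed like active ones, and that `pricePerSecond` is a made-up rate. The function names are hypothetical.

```javascript
// Hypothetical billing sketch; the formula and the rate are assumptions.
const MIN_BILLED_SECONDS = 1; // the stated one-second minimum

// Seconds billed for one burst of activity followed by an idle hold.
function billedSeconds(activeSeconds, idleTimeoutSeconds) {
  // Assumed: the position is occupied for the active time plus the idle hold.
  const occupied = activeSeconds + idleTimeoutSeconds;
  return Math.max(MIN_BILLED_SECONDS, occupied);
}

// Total cost at a hypothetical per-second rate.
function cost(activeSeconds, idleTimeoutSeconds, pricePerSecond) {
  return billedSeconds(activeSeconds, idleTimeoutSeconds) * pricePerSecond;
}
```

Under these assumptions, a 3-second burst with a 10-second idle hold bills 13 seconds, while the same burst with a 30-second hold bills 33: the longer hold costs more but keeps the lane ready for the next burst.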
Connect from home, the studio, or a small lab. No public address or network setup is required; your machine can earn while it is available.
const answer = await perchy.chat({
  model: "fast",
  lane: "keep-clear"
});