Bilateral halt
Either party stops by ceasing their next action. No explicit halt message, no attack surface, no DoS vector. The session settles at the token boundary you chose.
How halt works →A Solana state-channel payment protocol for streaming LLM inference. Either side can halt at any output-token boundary; settlement only ever moves USDC for output the consumer actually accepted. Built on x402.
A consumer asks for JSON. The producer streams tokens; the consumer signs incremental commitments. When the model goes off-topic, the evaluator halts the stream — and the unspent deposit refunds on-chain.
// click Start to stream tokensToday's pay-after-delivery model bundles commitment and value in two coarse points; TAP makes them flow together, token by token. The trust window collapses from "full request value" to "a few tokens of inference cost".
Build a producer or consumer in a few lines. The reference demo wraps Gemini 2.5 Flash; adapters are scaffolded for Anthropic, OpenAI, and local Ollama. The on-chain program is deployed to Solana devnet under 2tqofcitv1LHFGCLCmR9Kyke6TmArQwpHSinWWtmCje9.