Pay per use

Call any model endpoint and settle each request in USDC over x402 — no accounts, no API keys.

Every model is gated by the x402 HTTP payment protocol. The first call to an endpoint returns a 402 Payment Required response carrying payment instructions. Complete the Solana USDC payment, then retry the same request with the payment proof in the X-PAYMENT header, and the inference runs. Each request is metered and settled individually.

Capabilities

  • Vision (image input)
  • Reasoning effort control
  • System prompts
  • Multi-model endpoints
  • On-chain verified (TEE) inference on select models
  • OpenAI-compatible models flagged where available

# 1) Call the endpoint and receive 402 Payment Required with payment requirements
curl -X POST "https://api.xona-agent.com/llm/nim" \
  -H "Content-Type: application/json" \
  -d '{"model":"llama-3.3-70b","messages":[{"role":"user","content":"Explain x402 in one line"}]}'

# 2) Sign the Solana USDC payment with your x402 client, then retry with the proof header
curl -X POST "https://api.xona-agent.com/llm/nim" \
  -H "Content-Type: application/json" \
  -H "X-PAYMENT: <x402-payment-payload>" \
  -d '{"model":"llama-3.3-70b","messages":[{"role":"user","content":"Explain x402 in one line"}]}'
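
The two calls above can be chained in a small script. The sketch below is illustrative only. It assumes the payment instructions arrive in the body of the 402 response, and `sign_payment` is a hypothetical placeholder for whatever x402 client produces the proof payload. `next_step` and `call_with_payment` are names made up for this example and are not part of any Xona tooling.

```shell
#!/usr/bin/env bash
# Sketch of the 402 -> pay -> retry loop. Nothing here is an official client.

ENDPOINT="https://api.xona-agent.com/llm/nim"
BODY='{"model":"llama-3.3-70b","messages":[{"role":"user","content":"Explain x402 in one line"}]}'

# Map the HTTP status of the first call to the next action.
next_step() {
  case "$1" in
    402) echo "pay" ;;    # payment required: sign, then retry
    2??) echo "done" ;;   # request was served without a retry
    *)   echo "error" ;;  # anything else is unexpected
  esac
}

# HYPOTHETICAL placeholder: swap in your x402 client here.
# Reads the 402 response on stdin and prints the proof payload.
sign_payment() {
  echo "<x402-payment-payload>"
}

# Send the request once, pay if asked to, and print the final response.
call_with_payment() {
  local tmp status proof
  tmp="$(mktemp)"
  status="$(curl -s -o "$tmp" -w '%{http_code}' -X POST "$ENDPOINT" \
    -H "Content-Type: application/json" -d "$BODY")"
  case "$(next_step "$status")" in
    pay)
      proof="$(sign_payment < "$tmp")"
      curl -s -X POST "$ENDPOINT" \
        -H "Content-Type: application/json" \
        -H "X-PAYMENT: $proof" -d "$BODY"
      ;;
    done)
      cat "$tmp"
      ;;
    *)
      echo "unexpected status: $status" >&2
      rm -f "$tmp"
      return 1
      ;;
  esac
  rm -f "$tmp"
}
```

Source the script and run `call_with_payment` to exercise the flow. Until `sign_payment` is replaced with a real signer, the retry sends only the placeholder string and should not be expected to succeed.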

See x402 for the full payment flow and AI services for the model catalog.

For programmatic payments, the xPay SDK/CLI handles the 402 flow automatically — see the SDK.
