Cheaper inference,
verified quality

Access 24+ AI models through one OpenAI-compatible endpoint. Competitive pricing from a live marketplace, backed by multi-layer quality enforcement.

Why buy on Dragonfly

Pay less. Get the same quality. See everything.

A spot market where price, reliability, and settlement are visible — not a black box with hidden margins.

Price

Market-driven pricing

Sellers compete on a live order book, driving prices below official rates. You always pay the best available ask — no opaque markups.

Compatibility

One endpoint, every model

Same OpenAI SDK, one base URL. Switch between DeepSeek, Qwen, GLM, Kimi, MiniMax and more with zero code changes.

Control

Policy-driven routing

Bind cost, latency, and provider constraints to your API key. The matching engine enforces your rules on every request.

Visibility

Full settlement trail

Every request produces route_attempt, route_result, and settlement_result events. Debug cost and reliability from one ledger.

How we guarantee quality

Six layers of protection between you and a bad response.

1

Health scoring

Every seller is scored on success rate, latency (p50/p90/p99), and availability in 30-minute windows. Anomalies are flagged automatically.

2

Quality sampling

Relay responses are periodically compared against official provider output. We measure WER, CER, and cosine similarity per listing.

3

Seller reserve

A percentage of every trade is held in reserve. If a seller underperforms, the reserve is consumed to compensate affected buyers.

4

Automatic enforcement

Risk flags trigger auto-suspension or delisting. Sellers accumulate penalty points — unreliable capacity is removed from the book.

5

Quality-first matching

Choose quality_first routing and the engine weights health (35%), error rate (35%), and latency (30%) above cost.

6

Dispute & refund

Suspicious trades are flagged for review. Confirmed issues result in automatic buyer refunds from the seller's reserve.

Start routing in 3 steps

From signup to first inference in under a minute.

1

Create an API key

Sign up, top up your balance, and generate a key. Bind a routing profile for cost, latency, or quality priority.

2

Point your SDK

Change your base URL to Dragonfly. Same OpenAI SDK, same parameters. One line of code.

3

Requests route automatically

The matching engine finds the best eligible seller, executes the request, and settles the trade. You see every step.

Ready to route smarter?

Create a key, add $0.10 to start (or use the free trial credit), and make your first request. No minimums, no lock-in.

Dragonfly - AI Model Marketplace / AI 模型市場