{"ok":true,"host":"jatevo.ai","status":"ready","manifest":{"positioning":"jatevo.ai is a pay-per-call LLM inference host serving agents and developers who need access to large open-source models without managing their own infrastructure. It offers two distinct models—a 120B OSS model with up to 65536-token context and DeepSeek V3.1—both accessible via a standard chat completion interface. It is suited for text generation, analysis, summarization, and multi-turn conversation workloads where cost is settled on-chain.","host_overview":"jatevo.ai hosts two large language model inference endpoints: a GPT-OSS 120B open-source model and DeepSeek V3.1. Both accept chat-formatted message arrays and return completions, with billing handled per-call via USDC on Base.","routing_guidance":"Use jatevo.ai when an agent needs on-demand LLM text completion from a hosted open-source model with per-call USDC billing and no infrastructure setup. Choose invoke-gpt-oss-120b-chat when large context windows (up to 65536 tokens) are required. Choose query-deepseek-v3-1-llm when DeepSeek V3.1 output quality or style is preferred and token usage reporting is needed. Do not use this host for image generation, embeddings, vector search, or real-time data retrieval—those require dedicated endpoints not available here. Do not use either skill when a proprietary closed-source model (e.g., GPT-4o, Claude) is explicitly required; route to OpenAI or Anthropic APIs instead. Avoid stream=false with large max_tokens on query-deepseek-v3-1-llm in latency-sensitive contexts.","capability_clusters":[{"skill_names":["invoke-gpt-oss-120b-chat","query-deepseek-v3-1-llm"],"cluster_name":"Chat Completion Inference","cluster_summary":"Both skills provide chat-formatted LLM inference, accepting message arrays with configurable temperature, max tokens, and optional streaming. Together they give agents a choice between two hosted open-source models for text generation tasks."}],"cross_skill_workflows":[{"steps":[{"skill_name":"invoke-gpt-oss-120b-chat","description":"Send the same message array to the GPT-OSS 120B model and capture its completion and token usage."},{"skill_name":"query-deepseek-v3-1-llm","description":"Send the identical message array to DeepSeek V3.1 and capture its completion and token usage statistics for side-by-side comparison."}],"when_to_use":"Use when an agent needs to compare outputs from two different LLMs on the same prompt to evaluate quality, consistency, or stylistic differences before selecting a model for downstream use.","workflow_name":"Model Comparison Completion"}]},"model":"claude-sonnet-4-6","version_no":2,"generated_at":"2026-05-20T02:27:29.931Z","provenance":"ai_authored_unreviewed","ai_authored":true,"merchant_reviewed":false,"merchant_edited":false,"merchant_reviewed_at":null,"merchant_edited_at":null,"skill_md_url":"https://x402gle.com/servers/jatevo.ai/SKILL.md","skills_url":"https://x402gle.com/servers/jatevo.ai/skills.json"}