Speech-to-text endpoint that transcribes audio and video, billed per minute in USDC on Solana via x402Factory.