Ultra-low-latency access to a Gemini-powered LLM for fast text generation and conversational responses.