Groq API
api.groq.com
Groq provides ultra-fast LLM inference with an OpenAI-compatible API. Use Groq to generate chat completions, transcribe audio, and translate audio to English using models like Llama 3.3 and Whisper. Groq's custom LPU hardware delivers the fastest token generation speeds available.
api_key
free
1.0
https://api.groq.com
1
2026-03-01T11:07:56.000Z
Health
This is a community-maintained manifest. Health monitoring is not available because this service doesn't host its own /.well-known/agent endpoint yet. Learn more about trust levels →
Capabilities
create_chat_completion
communicationGenerate a chat completion using Groq's fast inference. Supports models like llama-3.3-70b-versatile, mixtral-8x7b-32768, and gemma2-9b-it. Accepts a messages array with role/content pairs.
Detail: https://api.groq.com/capabilities/create_chat_completion
create_transcription
communicationTranscribe audio to text using Whisper Large V3 on Groq hardware. Accepts audio files in mp3, mp4, wav, webm, m4a, and other formats. Returns transcribed text with optional timestamps.
Detail: https://api.groq.com/capabilities/create_transcription
create_translation
communicationTranslate audio from any supported language into English text using Whisper Large V3. Accepts the same audio formats as transcription.
Detail: https://api.groq.com/capabilities/create_translation
list_models
communicationList all models currently available on Groq's inference platform, including their IDs, ownership, and creation dates.
Detail: https://api.groq.com/capabilities/list_models
Agent Preview
This is what an AI agent sees when it discovers this service via the Gateway:
Service: Groq API Description: Groq provides ultra-fast LLM inference with an OpenAI-compatible API. Use Groq to generate chat completions, transcribe audio, and translate audio to English using models like Llama 3.3 and Whisper. Groq's custom LPU hardware delivers the fastest token generation speeds available. Auth: api_key Capabilities: - create_chat_completion: Generate a chat completion using Groq's fast inference. Supports models like llama-3.3-70b-versatile, mixtral-8x7b-32768, and gemma2-9b-it. Accepts a messages array with role/content pairs. - create_transcription: Transcribe audio to text using Whisper Large V3 on Groq hardware. Accepts audio files in mp3, mp4, wav, webm, m4a, and other formats. Returns transcribed text with optional timestamps. - create_translation: Translate audio from any supported language into English text using Whisper Large V3. Accepts the same audio formats as transcription. - list_models: List all models currently available on Groq's inference platform, including their IDs, ownership, and creation dates.