Groq provides extremely fast AI inference using custom LPU (Language Processing Unit) hardware, delivering some of the lowest response latencies of any hosted provider.

Available models

Llama 3 (via Groq)

Strengths: Extreme speed, low latency

Fastest inference available

Mixtral (via Groq)

Strengths: Speed with capability

Fast mixture-of-experts model

Key features

  • Extreme speed: Fastest inference in the industry
  • Low latency: Sub-second response times
  • High throughput: Process many requests quickly
  • Competitive quality: Good model performance
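Groq serves these models through an OpenAI-compatible chat-completions API. As a minimal sketch, the snippet below builds the JSON request body for a single-turn completion; the endpoint URL and the `llama3-8b-8192` model name are assumptions here, so check Groq's own docs for current values.

```python
import json

# Assumed OpenAI-compatible endpoint; verify against Groq's documentation.
GROQ_URL = "https://api.groq.com/openai/v1/chat/completions"

def build_chat_request(model: str, user_message: str, temperature: float = 0.0) -> dict:
    """Build the JSON body for a single-turn chat completion."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": user_message}],
        "temperature": temperature,
    }

# Model name is illustrative, not an exhaustive or current list.
payload = build_chat_request("llama3-8b-8192", "Say hello in one word.")
print(json.dumps(payload, indent=2))
```

You would POST this body to the endpoint with an `Authorization: Bearer <API key>` header, exactly as with the OpenAI API.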

Best use cases

  • Real-time applications
  • Interactive chat experiences
  • High-volume API processing
  • Latency-sensitive applications
  • Rapid prototyping
Groq is ideal when speed is the primary concern. Use for real-time applications where immediate responses matter.
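For latency-sensitive applications, you typically enable streaming so tokens render as they arrive rather than after the full completion. A hedged sketch of the client side, assuming Groq follows the OpenAI-style server-sent-events format (`data:` lines carrying delta chunks, terminated by `[DONE]`):

```python
import json

def extract_stream_text(sse_lines) -> str:
    """Collect incremental text deltas from an OpenAI-style SSE stream."""
    parts = []
    for line in sse_lines:
        if not line.startswith("data: "):
            continue  # skip keep-alives and blank separator lines
        data = line[len("data: "):]
        if data == "[DONE]":  # end-of-stream sentinel
            break
        chunk = json.loads(data)
        parts.append(chunk["choices"][0]["delta"].get("content", ""))
    return "".join(parts)

# Simulated stream fragments shaped like OpenAI-compatible streaming chunks.
fake_stream = [
    'data: {"choices":[{"delta":{"content":"Hel"}}]}',
    'data: {"choices":[{"delta":{"content":"lo"}}]}',
    "data: [DONE]",
]
print(extract_stream_text(fake_stream))  # → Hello
```

In a real client you would iterate over the HTTP response body line by line instead of a list, appending each delta to the UI as it arrives.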