DBRX
Developed by Databricks
Key Capabilities
- Fine-grained MoE with 16 smaller experts for better routing
- Strong coding and SQL generation
- Competitive with LLaMA 2-70B and Mixtral at lower compute cost
- 32K context window
- Optimized for data analytics and business intelligence
VRAM Requirements by Quantization
Choose the right GPU based on your performance and quality needs.
| Model / Quantization | VRAM Required |
|---|---|
| FP16 | 264GB |
| Q4 | 75GB |
Use Cases
DBRX (132B total (36B active via fine-grained MoE, 16 experts, 4 active)) can be deployed for enterprise AI applications including document processing, code generation, data analysis, and conversational AI. License: Databricks Open Model License (permissive, commercial use).
Run DBRX with Petronella
PTG deploys DBRX for data-driven organizations already using Databricks or similar platforms. Its SQL and analytics strengths make it ideal for private business intelligence AI.
Recommended Hardware
| Model Size | Recommended GPU |
|---|---|
| FP16 | DGX Spark (128GB) or 2x RTX PRO 6000 (192GB) |
| Q4 | RTX PRO 6000 Blackwell (96GB) |
Deploy DBRX On-Premises
Our team builds GPU-accelerated systems configured and optimized for DBRX. Private, secure, and fully under your control.