Open-Source AI Model

DBRX

Developed by Databricks

Local AI Deployment Experts | 24+ Years IT Infrastructure | GPU Hardware In Stock

Key Capabilities

  • Fine-grained MoE with 16 smaller experts for better routing
  • Strong coding and SQL generation
  • Competitive with Llama 2 70B and Mixtral at lower compute cost
  • 32K context window
  • Optimized for data analytics and business intelligence
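
As a rough illustration of why finer-grained experts give the router more choices, the sketch below counts how many distinct expert subsets can be selected with 16 experts and 4 active. The 8-expert, 2-active layout is only a comparison point and is an assumption, not a figure from this page. The function name `expert_combinations` is hypothetical.

```python
import math


def expert_combinations(num_experts: int, active_experts: int) -> int:
    """Number of distinct expert subsets a router can select."""
    return math.comb(num_experts, active_experts)


# DBRX layout as stated on this page: 16 experts, 4 active.
dbrx_combos = expert_combinations(16, 4)

# Assumed coarser layout for comparison only: 8 experts, 2 active.
coarse_combos = expert_combinations(8, 2)

print(f"16 choose 4: {dbrx_combos}")
print(f"8 choose 2:  {coarse_combos}")
print(f"ratio:       {dbrx_combos // coarse_combos}x")
```

More possible subsets does not by itself guarantee better quality; it only shows how much larger the routing space is.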

VRAM Requirements by Quantization

Choose the right GPU based on your performance and quality needs.

Model / Quantization    VRAM Required
FP16                    264GB
Q4                      75GB
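
The figures above can be approximated from the parameter count. The following is a minimal sketch that estimates memory for the weights alone, assuming 132B total parameters (stated on this page) and decimal gigabytes. The 4.5 bits per weight used for Q4 is an assumption chosen because it lands near the listed 75GB. Real deployments also need room for the KV cache and activations. The helper `weight_memory_gb` is a hypothetical name.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in decimal GB."""
    # params_billion * 1e9 weights * (bits / 8) bytes each, divided by 1e9 bytes per GB
    return params_billion * bits_per_weight / 8


TOTAL_PARAMS_B = 132  # DBRX total parameters in billions (from this page)

fp16_gb = weight_memory_gb(TOTAL_PARAMS_B, 16)
# Assumption: Q4 formats store scales alongside weights, so the effective
# size is taken as roughly 4.5 bits per weight rather than exactly 4.
q4_gb = weight_memory_gb(TOTAL_PARAMS_B, 4.5)

print(f"FP16 weights: {fp16_gb:.0f} GB")
print(f"Q4 weights:   {q4_gb:.2f} GB")
```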

Use Cases

DBRX has 132B total parameters, of which 36B are active at a time through its fine-grained MoE design (16 experts, 4 active). It can be deployed for enterprise AI applications including document processing, code generation, data analysis, and conversational AI. License: Databricks Open Model License (commercial use permitted under its terms).

Run DBRX with Petronella

PTG deploys DBRX for data-driven organizations already using Databricks or similar platforms. Its SQL and analytics strengths make it ideal for private business intelligence AI.

Recommended Hardware

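Quantization    Recommended GPU
FP16            DGX Spark (128GB) or 2x RTX PRO 6000 (192GB)
Q4              RTX PRO 6000 Blackwell (96GB)

The FP16 options listed provide 128GB and 192GB, which is below the 264GB FP16 requirement given earlier, so a full FP16 deployment needs additional units or GPUs.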
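
To relate the hardware options to the VRAM figures on this page, the sketch below computes the minimum number of identical GPUs whose combined memory covers a given requirement. It is a lower bound under simple assumptions: it ignores KV cache, activation memory, and any per-device overhead from splitting the model. The function name `gpus_needed` is hypothetical.

```python
import math


def gpus_needed(required_gb: float, gpu_memory_gb: float) -> int:
    """Minimum count of identical GPUs whose total memory covers required_gb."""
    return math.ceil(required_gb / gpu_memory_gb)


# VRAM figures and GPU capacity as listed on this page.
FP16_REQUIRED_GB = 264
Q4_REQUIRED_GB = 75
RTX_PRO_6000_GB = 96

print("FP16 on RTX PRO 6000:", gpus_needed(FP16_REQUIRED_GB, RTX_PRO_6000_GB))
print("Q4 on RTX PRO 6000:  ", gpus_needed(Q4_REQUIRED_GB, RTX_PRO_6000_GB))
```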

Deploy DBRX On-Premises

Our team builds GPU-accelerated systems configured and optimized for DBRX. Private, secure, and fully under your control.