OLMo 2
Developed by AI2 (Allen Institute for AI)
Key Capabilities
- Fully open: model weights, training data, training code, and evaluation
- Transparent training recipes for reproducibility
- Competitive with similarly sized open-weight models
- Built for AI safety research and auditing
- Dolma dataset fully documented and auditable
VRAM Requirements by Quantization
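Choose a GPU based on your performance and quality needs. The figures below are approximate memory for the FP16 weights alone (about 2 bytes per parameter); the KV cache and runtime overhead need additional headroom.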
| Model / Quantization | VRAM Required |
|---|---|
| 7B FP16 | 14GB |
| 13B FP16 | 26GB |
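
The FP16 figures above follow from a simple rule of thumb: parameter count times bytes per parameter. The sketch below reproduces them under stated assumptions: 1 GB is taken as 10^9 bytes, only the weights are counted, and the `int8` and `int4` rows are illustrative additions that do not appear in the table. The function name `weight_vram_gb` is hypothetical.

```python
# Rough weight-only VRAM estimate: parameters x bytes per parameter.
# Assumption: 1 GB = 1e9 bytes; KV cache, activations, and framework
# overhead are NOT included, so real usage will be higher.

BYTES_PER_PARAM = {
    "fp16": 2.0,
    "int8": 1.0,  # illustrative; not listed in the table above
    "int4": 0.5,  # illustrative; not listed in the table above
}


def weight_vram_gb(params_billions: float, precision: str) -> float:
    """Return the approximate GB needed to hold the model weights alone."""
    return params_billions * BYTES_PER_PARAM[precision]


if __name__ == "__main__":
    for size in (7, 13):
        for precision in BYTES_PER_PARAM:
            print(f"{size}B {precision}: ~{weight_vram_gb(size, precision):.1f} GB")
```

Treat the results as a lower bound: long contexts and large batch sizes can add several gigabytes on top of the weights.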
Use Cases
OLMo 2 (7B, 13B) can be deployed for enterprise AI applications including document processing, code generation, data analysis, and conversational AI. License: Apache 2.0 (fully open: weights, data, code, training recipes).
Run OLMo 2 with Petronella
PTG deploys OLMo 2 for organizations requiring full AI transparency and auditability. OLMo 2 is one of the few major models with completely open training data, code, and recipes, which makes it well suited to regulated industries that require AI explainability.
Recommended Hardware
| Model Size | Recommended GPU |
|---|---|
| 7B | RTX 5080 (16GB) |
| 13B | RTX 5090 (32GB) for FP16; RTX PRO 4000 (24GB) with 8-bit or lower quantization |
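
One way to sanity-check a model and GPU pairing is to compare the FP16 weight footprint, plus some headroom, against the card's memory. The sketch below is a rough check, not a sizing guarantee: the 10% headroom is a hypothetical figure chosen for illustration, the function name `fits_fp16` is made up, and the GPU capacities are the ones listed in the table.

```python
# Check whether FP16 weights fit on a GPU, leaving some headroom.
# Assumptions: 2 bytes per parameter (FP16), 1 GB = 1e9 bytes, and a
# hypothetical 10% headroom for KV cache and runtime overhead.

GPUS_GB = {"RTX 5080": 16, "RTX PRO 4000": 24, "RTX 5090": 32}


def fits_fp16(params_billions: float, gpu_gb: float, headroom: float = 0.10) -> bool:
    """Return True if FP16 weights plus headroom fit in gpu_gb of VRAM."""
    needed_gb = params_billions * 2 * (1 + headroom)
    return needed_gb <= gpu_gb


if __name__ == "__main__":
    for size in (7, 13):
        for name, capacity in GPUS_GB.items():
            verdict = "fits" if fits_fp16(size, capacity) else "does not fit"
            print(f"{size}B FP16 on {name} ({capacity}GB): {verdict}")
```

Under these assumptions the 7B model fits on a 16GB card with little room to spare, while the 13B model at FP16 needs more than 24GB, so a 24GB card would call for a quantized build.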
Deploy OLMo 2 On-Premises
Our team builds GPU-accelerated systems configured and optimized for OLMo 2. Private, secure, and fully under your control.