Specifications
| Property | Value |
|---|---|
| Parameters | 700M |
| Context Length | 32K tokens |
| Architecture | LFM2 (Dense) |
Edge Deployment
Optimized for resource-constrained devices
Low Latency
Fast inference for real-time applications
Fine-tunable
TRL compatible (SFT, DPO, GRPO)
Quick Start
- Transformers
- llama.cpp
- vLLM
Install:Download & Run: