How to use from
Hermes Agent
Start the llama.cpp server
# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf LiquidAI/LFM2.5-8B-A1B-GGUF:
Configure Hermes
# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default LiquidAI/LFM2.5-8B-A1B-GGUF:
Run Hermes
hermes
Quick Links
Liquid AI
Try LFMDocsLEAPDiscord

LFM2.5-8B-A1B-GGUF

LFM2 is a new generation of hybrid models developed by Liquid AI, specifically designed for edge AI and on-device deployment. It sets a new standard in terms of quality, speed, and memory efficiency.

Find more details in the original model card: https://huggingface.co/LiquidAI/LFM2.5-8B-A1B

🏃 How to run LFM2

Example usage with llama.cpp:

llama-cli -hf LiquidAI/LFM2.5-8B-A1B-GGUF
Downloads last month
136,369
GGUF
Model size
8B params
Architecture
lfm2moe
Hardware compatibility
Log In to add your hardware

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for LiquidAI/LFM2.5-8B-A1B-GGUF

Quantized
(44)
this model
Quantizations
1 model

Spaces using LiquidAI/LFM2.5-8B-A1B-GGUF 2

Collection including LiquidAI/LFM2.5-8B-A1B-GGUF