Models: 10

Active filters: deepinfra

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 4 days ago • 4.06M downloads • 4.78k likes

Qwen/Qwen3.6-35B-A3B

Image-Text-to-Text • 36B • Updated Apr 24 • 5.13M downloads • 2.07k likes

google/gemma-4-31B-it

Image-Text-to-Text • 33B • Updated 8 days ago • 9.79M downloads • 2.96k likes

deepseek-ai/DeepSeek-V4-Flash

Text Generation • 158B • Updated 4 days ago • 2.78M downloads • 1.46k likes

google/gemma-4-26B-A4B-it

Image-Text-to-Text • 27B • Updated 8 days ago • 11.4M downloads • 1.12k likes

moonshotai/Kimi-K2.6

Image-Text-to-Text • 1.1T • Updated 23 days ago • 2.76M downloads • 1.44k likes

zai-org/GLM-5.1

Text Generation • 754B • Updated 29 days ago • 123k downloads • 1.76k likes

Qwen/Qwen3.5-397B-A17B

Image-Text-to-Text • 403B • Updated Apr 24 • 921k downloads • 1.51k likes

Qwen/Qwen3.5-122B-A10B

Image-Text-to-Text • 125B • Updated Apr 24 • 751k downloads • 568 likes

stepfun-ai/Step-3.5-Flash

Text Generation • 199B • Updated Mar 17 • 326k downloads • 820 likes