Models: 1,341

Active filters: multimodal

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 5.25M • 1.06k

NCSOFT/VARCO-VISION-2.0-14B

Image-Text-to-Text • 15B • Updated about 7 hours ago • 770 • 13

lingshu-medical-mllm/Lingshu-32B

Image-Text-to-Text • 33B • Updated 23 days ago • 2.16k • 51

NCSOFT/GME-VARCO-VISION-Embedding

Feature Extraction • 8B • Updated about 11 hours ago • 803 • 9

lingshu-medical-mllm/Lingshu-7B

Image-Text-to-Text • 8B • Updated 23 days ago • 5.59k • 44

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6 • 3.8M • 459

NCSOFT/VARCO-VISION-2.0-1.7B

Image-Text-to-Text • Updated 2 days ago • 6

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Apr 14 • 460k • 407

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 111k • 1.71k

ByteDance/Dolphin

Image-Text-to-Text • 0.4B • Updated 2 days ago • 40.2k • 432

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6 • 885k • 1.21k

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 121k • 318

stepfun-ai/Step1X-Edit

Image-to-Image • Updated 9 days ago • 348 • 300

BAAI/Video-XL-2

Video-Text-to-Text • 8B • Updated Jun 6 • 611 • 50

Kwai-Keye/Keye-VL-8B-Preview

Video-Text-to-Text • 9B • Updated 12 days ago • 28.5k • 64

Qwen/Qwen2.5-Omni-3B

Any-to-Any • 6B • Updated Apr 30 • 166k • 253

robotics-diffusion-transformer/rdt-1b

Robotics • Updated Oct 17, 2024 • 733 • 87

Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Jan 12 • 961k • 433

allenai/MolmoE-1B-0924

Image-Text-to-Text • Updated Apr 24 • 2.14k • 148

allenai/Molmo-7B-D-0924

Image-Text-to-Text • 8B • Updated Apr 4 • 166k • 536

NCSOFT/VARCO-VISION-14B-HF

Image-Text-to-Text • 15B • Updated 2 days ago • 1.84k • 29

bartowski/Qwen2-VL-2B-Instruct-GGUF

Image-Text-to-Text • 2B • Updated Dec 17, 2024 • 4.29k • 28

Minthy/ToriiGate-v0.4-7B

Image-Text-to-Text • 8B • Updated Jan 22 • 1.52k • 50

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • 73B • Updated Jun 6 • 343k • 512

mlx-community/Qwen2.5-VL-3B-Instruct-6bit

Image-Text-to-Text • 0.9B • Updated Feb 26 • 56 • 2

chenjoya/LiveCC-7B-Instruct

8B • Updated Apr 25 • 4.61k • 37

Hcompany/Holo1-7B

Image-Text-to-Text • 8B • Updated Jun 10 • 14.2k • 217

lusxvr/nanoVLM-450M

Image-Text-to-Text • 0.5B • Updated Jun 4 • 1.82k • 6

0xroyce/silent-voice-multimodal

8B • Updated 6 days ago • 254 • 2

imageomics/bioclip

Zero-Shot Image Classification • Updated May 17, 2024 • 171k • 50