1 - 30 of 251 available models.
Model

Google’s latest Gemma, small yet strong for chat and generation


Pulls

100K+

Stars

47

Last Updated

3 months

Model

Qwen3 is the latest Qwen LLM, built for top-tier coding, math, reasoning, and language tasks.


Pulls

100K+

Stars

79

Last Updated

about 1 month

Model

OpenAI’s open-weight models designed for powerful reasoning and agentic tasks


Pulls

100K+

Stars

28

Last Updated

29 days

Model

Solid Llama 3 update, reliable for coding, chat, and Q&A tasks


Pulls

100K+

Stars

22

Last Updated

8 months

Model

Tiny LLM built for speed, edge devices, and local development


Pulls

100K+

Stars

26

Last Updated

3 months

Model

Distilled Llama by DeepSeek, fast and optimized for real-world tasks


Pulls

100K+

Stars

71

Last Updated

7 months

Model

Efficient multimodal AI for text, image, audio, and video on low-resource devices.


Pulls

50K+

Stars

9

Last Updated

6 months

Model

Google’s latest Gemma, in its QAT (quantization-aware training) variant


Pulls

50K+

Stars

19

Last Updated

3 months

Model

Versatile Qwen update with better language skills and wider support


Pulls

50K+

Stars

8

Last Updated

8 months

Model

Microsoft’s compact model, surprisingly capable at reasoning and code


Pulls

50K+

Stars

20

Last Updated

8 months

Model

Newest Llama 3 release with improved reasoning and generation quality


Pulls

50K+

Stars

17

Last Updated

8 months

Model

Qwen3-Coder is Qwen’s new series of coding agent models.


Pulls

50K+

Stars

15

Last Updated

3 months

Model

The most advanced Qwen model yet, with major gains in text, vision, video, and reasoning.


Pulls

50K+

Stars

5

Last Updated

about 1 month

Model

Efficient open model with top-tier performance and fast inference


Pulls

10K+

Stars

19

Last Updated

8 months

Model

DeepCoder-14B-Preview is a code reasoning LLM fine-tuned to scale up to long context lengths


Pulls

10K+

Stars

10

Last Updated

8 months

Model

SmolLM3 is a 3.1B model for efficient on-device use, with strong performance in chat


Pulls

10K+

Stars

5

Last Updated

5 months

Model

Meta’s Llama 3.1: Chat-focused, benchmark-strong, multilingual-ready.


Pulls

10K+

Stars

4

Last Updated

8 months

Model

Granite Docling is a multimodal model for efficient document conversion.


Pulls

10K+

Stars

1

Last Updated

2 months

Model

Google’s latest Gemma, small yet strong for chat and generation


Pulls

10K+

Stars

0

Last Updated

about 2 months

Model

Embedding Gemma is a state-of-the-art text embedding model from Google DeepMind


Pulls

9.9K

Stars

3

Last Updated

3 months

Model

Designed for reasoning, agentic, and general-purpose tasks, with versatile developer-friendly features


Pulls

8.9K

Stars

1

Last Updated

3 months

Model

24B multimodal instruction model by Mistral AI, tuned for accuracy, tool use & fewer repeats


Pulls

8.8K

Stars

1

Last Updated

3 months

Model

Mistral fine-tuned via NVIDIA NeMo for smoother enterprise use


Pulls

8.5K

Stars

3

Last Updated

9 months

Model

mxbai-embed-large-v1 is a top English embedding model by Mixedbread AI, great for RAG and more.


Pulls

8.0K

Stars

3

Last Updated

9 months

Model

SmolVLM: lightweight multimodal model for video, image, and text analysis, optimized for devices.


Pulls

7.7K

Stars

3

Last Updated

2 months

Model

Safety reasoning models for policy-based text classification and foundational safety tasks.


Pulls

7.7K

Stars

1

Last Updated

about 1 month

Model

Qwen3 Embedding: multilingual models for advanced text/ranking tasks like retrieval & clustering.


Pulls

7.6K

Stars

0

Last Updated

28 days

Model

Nomic Embed Text v1 is an open‑source, fully auditable text embedding model


Pulls

6.5K

Stars

3

Last Updated

4 months

Model

Qwen3 Embedding: multilingual models for advanced text/ranking tasks like retrieval & clustering.


Pulls

5.8K

Stars

0

Last Updated

28 days

Model

Experimental Qwen variant: lean, fast, and a bit mysterious


Pulls

5.8K

Stars

3

Last Updated

8 months