Small Language Models
updated
facebook/opt-iml-max-1.3b
Text Generation
•
Updated
•
1.17k
•
43
Text Generation
•
Updated
•
10.7k
•
86
togethercomputer/RedPajama-INCITE-Chat-3B-v1
Text Generation
•
Updated
•
1.08k
•
152
Text Generation
•
1B
•
Updated
•
24.3k
•
321
Text Generation
•
3B
•
Updated
•
15.7k
•
498
Text Generation
•
2B
•
Updated
•
15.3k
•
26
Text Generation
•
3B
•
Updated
•
23.3k
•
32
cerebras/Cerebras-GPT-1.3B
Text Generation
•
Updated
•
1.35k
•
50
cerebras/Cerebras-GPT-2.7B
Text Generation
•
Updated
•
1.06k
•
45
mtgv/MobileLLaMA-1.4B-Chat
Text Generation
•
Updated
•
362
•
20
mtgv/MobileLLaMA-2.7B-Chat
Text Generation
•
Updated
•
66
•
6
M4-ai/TinyMistral-6x248M-Instruct
Text Generation
•
1B
•
Updated
•
20
•
11
M4-ai/NeuralReyna-Mini-1.8B-v0.3
Text Generation
•
2B
•
Updated
•
58
•
11
stabilityai/stablelm-2-zephyr-1_6b
Text Generation
•
2B
•
Updated
•
3.28k
•
186
stabilityai/stable-code-instruct-3b
Text Generation
•
3B
•
Updated
•
2.24k
•
180
stabilityai/stablelm-zephyr-3b
Text Generation
•
3B
•
Updated
•
7.67k
•
259
Text Generation
•
Updated
•
32
•
25
TinyLlama/TinyLlama-1.1B-Chat-v1.0
Text Generation
•
1B
•
Updated
•
1.4M
•
1.49k
Text Generation
•
1B
•
Updated
•
13.8k
•
25
Text Generation
•
2B
•
Updated
•
4.29k
•
123
Text Generation
•
2B
•
Updated
•
75.8k
•
•
69
Text Generation
•
4B
•
Updated
•
13.9k
•
44
Text Generation
•
2B
•
Updated
•
1.11M
•
•
155
Qwen/Qwen2.5-1.5B-Instruct
Text Generation
•
2B
•
Updated
•
5.92M
•
•
580
Qwen/Qwen2.5-Coder-1.5B-Instruct
Text Generation
•
2B
•
Updated
•
95.5k
•
•
96
Text Generation
•
3B
•
Updated
•
8.19M
•
359
Text Generation
•
3B
•
Updated
•
53.7k
•
833
Text Generation
•
3B
•
Updated
•
41.6k
•
171
Text Generation
•
3B
•
Updated
•
1.56k
•
90
Text Generation
•
3B
•
Updated
•
79
•
20
Text Generation
•
1B
•
Updated
•
2.98k
•
218
Text Generation
•
3B
•
Updated
•
181k
•
•
1.26k
Text Generation
•
1B
•
Updated
•
41.4k
•
1.35k
Text Generation
•
3B
•
Updated
•
1.15M
•
3.42k
ministral/Ministral-3b-instruct
Text Generation
•
3B
•
Updated
•
121k
•
81
HuggingFaceTB/SmolLM-1.7B-Instruct
Text Generation
•
2B
•
Updated
•
3.44k
•
118
h2oai/h2o-danube-1.8b-chat
Text Generation
•
2B
•
Updated
•
768
•
55
h2oai/h2o-danube2-1.8b-chat
Text Generation
•
2B
•
Updated
•
170
•
62
h2oai/h2o-danube3-4b-chat
Text Generation
•
4B
•
Updated
•
2.22k
•
67
h2oai/h2o-danube3.1-4b-chat
Text Generation
•
4B
•
Updated
•
133
•
5
Text Generation
•
1B
•
Updated
•
830
•
41
Text Generation
•
6B
•
Updated
•
15.3k
•
70
Text Generation
•
6B
•
Updated
•
3.69k
•
41
Updated
•
168
•
257
6B
•
Updated
•
56.5k
•
1.16k
zai-org/glm-edge-1.5b-chat
Text Generation
•
2B
•
Updated
•
1.14k
•
17
Text Generation
•
4B
•
Updated
•
425
•
12
meta-llama/Llama-3.2-1B-Instruct
Text Generation
•
1B
•
Updated
•
2.95M
•
•
1.23k
meta-llama/Llama-3.2-3B-Instruct
Text Generation
•
3B
•
Updated
•
1.98M
•
•
1.91k
NousResearch/Hermes-3-Llama-3.2-3B
Text Generation
•
3B
•
Updated
•
6.26k
•
175
ibm-granite/granite-3b-code-instruct-2k
Text Generation
•
3B
•
Updated
•
1.32k
•
39
ibm-granite/granite-3.0-2b-instruct
Text Generation
•
3B
•
Updated
•
3.18k
•
46
nvidia/Hymba-1.5B-Instruct
Text Generation
•
2B
•
Updated
•
265
•
242
HuggingFaceTB/SmolLM2-1.7B
Text Generation
•
2B
•
Updated
•
11k
•
140
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
•
2B
•
Updated
•
1.84M
•
•
1.42k
apple/OpenELM-1_1B-Instruct
Text Generation
•
1B
•
Updated
•
1.51k
•
70
apple/OpenELM-3B-Instruct
Text Generation
•
3B
•
Updated
•
1.11k
•
339
internlm/internlm2-chat-1_8b
Text Generation
•
2B
•
Updated
•
3.89k
•
35
internlm/internlm2_5-1_8b-chat
Text Generation
•
2B
•
Updated
•
726
•
25
agentica-org/DeepScaleR-1.5B-Preview
Text Generation
•
2B
•
Updated
•
11k
•
577
microsoft/Phi-3-mini-128k-instruct
Text Generation
•
4B
•
Updated
•
55.3k
•
1.69k
microsoft/Phi-4-mini-instruct
Text Generation
•
4B
•
Updated
•
136k
•
652
Text Generation
•
1.0B
•
Updated
•
2.33M
•
785
Text Generation
•
Updated
•
193
•
121
ibm-granite/granite-3.3-2b-instruct
Text Generation
•
3B
•
Updated
•
68.7k
•
82
Text Generation
•
4B
•
Updated
•
2.71k
•
•
495
Qwen/Qwen3-4B-Thinking-2507
Text Generation
•
4B
•
Updated
•
420k
•
•
507
Qwen/Qwen3-4B-Instruct-2507
Text Generation
•
4B
•
Updated
•
2.78M
•
•
616
Text Generation
•
3B
•
Updated
•
69k
•
•
864
ibm-granite/granite-4.0-h-micro
Text Generation
•
3B
•
Updated
•
4.22k
•
128
nvidia/Nemotron-Flash-3B-Instruct
Text Generation
•
3B
•
Updated
•
1.15k
•
25
mistralai/Ministral-3-3B-Reasoning-2512
4B
•
Updated
•
28.7k
•
80
mistralai/Ministral-3-3B-Instruct-2512
4B
•
Updated
•
209k
•
166
Text Generation
•
1B
•
Updated
•
394k
•
339
Text Generation
•
3B
•
Updated
•
9.91k
•
170
Nanbeige/Nanbeige4-3B-Thinking-2511
Text Generation
•
4B
•
Updated
•
7.1k
•
162