Steffen Röcker PRO

sroecker

https://x.com/sroecker

AI & ML interests

Local models

Recent Activity

liked 2 models 1 day ago

lyraaaa/baguettotron-SAE-L48-8x-k16-774m

Updated 1 day ago • 1

kaitchup/Qwen3.5-27B-autoround-W4A16

7B • Updated 3 days ago • 94 • 4

upvoted a collection 1 day ago

Quantized Qwen3.5

Verified models. Compatible with Transformers v5.3 and vLLM v0.16.1rc1 (nightly). Under evaluation. • 3 items • Updated 3 days ago • 4

liked a model 2 days ago

huihui-ai/Huihui-LFM2-24B-A2B-abliterated

Text Generation • Updated 2 days ago • 33 • 5

liked a model 3 days ago

huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated

Image-Text-to-Text • 36B • Updated about 7 hours ago • 8.3k • 129

liked a model 5 days ago

bknyaz/Qwen3-Coder-Next-REAM

Text Generation • 60B • Updated 19 days ago • 682 • 25

upvoted 2 collections 5 days ago

Qwen3-MoE

Compressed Qwen3 MoE models with a reduced number of experts. See additional models at https://huggingface.co/bknyaz. • 9 items • Updated 19 days ago • 3

Cerebras REAP

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated 5 days ago • 126

liked 3 datasets 6 days ago

peteromallet/dataclaw-peteromallet

Viewer • Updated 5 days ago • 549 • 4.01k • 246

crozai/croz-infbench-sharegpt

Viewer • Updated Mar 26, 2025 • 394 • 62 • 1

crozai/croz-coding-sharegpt

Viewer • Updated Mar 26, 2025 • 118k • 9 • 1

upvoted a collection 6 days ago

Qwen3.5

17 items • Updated about 8 hours ago • 656

liked a model 6 days ago

Qwen/Qwen3.5-35B-A3B-Base

Image-Text-to-Text • 36B • Updated about 8 hours ago • 5.23k • 104

liked a dataset 6 days ago

OpenMOSE/reap-calib-mix

Viewer • Updated 5 days ago • 91.9k • 57 • 6

liked a model 6 days ago

OpenMOSE/Qwen3.5-REAP-212B-A17B

212B • Updated 4 days ago • 255 • 13

upvoted a collection 10 days ago

gliner2 family

GLiNER2 extends the original GLiNER architecture to support multi-task information extraction with a schema-driven interface. This base model provid • 4 items • Updated 20 days ago • 31

liked a model 11 days ago

mit-oasys/rlm-qwen3-8b-v0.1

8B • Updated 11 days ago • 1.94k • 44

upvoted a collection 14 days ago

QED Nano

Artifacts for the QED Nano release • 9 items • Updated about 5 hours ago • 6

liked a model 15 days ago

LiquidAI/LFM2.5-1.2B-Instruct

Text Generation • 1B • Updated 6 days ago • 110k • 510

liked a model 17 days ago

MiniMaxAI/MiniMax-M2.5

Text Generation • 229B • Updated 14 days ago • 324k • 1.06k