Pasha

pashak

khosravipasha

AI & ML interests

None yet

Recent Activity

upvoted a collection about 1 month ago

Teacher Logits

Collection

Logits captured from large models to act as the teacher for distillation • 3 items • Updated Dec 15, 2025 • 7

liked a model about 1 month ago

arcee-ai/Trinity-Mini

Text Generation • 26B • Updated Dec 11, 2025 • 6.78k • 170

upvoted an article 2 months ago

Article

Continuous batching from first principles

Nov 25, 2025 • 311

liked 6 models 2 months ago

liked 2 models 3 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 225k • 2.36k

nvidia/Llama-3.1-8B-Instruct-NVFP4

5B • Updated Sep 15, 2025 • 27k • 6

upvoted an article 3 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024 • 273

liked 2 models 4 months ago

deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • 685B • Updated Nov 18, 2025 • 63.3k • 943

Qwen/Qwen3-8B

Text Generation • 8B • Updated Jul 26, 2025 • 4.23M • 875

liked a Space 4 months ago

The Ultra-Scale Playbook

🌌 • 3.66k

The ultimate guide to training LLM on large GPU Clusters

liked a model 4 months ago

marin-community/marin-8b-instruct

Text Generation • 8B • Updated May 19, 2025 • 646 • 27

liked a dataset 4 months ago

HuggingFaceFW/finepdfs

Viewer • Updated 17 days ago • 476M • 26.3k • 803

updated a model 5 months ago

pashak/Llama-3.1-8B-Instruct-Q2_K-GGUF

Text Generation • 8B • Updated Sep 4, 2025

published a model 5 months ago

pashak/Llama-3.1-8B-Instruct-Q2_K-GGUF

Text Generation • 8B • Updated Sep 4, 2025

upvoted an article 5 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024 • 241
