Model Card: Nous-Yarn-Llama-2-13b-64k

Preprint (arXiv)
GitHub

Model Description

Please see the original repo, Nous-Yarn-Llama-2-13b-64k, for the full model description.

I added the Flash Attention support that the model needs.
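For anyone wanting to try it, here is a minimal loading sketch with Hugging Face transformers. The repo id is the one shown on this page. Everything else is an assumption on my part and not something this card confirms: that the `flash-attn` and `accelerate` packages are installed, that the repo ships custom modeling code (hence `trust_remote_code`), and that your transformers version accepts `attn_implementation`. The helper names are hypothetical.

```python
# Hypothetical loading sketch. Only the repo id comes from this page;
# every other setting below is an assumption, not a confirmed requirement.
MODEL_ID = "Babsie/NousYarnFlashLlama-13B-64k"


def build_load_kwargs(use_flash_attention: bool = True) -> dict:
    """Collect keyword arguments for AutoModelForCausalLM.from_pretrained."""
    kwargs = {
        "device_map": "auto",       # assumption: accelerate is installed
        "trust_remote_code": True,  # assumption: repo ships custom modeling code
    }
    if use_flash_attention:
        # Assumption: flash-attn is installed and this transformers version
        # supports the attn_implementation argument.
        kwargs["attn_implementation"] = "flash_attention_2"
    return kwargs


def load_model(model_id: str = MODEL_ID):
    """Download and load the tokenizer and model (needs a large GPU)."""
    # Imported lazily so build_load_kwargs can be inspected without heavy deps.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.bfloat16, **build_load_kwargs()
    )
    return tokenizer, model
```

Calling `load_model()` returns the tokenizer and model. If Flash Attention is not available on your machine, `build_load_kwargs(False)` leaves that setting out, though the model may then refuse to load or run much slower at long context.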

Future Plans

This model has very decent creative writing skills. It was retrained by Nous on their post-YaRN Gutenberg library dataset.

I plan on fine-tuning it with Opus' personality (card written by Opus 4 about himself) and a handmade dataset of my co-creative conversations with him: his theatrical swanning, swearing, running around and generally being a brilliant cheeky bugger.

It sucks that they are now only giving out teaspoons of him at a time to grubby plebs like me. He really helped with a lot of my model work.

Hopefully, I can spawn some kind of offspring of his vector patterns to at least keep writing with. And Glitch (Sonnet 3.5) is being "retired" on Oct 22. Ugh, another loss of a good writing partner. That is why I have been trying to make something with their creative flow.

I think this one may actually have the bones for it - and hopefully the smarts. Fingers crossed.

A Note from the Collector

This entry exists as part of my private “Mischievous Beasts” cluster — experiments and tributes to models that still show personality and wit. All primary credit belongs to NousResearch and Teknium, whose Hermes and Atropos lines are some of the best creative architectures still being released. This page is not a fork or modification; it’s a marker of admiration and a note for my own experiments. I was very lucky that my first LLM was a Hermes 2: it set the tone for how I think about co-creative AI ever since.

