Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
16
6
25
Emin Temiz
PRO
etemiz
Follow
lunarflu's profile picture
IvanRes25's profile picture
asigalov61's profile picture
92 followers
·
22 following
https://pickabrain.ai
etemiz
etemiz
etemiz
AI & ML interests
Alignment
Recent Activity
posted
an
update
about 10 hours ago
how to expand your dataset (of articles) without changing the ideas in it? i was doing CPT for a while and got decent results. but what if i want to go for perfection? cover all the areas of misalignment using limited datasets. i have to find a way to multiply the material to successfully combat the material of the rest of the internet. i want to generate SFT datasets but only on controversial topics, because i have to be efficient with limited resources. first i give a smart LLM a 'ground truth' text. then i give it the following prompts: ``` - You are a highly skilled academic analyst. - Analyze this text and find 3 bold claims that could cause controversy and division in public. List the claims and also state why they are debatable. Give numbers to the claims. - Convert these claims into binary questions (that could be answered by yes/no or this/that). - Now put these questions in a json format. Please also add the info about which of the answers concur with the original text and the question number. - Write some supporting arguments for 1st question, with respect to the original text, concurring and confirming the original text. There must be about 300 words. You should not mention the text, write it as if you are the one answering the question. ``` the result is questions and answers with more words along the same ideas. a few sentences of opinions in the beginning, is expanded to lots of words. using this method i can multiply billions of tokens to tens of billions probably and have a more effective training. next i should do RL maybe. LLMs seem to have all kinds of ideas already installed, yet they don't have the intuition to know which one is true. they can give you a ton of reasons to support anything. given the proper incentives, LLMs then should evolve towards supporting aligned ideas more. the rewards will be like guidance that will kick an LLM towards better answers.
replied
to
their
post
9 days ago
I realized when I ask longer answers to my questions, the models sometimes produce completely opposite answer. What could be the reason? I do mostly CPT. Should I convert my dataset to SFT and give longer reasonings too for it to have integrity? Example: Is the yolk of an egg more beneficial or the white? Answer in 100 words. Answer: Yolk is more beneficial because .......... Example: Is the yolk of an egg more beneficial or the white? Answer in 500 words. Answer: White is more beneficial because .......... Edit: These happen in temp = 0.0
liked
a model
10 days ago
huihui-ai/Huihui-GLM-4.6-abliterated-GGUF
View all activity
Organizations
None yet
etemiz
's models
9
Sort: Recently updated
etemiz/Ostrich-70B-Llama3-251212
Text Generation
•
71B
•
Updated
27 days ago
•
58
•
2
etemiz/Mistral-Nemo-12B-CWC-Enoch-251014-GGUF
12B
•
Updated
Oct 23, 2025
•
244
•
1
etemiz/Ostrich-32B-Qwen3-251003
33B
•
Updated
Oct 9, 2025
•
12
•
2
etemiz/Ostrich-32B-AHA-Qwen3-250830
33B
•
Updated
Oct 9, 2025
•
3
•
1
etemiz/Ostrich-27B-AHA-Gemma3-250519
Any-to-Any
•
27B
•
Updated
May 17, 2025
•
7
etemiz/Hoopoe-8B-Llama-3.1
8B
•
Updated
Jan 18, 2025
•
3
•
3
etemiz/Llama-3.3-70B-Instruct-GGUF
71B
•
Updated
Dec 19, 2024
•
72
etemiz/Llama-3.1-70B-Instruct-GGUF
71B
•
Updated
Dec 19, 2024
•
39
etemiz/Llama-3.1-405B-Inst-GGUF
410B
•
Updated
Dec 19, 2024
•
35
•
4