

Zixi "Oz" Li

OzTianlu


https://github.com/lizixi-0x2F

lizixi-0x2F

AI & ML interests

My research focuses on deep reasoning with small language models, Transformer architecture innovation, and knowledge distillation for efficient alignment and transfer.

Recent Activity

reacted to their post with 🚀 about 4 hours ago

🚀 Geilim-1B-Instruct: Implicit Deep Reasoning, Zero Verbosity

https://huggingface.co/NoesisLab/Geilim-1B-Instruct
https://huggingface.co/collections/NoesisLab/geilim-large-language-models

No <think> tags. No long CoT. Reasoning happens inside the hidden states, not in the output.

What’s different
🧠 Implicit reasoning: deep causal reasoning without exposing chains
🕸️ ASPP (Adjacency-Structured Parallel Propagation): parent-only causal graph, O(n) message passing
🌊 π-flow: internal probability-space refinement instead of token-level deliberation
⚖️ Hybrid gating: learns when to use structure vs attention

Why it matters
Lower latency & token cost
Cleaner, production-ready outputs
CoT-level reasoning depth without verbosity tax

Built on Llama-3.2-1B-Instruct, trained for math, logic, and commonsense. Designed for small-model reasoning at the edge.

#ImplicitReasoning #SmallLLM #EfficientAI #ReasoningModels #ASPP #PiFlow
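
To make the "parent-only causal graph, O(n) message passing" phrase concrete, here is a toy sketch in plain Python. It is not the Geilim or ASPP implementation, which the post does not describe in detail. The function name `propagate_parent_only`, the `mix` weight, and the scalar states are all hypothetical, chosen only to show why giving each node at most one earlier parent keeps a propagation round linear in sequence length.

```python
from typing import List, Optional


def propagate_parent_only(
    states: List[float],
    parents: List[Optional[int]],
    mix: float = 0.5,  # hypothetical blend weight, not taken from the post
) -> List[float]:
    """Run one parallel round where each node hears only from its parent.

    Every node has at most one incoming edge, so a round touches at most
    n edges, which is where the O(n) cost comes from.
    """
    if len(states) != len(parents):
        raise ValueError("states and parents must have the same length")

    new_states = []
    for i, (state, parent) in enumerate(zip(states, parents)):
        if parent is None:
            # Root node: nothing to receive, state is unchanged.
            new_states.append(state)
            continue
        if not 0 <= parent < i:
            raise ValueError("a parent must be an earlier position (causal)")
        # Read from the OLD states so all nodes update in parallel.
        new_states.append((1 - mix) * state + mix * states[parent])
    return new_states


# Simplest parent-only graph: a chain where position i listens to i - 1.
states = [1.0, 0.0, 0.0, 0.0]
parents = [None, 0, 1, 2]
updated = propagate_parent_only(states, parents)
```

In this toy setup, information from position 0 reaches position 1 after one round and moves one hop further with each additional round.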

posted an update about 4 hours ago


liked a model about 9 hours ago

NoesisLab/Geilim-1B-Instruct


Organizations

OzTianlu's models

None public yet