Spectrum: Targeted Training on Signal to Noise Ratio Paper • 2406.06623 • Published Jun 7, 2024 • 15
Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation Paper • 2406.14971 • Published Jun 21, 2024
Training-Free Tokenizer Transplantation via Orthogonal Matching Pursuit Paper • 2506.06607 • Published Jun 7, 2025 • 2