The collection of datasets used in paper "Difficulty–Diversity Collaborative Filtering for Data-Efficient LLM Fine-Tuning"