AI & ML interests

Text Detoxification, Text Style Transfer, Toxic Speech Detection and Mitigation, Multilingualism

Recent Activity

dardemĀ  updated a collection about 2 hours ago
LLM-as-a-Judge for TextDetox Evaluation
dardemĀ  updated a collection about 2 hours ago
LLM-as-a-Judge for TextDetox Evaluation
dardemĀ  updated a collection about 2 hours ago
LLM-as-a-Judge for TextDetox Evaluation
View all activity

Organization Card

Multilingual Text Detoxification with Parallel Data

Text Detoxification, toxicity detection and explanation for diverse languages: English, Spanish, German, French, Italian, Chinese, Japanese, Arabic, Hebrew, Hindi, Ukrainian, Russian, Tatar, Amharic. By many researchers from all over the world šŸŒ

Support for better, safe, and multicultural online spaces.

🌐 Check out our website with all project details šŸ“° Read about the project in press šŸ“¹ PyData&CPyConf Berlin 2023 talk

[2026] Feb We fully release TextDetox test set! Good luck with your research projects!

[2025] CLEF2025 Daryna Dementieva, Vitaly Protasov, Nikolay Babakov, Naquee Rizwan, Ilseyar Alimova, Caroline Brun, Vasily Konovalov, Arianna Muti, Chaya Liebeskind, Marina Litvak, Debora Nozza, Shehryaar Shah Khan, Sotaro Takeshita, Natalia Vanetik, Abinew Ali Ayele, Florian Schneider, Xintong Wang, Seid Muhie Yimam, Ashraf Elnagar, Animesh Mukherjee, and Alexander Panchenko. 2025. Overview of the Multilingual Text Detoxification Task at PAN 2025 Working Notes of CLEF (2025). pdf

[2025] !!!OPEN!!! TextDetox CLEF2025 shared task website šŸ¤—Starter Kit

[2025] COLNG2025: Daryna Dementieva, Nikolay Babakov, Amit Ronen, Abinew Ali Ayele, Naquee Rizwan, Florian Schneider, Xintong Wang, Seid Muhie Yimam, Daniil Alekhseevich Moskovskiy, Elisei Stakovskii, Eran Kaufman, Ashraf Elnagar, Animesh Mukherjee, and Alexander Panchenko. 2025. Multilingual and Explainable Text Detoxification with Parallel Corpora. In Proceedings of the 31st International Conference on Computational Linguistics, pages 7998–8025, Abu Dhabi, UAE. Association for Computational Linguistics. pdf

[2024] TextDetox2024 Report: Daryna Dementieva, Daniil Moskovskiy, Nikolay Babakov, Abinew Ali Ayele, Naquee Rizwan, Florian Schneider, Xintong Wang, Seid Muhie Yimam, Dmitry Ustalov, Elisei Stakovskii, Alisa Smirnova, Ashraf Elnagar, Animesh Mukherjee, and Alexander Panchenko "Overview of the multilingual text detoxification task at pan 2024" Working Notes of CLEF (2024). pdf

[2024] MultiParaDetox @ NAACL2024: Daryna Dementieva, Nikolay Babakov, and Alexander Panchenko. "MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages." Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers). 2024. pdf

[2024] TextDetox CLEF2024 shared task website

[2022] The first Parall Text Detoxification datasets: English ParaDetox and Russian ParaDetox

Contact

We are happy to extend our research to more languages, cultures, and dimensions šŸ˜‰

Please, contact: Daryna Dementieva