---
title: README
emoji: 🐢
colorFrom: blue
colorTo: green
sdk: static
pinned: false
---

# Mukayese: Turkish NLP Strikes Back

Turkish Natural Language Processing has fallen behind in developing state-of-the-art systems due to a lack of organized benchmarks and baselines. We fill this gap with __Mukayese__ (Turkish for "comparison/benchmarking"), an extensive set of datasets and benchmarks for several Turkish NLP tasks. All of the datasets and code have been made public in this repository.

---

## Updates

- (22/03/2022) Summarization models are online on Hugging Face!
- (25/02/2022) Datasets have been made available through pre-release [v0.0.1](https://github.com/alisafaya/mukayese/releases/tag/v0.0.1)

---

## What to do with Mukayese?

With Mukayese, researchers of Turkish NLP will be able to:

- Compare the performance of existing methods in leaderboards.
- Access existing implementations of NLP baselines.
- Evaluate their own methods on the relevant test datasets.
- Submit their own work to be listed in our leaderboards.

## Mukayese's Mission

The most important goal of Mukayese is to standardize the comparison and evaluation of Turkish NLP methods. Because there has been no platform for benchmarking, Turkish NLP researchers struggle to compare their models with existing ones.