|
--- |
|
title: README |
|
emoji: 🐢 |
|
colorFrom: blue |
|
colorTo: green |
|
sdk: static |
|
pinned: false |
|
--- |
|
|
|
# Mukayese: Turkish NLP Strikes Back |
|
|
|
Turkish Natural Language Processing is left behind in developing state-of-the-art systems due to a lack of organized benchmarks and baselines. We fill this gap with __Mukayese__ (Turkish word for "comparison/benchmarking"), an extensive set of datasets and benchmarks for several Turkish NLP tasks. All of the datasets and code have been made public in this repository. |
|
|
|
--- |
|
## Updates |
|
|
|
- (22/03/2022) Summarization models are online on Huggingface! |
|
- (25/02/2022) Datasets have been made available through pre-release [v0.0.1](https://github.com/alisafaya/mukayese/releases/tag/v0.0.1) |
|
|
|
--- |
|
## What to do with Mukayese ? |
|
|
|
With Mukayese, researchers of Turkish NLP will be able to: |
|
|
|
- Compare the performance of existing methods in leaderboards. |
|
- Access existing implementations of NLP baselines. |
|
- Evaluate their own methods on the relevant test datasets. |
|
- Submit their own work to be enlisted in our leaderboards. |
|
|
|
## Mukayese's Mission |
|
|
|
The most important goal of Mukayese is to standardize the comparison and evaluation of Turkish NLP methods. As a result of the lack of a platform for benchmarking, Turkish NLP researchers struggle with comparing their models to the existing ones. |
|
|
|
|