---
title: README
emoji: 👀
colorFrom: gray
colorTo: blue
sdk: static
pinned: false
---

CoIR (Code Information Retrieval Benchmark) is the **first** comprehensive benchmark designed to evaluate models' abilities in code retrieval. It includes four distinct code retrieval tasks: text-to-code retrieval, code-to-text retrieval, code-to-code retrieval, and hybrid-code retrieval. The benchmark encompasses a total of 10 datasets, covering 14 programming languages and containing 2 million evaluation documents.

For comprehensive details, please refer to our [GitHub repository](https://github.com/CoIR-team/coir) and the associated [research paper](https://arxiv.org/abs/2407.02883).