--- title: README emoji: 🐠 colorFrom: yellow colorTo: yellow sdk: static pinned: false thumbnail: >- https://cdn-uploads.huggingface.co/production/uploads/63044350fc783bfc74462d5c/C1LPGkFkycvoNJS52HQjn.jpeg --- A community organization for Wikimedians interested in creating, contributing to, using, and writing about datasets and models. (Thumbnail from [Johnson, Kaffee, and Redi '24](https://arxiv.org/pdf/2410.08918))) # Datasets of interest * [wikimedia/wikipedia](https://huggingface.co/datasets/wikimedia/wikipedia) * [wikimedia/wikisource](https://huggingface.co/datasets/wikimedia/wikisource) * [Wikimedia Commons URLs](https://github.com/ryanrudes/wikimedia) (40M, from 2022, via Ryan Rudes. new data needed) * the [Nomic Atlas](https://huggingface.co/datasets/wikimedia/wikipedia/discussions/48) map of words in WP (2023) * [Wikipedia-based Image Text](https://huggingface.co/datasets/wikimedia/wit_base) ## Collections to review [Sourced from Wikimedia](https://huggingface.co/collections/davanstrien/sourced-from-wikimedia-64f9f2ac4639c9edf83effa2), [OpenLID](https://huggingface.co/laurievb/OpenLID) # See also * The HF organization for the [Wikimedia Foundation](https://huggingface.co/wikimedia) * [WikiProject AI](https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Artificial_Intelligence) on English Wikipedia * [Waikiki](https://meta.wikimedia.org/wiki/Waikiki) project on the Meta-wiki