Welcome to the Wikidata Analytics portal!

WikidataCon 2019: Wikidata External Identifiers Similarity Graph

This is the starting point to navigate a set of analytic systems that monitor the growth, usage, and structure of Wikidata.

The Wikidata Concepts Monitor (WDCM) system is a set of applications that monitor the re-use of Wikidata across the Wikimedia projects. All Wikimedia projects that maintain a client-side usage tracking with Wikibase provide data on how our items are used in the respective wikies. The WDCM system encompasses several dashboards that monitor and analyze such data by putting them in various perspectives, e.g. the type of re-use, the similarity structure across wikies and Wikidata items in respect to their co-usage patterns, biases in Wikidata re-use, geographical distribution, and more.

The Analytics menu will point to a number of dashboards that monitor various important Wikidata statistics, e.g. the number of pageviews it receives across different namespaces, the ratio of human to bot edits across the Wikidata classes, the Wikidata total reuse and coverage in different Wikimedia projects, specific dashboards like the one that tracks the results of the Wikidata Reference Treasure Hunt Game, and similar.

The Structural Systems section points towards the most complex analytical systems that we currently maintain, the Wikidata Languages Landscape and the Wikidata External Identifiers Landscape. Both systems combine information on the re-use of Wikidata in Wikimedia projects with the structural, ontological patterns in some particular Wikidata classes (e.g. languages, external identifiers) and external datasets to provide in-depth understanding of the development of Wikidata in the respective domains.

Some of the most interesting and important datasets that we produce and use in Wikidata Analytics are reviewed and provided for download in the Datasets section. The Reports section points to a selection of Wikidata analytic reports, e.g. the Wikidata ORES Quality Report. More about our work can be found in the Resources and About sections.

Goran S. Milovanović, Data Scientist for Wikidata, Wikimedia Deutschland: goran.milovanovic_ext@wikimedia.de
This is free software: all content and code is GPL v2.0 licensed.