The Signpost

Technology report

Wikidata reaches 100,000 entries

Contribute  —  
Share this
By The ed17
The team behind Wikidata

Wikidata, the new "Wikimedia Commons for data" and the first new Wikimedia project since 2006, reached 100,000 entries this week. The project aims to be a single, human- and machine-readable database for common data, spanning across all Wikipedia projects, which will "lead to a higher consistency and quality within Wikipedia articles, as well as increased availability of information in the smaller language editions" while lowering the burden on Wikipedia's volunteer editors—whose numbers have stalled overall, and continue to dwindle on the English Wikipedia.

Wikidata is currently in the first of three phases. The site is currently only accepting interwiki links to different-language versions of a page. For example, the 100,000th entry, Cadier en Keer, has only a short description and four links to Wikipedia articles in English, French, Dutch, and Limburgish. The second phase will start the actual collection and storage of data, so that Cadier en Keer will contain basic statistics such as country, province, size, and population. It aims to supplement the infoboxes which many Wikipedias use to display this common data. The third phase will allow anyone to make lists and charts based on the statistics.

The project raised €1.3M (US$1.87M), for development from three major funders: half from Allen Institute for Artificial Intelligence, founded by Microsoft co-founder Paul Allen; a quarter from the Gordon and Betty Moore Foundation, established by Intel co-founder Gordon Moore; and a final quarter from Google, who said that "[our] mission is to make the world's information universally accessible and useful ... we hope [Wikidata] will make significant amounts of structured data available to all." It has eight developers actively working on its infrastructure.

The fast growth of what Linux User & Developer calls "Wikipedia's Game-changer"—over 100,000 entries in one month, with over 800 active users—bodes well for the site so far. In time, Wikidata's overarching goals may seem lofty: one of the original funders stated that "Wikidata ... will transform the way that encyclopedia data is published, made available, and used by a global audience. [It] will build on semantic technology that we have long supported, will accelerate the pace of scientific discovery, and will create an extraordinary new data resource for the world."

Yet even detractors believe that Wikidata has a high potential for expanding human knowledge in the world: "a primary goal ... [is] to make information in Wikipedia much more understandable to artificial intelligence systems. In other words, Wikidata—if successful—is going to form the 'brains' of many future technologies and online platforms."

In brief

+ Add a comment

Discuss this story

These comments are automatically transcluded from this article's talk page. To follow comments, add the page to your watchlist. If your comment has not appeared here, you can try purging the cache.
  • Wikidata hit 200,000 items yesterday. At this stage, not that humans have worked out most of the bugs, we're having bots do a lot of the importing (hence 100,000 items in five days) while the humans solve disambiguation issues, make sure that the items are properly labeled, and do documentation. Phase two, where we begin collecting the kind of data that is frequently seen in infoboxes, should be rolled out by January. That phase is going to be a lot more human driven, and so that's when I'd personally see the value in trying to recruit more people to come in. Sven Manguard Wha? 05:57, 30 November 2012 (UTC)[reply]



       

The Signpost · written by many · served by Sinepost V0.9 · 🄯 CC-BY-SA 4.0