The Signpost

In this issue
+ Add a comment

Discuss this story

Inconsistent information

In the second sentence, I read "a decrease of 281,624 articles in the English Wikisource (a 27% drop)", then later in the page, I read "in the English Wikisource, which increased by 281,199 (a 27% drop)". Signpost 2015-04-06 says "a decrease of 281,624 articles in the English Wikisource (a 27% drop)". What's the right figures ? Cantons-de-l'Est (talk) 23:46, 21 May 2015 (UTC)[reply]

Sharp eyes! The first and last statements you have quoted are based on stats collected by a Perl script that I (try to) run manually each day. The "decrease of 281,624 articles" occurred between 02:56:35 UTC on 2015-03-29 and 01:47:31 UTC on 2015-03-30. The second statement is based on stats collected by EmausBot and posted to the various "/Tables" pages at Meta; see m:Article counts revisited/2015-03-29 changes to all recounted wikis for more details. In the case of the English Wikisource, the second quote should say "which decreased by 281,199" (I accidentally switched increase and decrease when I typed it up); this change occurred between 12:00 UTC on 2015-03-27 and 12:00 UTC on 2015-03-31. The differences in numbers are purely due to the different times at which the data were collected. This wouldn't have been an issue if I had saved a copy of the data my Perl script collected on 2015-03-29 and 2015-03-30, but I didn't think to do that before I ran the script too many times to still have access to the older information. (Oops.) - dcljr (talk) 22:53, 23 May 2015 (UTC)[reply]

Database hicups

You missed db hickups as a possible reason for article count mismatch. Updating the article count is not done in the same transaction as making the edit, so every now and then, something weird will probably happen and an article will not get counted. Over time, this probably makes a difference. Bawolff (talk) 03:40, 22 May 2015 (UTC)[reply]

Stateless, historically and globally consistent measurements

People interested in the issue of how to measure Wikimedia projects growth consistently may want to take a look at our work on metrics standardization and some of the principles we proposed to identify robust metrics. As to the issue of quality vs quantity, check out EpochFail's work on m:Research:Measuring_value-added and m:Research:WikiCredit --DarTar (talk) 19:46, 22 May 2015 (UTC)[reply]

Disambiguation pages

What it is not taken into account to exclude from the article count, like the redirects, are the thousands of disambiguation pages that by definition they are also not articles. Since they are already treated differently by MediaWiki, it would be easy to do. --geraki TL 07:47, 23 May 2015 (UTC)[reply]

Hmm. That definition seems to be what the English Wikipedia considers an "article" specifically for the purposes of its own guidelines and policies. You say it's "easy" to do, but I'm not sure how quickly pages can be checked in this way; it might be too slow to implement on the English WP, for example. (But IANAD, so I don't know.) - dcljr (talk) 00:07, 24 May 2015 (UTC)[reply]

Other projects

Any reasons why Commons and Wikispecies aren't recounted? I would imagine that Commons is tricky because it's more about files and media than pages. But Wikispecies seems straightforward. I do seem a potential recounting issue in Wikispecies. A lot of taxon authority pages only contain their nationality, area of classification (e.g. botany) and their birth date and death date (or year). These authority pages will have mainspace articles pointing into them but if they don't have a comma, category, image, or interwiki link, these authority pages will be excluded from the count. OhanaUnitedTalk page 00:32, 26 May 2015 (UTC)[reply]


The Signpost · written by many · served by Sinepost V0.9 · 🄯 CC-BY-SA 4.0