Single-Page View Archives |
---|
| ||
Volume 3, Issue 41 | 8 October 2007 | About the Signpost |
| ||
(← Prev) | 2007 archives | (Next →) |
|
| |
Home | Archives | Newsroom | Tip Line | Shortcut : WP:POST/A |
|
This week, we introduce a new feature, the WikiProject Report. This week's report focuses on WikiProject Biography; suggestions for future projects to profile can be made on the tip line. This report is a work-in-progress, so I invite readers to suggest any changes in the format that you think might make it better.
This report has been added thanks to a suggestion from the survey concluded two weeks ago. We're working on other possibilities based on the responses; currently, I'm in the process of contacting a few key individuals within Wikimedia for another interview. I'll give more information once it becomes available. However, please continue to suggest new features if you think you've got a great idea.
Also, a request to add "previous" and "next" links to all features was added this week; you'll notice a new template at the bottom of each page, replacing the footer. I haven't yet gone through the archives to add the template to archived pages, so the links may not be that helpful for a few days. The design itself isn't set in stone, so if you've got a better idea, please let us know.
Thanks for reading the Signpost.
— Ral315
An academic study combining editing data with page view logs has added some new understanding about the quality and authorship of Wikipedia content. It concluded that frequent editors have the most impact on what Wikipedia readers see, while the effect of vandalism is small but still a matter of growing concern.
The results of the study are reported in a paper titled "Creating, Destroying, and Restoring Value in Wikipedia" (available in PDF), to be published in the GROUP 2007 conference proceedings. It was put together by a research group in the University of Minnesota department of computer science and engineering. Based on sampled data provided by the Wikimedia Foundation, showing every tenth HTTP request over a one-month period, they created a tool for estimating the page views for a Wikipedia article during a given timeframe.
In the absence of this type of data, previous studies have largely relied on an article's edit history for analysis. Interestingly, the study concluded that there is "essentially no correlation between views and edits in the request logs."
The study estimated a probability of less than one-half percent (0.0037) that the typical viewing of a Wikipedia article would find it in a damaged state. However, the chances of encountering vandalism on a typical page view seem to be increasing over time, although the authors identified a break in the trend around June 2006, late in the study period. They attributed this to the increased use of vandalism-repair bots.
Addressing the debate over "who writes Wikipedia", whether most of the work is done by a core group or occasional passersby, the study introduced a new metric which it called the "persistent word view" (PWV). This gives credit to the contributor who added a sequence of words to an article, combined with how many times article revisions with that contributor's words were viewed. The study came down largely in favor of the core group theory, concluding, "The top 10% of editors by number of edits contributed 86% of the PWVs". However, it may not necessarily refute Aaron Swartz's contention that the bulk of contributions often comes from users who have not registered an account; the Minnesota researchers excluded such edits from parts of their analysis, citing the fact that IP addresses are not stable.
The study built on previous designs for analyzing the quality of Wikipedia articles, notably the "history flow" method developed by a team from the MIT Media Lab and IBM Research Center and the color-coded "trust" system created by two professors from the University of California, Santa Cruz. In their own way, both earlier approaches focused on the survival of text in an article over the course of its edit history. Refining these with its page view data, the Minnesota study argued that "our metric matches the notion of the value of content in Wikipedia better than previous metrics."
Looking at the issue of vandalism, the study focused primarily on edits that would subsequently be reverted. Although the authors conceded this might include content disputes as well as vandalism, their qualitative analysis suggested that reverts served as a reasonable indicator of the presence of damaged content.
Statistically, they estimated that about half of all damage incidents were repaired on either the first or second page view. This fits in with the notion that obvious vandalism gets addressed as soon as someone sees it; even in the high-profile Seigenthaler incident it's unlikely that many readers saw the infamous version of the article at the time, as a previous Signpost analysis indicated. However, the study also found that for 11% of incidents, the damage persisted beyond an estimated 100 page views. A few went past 100,000 views, although the authors concluded after examining individual cases that the outliers were mostly false positives.
In a rare action, the English Wikipedia article about a country was moved to a different name last week — the article "Myanmar" became Burma. After an active debate, the requested move was carried out Tuesday, 2 October; as usual, the impact is more symbolic than real, since a redirect from one location would still always take you to the intended destination.
The immediate impetus for the move came from a surge in international attention being paid to events in Burma beginning last month. (The Signpost will follow the current convention and refer to the country as Burma in this article.) In response, Husond requested the move on 26 September, arguing that Burma is the most common English-language name, and that the Burmese opposition and many other countries do not recognize the name Myanmar. An extensive discussion followed, and after the requisite five days passed, Duja performed the move, concluding that "a significant majority of editors prefer Burma" while conceding that both sides had valid arguments.
Several themes provided the focal points for debate: whether it was appropriate to defer to an "official" name, either the Burmese government's choice of Myanmar or the English-speaking countries that use Burma; if the English Wikipedia should use a foreign-language name when an English one exists (an oft-debated similar situation exists for Côte d'Ivoire); and what the accepted standard is among professional news or academic sources. Even the question of what is the most common name, usually a critical inquiry for naming conventions, led people toward confusing and sometimes contradictory evidence. Many of the attempts to answer this question involved different forms of Google searches, with varying degrees of sophistication.
Burma's prominence in the news developed from mounting protests against the military government, whose legitimacy is not recognized by many Western countries. As the debate over the Wikipedia article was ongoing, coincidentally the media coverage turned to the cutoff of internet access in Burma, as the regime apparently sought to crack down on the circulation of news and images of the protests. Several reports in the press noted that Wikipedia also figured in the strategy of the protesters, with people constantly updating information on articles such as 2007 Burmese anti-government protests.
The country, like many others, is the focus of a WikiProject, where the name also was the focus of a dispute that produced a request for mediation in January. The process resulted in a change from using the name "WikiProject Burma/Myanmar" (also awkward because in the project namespace, it overlaps with subpage syntax) to "WikiProject Myanmar (Burma)". The accompanying category was not changed.
Scanning the interlanguage links shows that in other languages, the name of the corresponding article may use either form (with varying orthography). Myanmar appears to be more prevalent overall, but neither is consistently preferred. Curiously, the Norwegian Wikipedia has Burma in the Bokmål edition and Myanmar in its Nynorsk version (the two written standards for the Norwegian language).
This week's WikiWorld comic uses text from "W" and "Bert (Sesame Street)". The comic is released under the Creative Commons Attribution ShareAlike 2.5 license for use on Wikipedia and elsewhere.
Deleted! See discussion Commons:Commons:Deletion requests/File:W WikiWorld.png
A decision on the location of Wikimania 2008, the fourth-annual international Wikimedia conference, was announced on Tuesday. Alexandria's bid was declared the winning bid by Cary Bass on behalf of the Wikimania 2008 bid jury via an email to the foundation-l mailing list. Alexandria's bid was chosen from three cities making bids, the other two being Atlanta and Cape Town.
In his announcement, Bass pointed out that Alexandria "was found to be particularly strong in the areas of reflecting the Wikimedia Foundation's roots in geo-diversity and multi-lingualism, of the very exciting nature of the proposed venue and its local facilities, and of the particularly advanced nature of the financial planning." At the same time, he extolled the Atlanta bid for its community outreach and in-facility accommodation, as well as Cape Town's cultural diversity. Bass also pointed out that Cape Town's bid is the first very strong Wikimania bid originating from the Southern Hemisphere.
Meanwhile, no new information on Wikimania 2009 has been announced; as reported last week, official bidding will begin sometime this month, with a final decision expected sometime around December 15, 2007. So far, three unofficial bids have been worked on extensively: Buenos Aires, Singapore, and Perth. An expected bid from Toronto, following the withdrawal of their 2008 bid, has not yet been developed, and other cities have discussed hosting, with no major development of their potential bids. A list of all unofficial bids is also in development.
Magnus Manske created a tool called WatchFlickr, which runs on the toolserver. It runs through a category, finds articles without images, and searches for relevant free, Creative Commons-licensed images on Flickr. The tool can help with replacing non-free/fair use images, by allowing users to easily find free replacements on many related articles at once.
We reported on bug 9213 two months ago (see archived story). The bug caused problems with the 'you have new messages' bar for anonymous users; sometimes it wouldn't come up even though their talk page had been edited, and sometimes it wouldn't disappear even if they visited their talk page, bypassed their cache, or deleted their cookies; one of the implications of this was that when an anon received a warning message, they might not see it even if they had a static IP, and some anons complained about the constant new-messages bar that they had. The onwiki discussion page about the bug is Wikipedia:Administrator intervention against vandalism/Bug ID 9213.
This week, User:Tim Starling wrote a code correction (r26357) designed to fix this problem (among others); this was the first real indication that the cause of the problem had been found ("Bug 9213: Fixed the plainly broken user_newtalk updating and caching scheme."). The problem appears to have been with the way that data about users' new-message status was cached by the servers.
It was not clear immediately whether this correction had worked; the correction carried warnings (WARNING! NEEDS CAREFUL DEPLOYMENT and I tried to keep my changes roughly performance-neutral, but the update on Wikimedia should be watched carefully for performance problems.), the relevant bug tracker comment was more mildly worded than normal (Hopefully fixed in r26357.), and the bug was not closed at the same time (as is usual when a fix for a problem is 'committed' so that it will be applied on Wikimedia and other MediaWiki wikis). However, the change was applied at 18:00 UTC on 3 October (brion: scapped update including r26357 which tim warns may introduce performance problems. keep an eye out). In ais523's tests, the anon new-message bar now appears to be working, and the bug tracker entry is now marked "RESOLVED FIXED", indicating that the developers do not believe the bug poses a problem anymore. If the issues caused by this bug continue, they should be reported on bugzilla.
This week, the Finnish Arbitration Committee was elected; five users were elected to a two-year term, while five more were elected to one-year terms. Interestingly, in an election where the winners were determined by the number of support votes, all five of the users elected to one year terms were tied for 6th place, with 50 votes; an 11th user finished just short, with 49 votes. Full results of the election are available.
The Wikipedia problem - Apparently, co-founder Jimmy Wales gets about 10 emails a week from "students who end up in trouble because they cited the online encyclopedia in a paper and the information turned out to be wrong", but he has little sympathy for them. Wikipedia is increasingly becoming cited in undergraduate papers, and even professional librarians use it. However, the key is to treat Wikipedia as a starting point, and evaluate it for bias like any other work.
Wikipedia places a warning that using encyclopedias as a source may result in a failing grade at the top of its "Cite this article" page. (See Special:Cite/Abraham Lincoln, for example.)
Veni, Vidi, Wiki: Latin Isn't Dead On 'Vicipaedia' - The Wall Street Journal has an interesting article that covers the Latin Wikipedia, and the difficulty that it faces when writing about modern concepts that do not have native Latin expressions. For example, apart from the expected Roman history, the Vicipaedia contains an article on Britneia Spears (Britney Spears): "There isn't anything that doesn't belong in Vicipaedia", says one editor. As a result of this, battles are often raged to determine the correct formation of the Latin terms. It is said that the site is not used as a reference source as such, but as good practice for Latin practitioners and students.
Other mentions in the online media recently include:
WikiProject Biography is a WikiProject devoted to creating and improving Wikipedia's articles about people. The project was started by Ram-Man (talk · contribs) on 24 October 2002. It is now one of Wikipedia's most active projects, and currently has 400 members. Previously, it had a proposal to have project coordinators for all of its departments, but the page was marked as historical by Radiant! (talk · contribs) on 3 April 2007 because there had been no discussion about the proposal for one month.
The project's Summer 2007 Assessment drive began on 1 June 2007 and ended on 1 September 2007 with the hopes of lessening the project's backlog of around 113,000 unassessed, or unrated articles. As prizes, the top assessors, based on the number of articles they rated using the project's quality scale, received a barnstar for their work. By the end of the drive, over 100,000 articles were assessed, reducing the backlog by over half. 10,440 of the articles were assessed by the first prize winner, Ludahai. For comparison, the Spring 2007 drive, which came to a close on 24 March 2007, rated 40,000 articles; Sapphic assessed the most articles in that drive.
On 13 October, an IRC meeting will be held to help WikiProject members assess how to "revive" areas of the WikiProject that have gone stagnant.
The Biography project currently has 7 departments in order to split up and organize its workload.
Task forces and projects related to WikiProject Biography include:
While the project has a very large amount of members, it could always use more help. If you want to join it, you can visit its members page and sign up.
Five users were granted admin status via the Requests for Adminship process this week: hmwith (nom), JHunterJ (nom), Mtmelendez (nom), Nat (nom), and Samulili (nom).
Nine bots or bot tasks were approved to begin operating this week: SmackBot (task request), Detroiterbot (task request), ChandlerMapBot (task request), COBot (task request), SnakeBot (task request), CohesionBot (task request), CapitalBot (task request), SundarBot (task request), and OrBot (task request).
Eleven articles were promoted to featured status last week: Anabolic steroid (nom), Quneitra (nom), William Cooley (nom), Slavery in ancient Greece (nom), Authentic Science Fiction (nom), A Vindication of the Rights of Men (nom), The Apprentice (UK) (nom), Tau Ceti (nom), Brabham BT19 (nom), Golden Sun (nom), and Prince Louis of Battenberg (nom).
Seven lists were promoted to featured status last week: List of Powderfinger awards (nom), Prince of Wales Trophy (nom), William M. Jennings Trophy (nom), Lost (season 3) (nom), Timeline of Australian television (nom), List of colleges and universities in Vermont (nom), and List of Fate/stay night episodes (nom).
One portal was promoted to featured status last week: Portal:Africa (nom).
No topics or sounds were featured last week.
Four articles were de-featured last week: Henry Fonda (nom), Congo Free State (nom), Countdown (game show) (nom), and Doom (game) (nom).
No pictures, lists, portals, topics, or sounds were de-featured last week.
The following featured articles were displayed last week on the Main Page as Today's featured article: Eastern Suburbs & Illawarra railway line, 1981 Irish hunger strike, Kingdom Hearts, Orion, New York City, Pluto, and Guinea pig.
The following featured pictures were displayed last week on the Main Page as picture of the day: Alluvial Fan, Crater on Mars, Locust, Bougainville campaign, Maslenitsa, Great White Shark, and Einsatzgruppen.
Twelve pictures were promoted to featured status last week and are shown below.
This is a summary of recent technology and site configuration changes that affect the English Wikipedia. Note that not all changes described here are necessarily live as of press time; the English Wikipedia is currently running version 1.44.0-wmf.8 (f08e6b3), and changes to the software with a version number higher than that will not yet be active. Configuration changes and changes to interface messages, however, become active immediately.
The Arbitration Committee accepted one new case this week, and did not close any cases.