The Signpost
Single-page Edition
WP:POST/1
13 September 2010

News and notes
Page-edit stats, French National Library partnership, Mass page blanking, Jimbo on Pending changes
In the news
Public Policy Initiative, Houellebecq plagiarism?, Article revisions as book
Public Policy Initiative
Experiments with article assessment
Sister projects
Biography bloopers – update on the Death Anomalies collaboration
WikiProject report
Getting the picture – an interview with the Graphic lab
Features and admins
"Magnificent" warthog not so cute, says featured picture judge
Arbitration report
Tricky and Lengthy Dispute Resolution
Technology report
Bugs, Repairs, and Internal Operational News
 

Wikipedia:Wikipedia Signpost/2010-09-13/From the editors Wikipedia:Wikipedia Signpost/2010-09-13/Traffic report Wikipedia:Wikipedia Signpost/2010-09-13/In the media


2010-09-13

Bugs, Repairs, and Internal Operational News

Contribute  —  
Share this
By Jarry1250

Google Summer of Code: Sanyam Goyal

We continue a series of articles about this year's Google Summer of Code (GSoC) with Indian student Sanyam Goyal, who describes his attempt to overhaul the JavaScript component of the "Semantic" series of extensions, along with the core product itself, the "Semantic MediaWiki" extension. The extensions aim to realise the goals of the Semantic Web, where meaning is overlaid on top of content, allowing machines to more easily interpret it, but have not yet been approved for use on Wikimedia projects. Whilst somewhat controversial, the idea of a Semantic Web is considered by some including Tim Berners-Lee to be the future of the Web.


In brief

Not all fixes may have gone live to WMF sites at the time of writing; some may not be scheduled to go live for many weeks.

Wikipedia:Wikipedia Signpost/2010-09-13/Essay Wikipedia:Wikipedia Signpost/2010-09-13/Opinion


2010-09-13

Page-edit stats, French National Library partnership, Mass page blanking, Jimbo on Pending changes

Global page-edit statistics

Erik Zachte has posted an analysis of page edits on all Wikipedias by region (on his Infodisiac blog, a site dedicated to Wikimedia statistics). The analysis, similar to an earlier one focusing on global page views (see 18 January Signpost), was based on a 1 in 1000 sampling of Wikipedia's squid logs, and excludes known bots and web crawlers. While not perfectly accurate, the analysis does reveal several important editing trends:

Wikimédia France partnership with the French National Library

In April, the French chapter Wikimédia France signed an agreement with the Bibliothèque nationale de France (BnF, the National Library of France), to make about 1,400 public domain books from their digital library Gallica available for Wikisource (see 12 April Signpost).

A team of three volunteers from Wikimédia France then retrieved high-resolution image files (in the lossless but bulky TIFF format) and OCR files from the BnF, and produced DjVu files that were uploaded on Wikimedia Commons in July. The heavy compression used in conversion of image files to DjVu resulted in a substantial loss of quality. Since the support of TIFF was imminent (see Signpost coverage in April and August), all of the original, high-resolution TIFF files were uploaded on Wikimedia Commons at the end of August, for future reference.

The BnF's OCR files, which indicate the position of each word and all graphical elements such as illustrations in the books, allowed extraction of more than 22,000 image files, although many of them may be useless (detection errors, mere black lines), of limited interest (stamps, vignettes), or duplicates, and thus require human review before a mass-upload to Wikimedia Commons. Nonetheless, many interesting images, such as educational diagrams, novel illustrations, scientific schematics, portraits, and maps, were obtained. The team is currently investigating the possibility of making the files available to Wikisource contributors.

Darius Dhlomo, a Wikipedia contributor with more than 163,000 edits dating back to 2005, has been indefinitely blocked for extensive copyright infringements. Following debate on the user's talk page, the incident was transferred to contributor copyright investigations. Copy-pasted articles brought to light numbered almost 10,000 creations and possibly 25,000 infringements. Consensus was established for the automated mass blanking of all confirmed and suspected infringements by the user (about 17,000; see Task explanation) – roughly 10% of his article edits. Most of the articles are very short tabular stubs with little prose, explaining how they were not noticed for so long.

Manual repair efforts faltered due to the sheer number of articles. According to Uncle G, managing administrator and coder of the bot responsible for the mass blanking, the infringements were "on quite a large scale, and with a regular pattern." All articles created by Darius Dhlomo are now suspect and need to be reviewed for potential copyright infringement. The bot will roll back every article to the version immediately prior to Darius Dhlomo's first edit, based on a master list generated by VernoWhitney. The articles he created will not be deleted, but the bot will blank the page completely.

This short-term solution to the problem was announced on the project-wide watchlist notice; the long-term solution will require that editors review the copyright infringements and turn them into proper articles. The hope is that this Signpost article can help spread the word about user involvement in resolving the issue. Uncle G says this mountain can be moved "by a thousand teaspoons all digging together."

Jimbo weighs in on the Pending changes poll

Related articles
2010-09-13

Wikimedia Foundation endorses open-access petition to the White House; pending changes RfC ends
28 May 2012

The future of pending changes
16 April 2012

The pending changes fiasco: how an attempt to answer one question turned into a quagmire
29 August 2011

Bugs, Repairs, and Internal Operational News
11 October 2010

French million, controversial content, Citizendium charter, Pending changes, and more
27 September 2010

Page-edit stats, French National Library partnership, Mass page blanking, Jimbo on Pending changes
13 September 2010

Pending changes analyzed, Foundation report, Main page bias, brief news
6 September 2010

Pending changes poll, Public policy classes, Payment schemes debate, and more
23 August 2010

Collaboration with the British Museum and in Serbia, Interaction with researchers, and more
21 June 2010

Wikipedia better than Britannica, Pending changes as a victory of tradition, and more
21 June 2010

Bugs, Repairs, and Internal Operational News
14 June 2010

Pending changes goes live, first state-funded Wikipedia project concludes, brief news
14 June 2010

Hoaxes in France and at university, Wikipedia used in Indian court, Is Wikipedia a cult?, and more
14 June 2010

"Pending changes" trial, Chief hires, British Museum prizes, Interwiki debate, and more
7 June 2010


More articles

Jimbo Wales has made an Announcement about Pending Changes, having been asked to interpret the results of the Pending changes poll for the Foundation. Wales said his intent was to communicate the community's desires to the Foundation and not to act as a final authority on the matter. There is "absolutely no consensus for simply turning the system off and walking away", he said, citing the result of the poll (65/35% Support/Oppose, despite the large number of contributors who opposed the structure of the poll itself). He conceded there has been substantial, vocal, and articulate opposition to using a system of this kind at all, or to using it in its current form, and addressed three concerns:

Wales also took part in the ensuing discussion and responded to the comments on his page. Community members expressed their views following his statement on their concerns, suggesting an alternative straw poll for the future and discussing ways to resolve the issue in the meantime. Wales proposed a quick poll to determine what to do pending the availability of version 2.0, saying he has asked the Foundation for a firm schedule and will report back when he hears from them. The two proposed options for the poll would be to stop using the feature altogether or use it only on an evaluation basis. Rob Lanphier from the Foundation has advised that he will make a timeline available by September 17.

Briefly

Wikipedia:Wikipedia Signpost/2010-09-13/Serendipity Wikipedia:Wikipedia Signpost/2010-09-13/Op-ed Wikipedia:Wikipedia Signpost/2010-09-13/In focus


2010-09-13

Tricky and Lengthy Dispute Resolution

The Arbitration Committee opened no new cases, leaving one open.

Open case

Climate change (Week 14)

This case resulted from the merging of several Arbitration requests on the same topic into a single case, and the failure of a related request for comment to make headway. Innovations have been introduced for this case, including special rules of conduct that were put in place at the start. However, the handling of the case has been criticized by some participants; for example, although the evidence and workshop pages were closed for an extended period, no proposals were posted on the proposed decision page and participants were prevented from further discussing their case on the case pages (see earlier Signpost coverage).

The proposed decision, drafted by Newyorkbrad, Risker, and Rlevse, sparked a large quantity of unstructured discussion, much of it comprising concerns about the proposed decision (see earlier Signpost coverage). A number of users, including participants and arbitrator Carcharoth, made the discussion more structured, but the quantity of discussion has continued to increase significantly. Rlevse had said that arbitrators were trying to complete the proposed decision before last week but it was later made clear that he will no longer be voting on this decision. This week, arbitrators, particularly Shell Kinney, made further additions to the proposed decision and further attempts to manage the quantity of discussion.

Closed cases

Late yesterday, a request for clarification was filed in relation to the enforcement of this case - in particular, how discretionary sanctions should be enforced against editors. The filer, Littleolive oil, has also requested that the revert restriction that was imposed on her by Future Perfect at Sunrise be overturned by the Committee.

A request was filed two days ago to reimpose an Eastern European topic ban on Radeksz. The proposed topic ban was originally imposed at the conclusion of the case, but was lifted three months ago by the Committee. Skäpperöd, the filer, alleges that Radeksz has "returned to aggressive editing and battleground behaviour". Radeskz responded with an allegation that the filer, Skäpperöd, "regularly...resorts to attempts to have those who disagree with him banned rather than working on resolving the [content] dispute".

A request was filed three days ago to impose a topic ban on Ferahgo the Assassin from race and intelligence related articles. The filer, Wapondaponda, alleges that Ferahgo the Assasin is acting as a proxy or sockpuppet for Captain Occam - who was topic banned from the articles. Ferahgo the Assasin denied the allegation and has alleged that her contributions to the articles have been positive, while Captain Occam alleged that Wapondaponda is "drama-mongering". Wikipedia:Wikipedia Signpost/2010-09-13/Humour

If articles have been updated, you may need to refresh the single-page edition.



       

The Signpost · written by many · served by Sinepost V0.9 · 🄯 CC-BY-SA 4.0