Wikipedia:Wikipedia Signpost/2010-09-13/From the editors Wikipedia:Wikipedia Signpost/2010-09-13/Traffic report Wikipedia:Wikipedia Signpost/2010-09-13/In the media
We continue a series of articles about this year's Google Summer of Code (GSoC) with Indian student Sanyam Goyal, who describes his attempt to overhaul the JavaScript component of the "Semantic" series of extensions, along with the core product itself, the "Semantic MediaWiki" extension. The extensions aim to realise the goals of the Semantic Web, where meaning is overlaid on top of content, allowing machines to more easily interpret it, but have not yet been approved for use on Wikimedia projects. Whilst somewhat controversial, the idea of a Semantic Web is considered by some including Tim Berners-Lee to be the future of the Web.
“ | I aimed to improve and extend the Javascript for Extension:Semantic MediaWiki and some of its spinoff extensions including Extension:Semantic Result Formats, Extension:Semantic Form Inputs, Extension:Semantic Drilldown and most notably Extension:Semantic Forms. This includes transferring over much of the Javascript to use the jQuery JavaScript library, which is now becoming a MediaWiki standard.
In terms of what was accomplished:
|
” |
Not all fixes may have gone live to WMF sites at the time of writing; some may not be scheduled to go live for many weeks.
Wikipedia:Wikipedia Signpost/2010-09-13/Essay Wikipedia:Wikipedia Signpost/2010-09-13/Opinion
Erik Zachte has posted an analysis of page edits on all Wikipedias by region (on his Infodisiac blog, a site dedicated to Wikimedia statistics). The analysis, similar to an earlier one focusing on global page views (see 18 January Signpost), was based on a 1 in 1000 sampling of Wikipedia's squid logs, and excludes known bots and web crawlers. While not perfectly accurate, the analysis does reveal several important editing trends:
In April, the French chapter Wikimédia France signed an agreement with the Bibliothèque nationale de France (BnF, the National Library of France), to make about 1,400 public domain books from their digital library Gallica available for Wikisource (see 12 April Signpost).
A team of three volunteers from Wikimédia France then retrieved high-resolution image files (in the lossless but bulky TIFF format) and OCR files from the BnF, and produced DjVu files that were uploaded on Wikimedia Commons in July. The heavy compression used in conversion of image files to DjVu resulted in a substantial loss of quality. Since the support of TIFF was imminent (see Signpost coverage in April and August), all of the original, high-resolution TIFF files were uploaded on Wikimedia Commons at the end of August, for future reference.
The BnF's OCR files, which indicate the position of each word and all graphical elements such as illustrations in the books, allowed extraction of more than 22,000 image files, although many of them may be useless (detection errors, mere black lines), of limited interest (stamps, vignettes), or duplicates, and thus require human review before a mass-upload to Wikimedia Commons. Nonetheless, many interesting images, such as educational diagrams, novel illustrations, scientific schematics, portraits, and maps, were obtained. The team is currently investigating the possibility of making the files available to Wikisource contributors.
Darius Dhlomo, a Wikipedia contributor with more than 163,000 edits dating back to 2005, has been indefinitely blocked for extensive copyright infringements. Following debate on the user's talk page, the incident was transferred to contributor copyright investigations. Copy-pasted articles brought to light numbered almost 10,000 creations and possibly 25,000 infringements. Consensus was established for the automated mass blanking of all confirmed and suspected infringements by the user (about 17,000; see Task explanation) – roughly 10% of his article edits. Most of the articles are very short tabular stubs with little prose, explaining how they were not noticed for so long.
Manual repair efforts faltered due to the sheer number of articles. According to Uncle G, managing administrator and coder of the bot responsible for the mass blanking, the infringements were "on quite a large scale, and with a regular pattern." All articles created by Darius Dhlomo are now suspect and need to be reviewed for potential copyright infringement. The bot will roll back every article to the version immediately prior to Darius Dhlomo's first edit, based on a master list generated by VernoWhitney. The articles he created will not be deleted, but the bot will blank the page completely.
This short-term solution to the problem was announced on the project-wide watchlist notice; the long-term solution will require that editors review the copyright infringements and turn them into proper articles. The hope is that this Signpost article can help spread the word about user involvement in resolving the issue. Uncle G says this mountain can be moved "by a thousand teaspoons all digging together."
Jimbo Wales has made an Announcement about Pending Changes, having been asked to interpret the results of the Pending changes poll for the Foundation. Wales said his intent was to communicate the community's desires to the Foundation and not to act as a final authority on the matter. There is "absolutely no consensus for simply turning the system off and walking away", he said, citing the result of the poll (65/35% Support/Oppose, despite the large number of contributors who opposed the structure of the poll itself). He conceded there has been substantial, vocal, and articulate opposition to using a system of this kind at all, or to using it in its current form, and addressed three concerns:
Wales also took part in the ensuing discussion and responded to the comments on his page. Community members expressed their views following his statement on their concerns, suggesting an alternative straw poll for the future and discussing ways to resolve the issue in the meantime. Wales proposed a quick poll to determine what to do pending the availability of version 2.0, saying he has asked the Foundation for a firm schedule and will report back when he hears from them. The two proposed options for the poll would be to stop using the feature altogether or use it only on an evaluation basis. Rob Lanphier from the Foundation has advised that he will make a timeline available by September 17.
Wikipedia:Wikipedia Signpost/2010-09-13/Serendipity Wikipedia:Wikipedia Signpost/2010-09-13/Op-ed Wikipedia:Wikipedia Signpost/2010-09-13/In focus
The Arbitration Committee opened no new cases, leaving one open.
This case resulted from the merging of several Arbitration requests on the same topic into a single case, and the failure of a related request for comment to make headway. Innovations have been introduced for this case, including special rules of conduct that were put in place at the start. However, the handling of the case has been criticized by some participants; for example, although the evidence and workshop pages were closed for an extended period, no proposals were posted on the proposed decision page and participants were prevented from further discussing their case on the case pages (see earlier Signpost coverage).
The proposed decision, drafted by Newyorkbrad, Risker, and Rlevse, sparked a large quantity of unstructured discussion, much of it comprising concerns about the proposed decision (see earlier Signpost coverage). A number of users, including participants and arbitrator Carcharoth, made the discussion more structured, but the quantity of discussion has continued to increase significantly. Rlevse had said that arbitrators were trying to complete the proposed decision before last week but it was later made clear that he will no longer be voting on this decision. This week, arbitrators, particularly Shell Kinney, made further additions to the proposed decision and further attempts to manage the quantity of discussion.
Late yesterday, a request for clarification was filed in relation to the enforcement of this case - in particular, how discretionary sanctions should be enforced against editors. The filer, Littleolive oil, has also requested that the revert restriction that was imposed on her by Future Perfect at Sunrise be overturned by the Committee.
A request was filed two days ago to reimpose an Eastern European topic ban on Radeksz. The proposed topic ban was originally imposed at the conclusion of the case, but was lifted three months ago by the Committee. Skäpperöd, the filer, alleges that Radeksz has "returned to aggressive editing and battleground behaviour". Radeskz responded with an allegation that the filer, Skäpperöd, "regularly...resorts to attempts to have those who disagree with him banned rather than working on resolving the [content] dispute".
A request was filed three days ago to impose a topic ban on Ferahgo the Assassin from race and intelligence related articles. The filer, Wapondaponda, alleges that Ferahgo the Assasin is acting as a proxy or sockpuppet for Captain Occam - who was topic banned from the articles. Ferahgo the Assasin denied the allegation and has alleged that her contributions to the articles have been positive, while Captain Occam alleged that Wapondaponda is "drama-mongering". Wikipedia:Wikipedia Signpost/2010-09-13/Humour