The Signpost
Single-page Edition
WP:POST/1
10 September 2012

From the editor
Signpost adapts as news consumption changes
Op-ed
Fixing Wikipedia's help pages one key to editor retention
In the media
Author criticizes Wikipedia article; Wales attacks UK government proposal
Featured content
Not a "Gangsta's Paradise", but still rappin'
WikiProject report
WikiProject Fungi
Special report
Two Wikipedians set to face jury trial
News and notes
Researchers find that Simple English Wikipedia has "lost its focus"
Technology report
Mmmm, milkshake...
Discussion report
Closing Wikiquette; Image Filter; Education Program and Momento extensions
 

Wikipedia:Wikipedia Signpost/2012-09-10/From the editors Wikipedia:Wikipedia Signpost/2012-09-10/Traffic report


2012-09-10

Author criticizes Wikipedia article; Wales attacks UK government proposal

Contribute  —  
Share this
By The ed17

Philip Roth's open letter to Wikipedia

Anthony Hopkins played the lead character in the film adaptation of Philip Roth’s The Human Stain, the subject of Roth’s complaint

Philip Roth, a widely known and acclaimed American author, wrote an open letter in the New Yorker addressed to Wikipedia this week, alleging severe inaccuracies in the article on his The Human Stain (2000).

The saga began on Wikipedia in late August, when an IP editor—claiming to be Roth’s official biographer—removed this paragraph from the article:

Salon.com critic Charles Taylor argues that Roth had to have been at least partly inspired by the case of Anatole Broyard, a literary critic who, like the protagonist of The Human Stain, was a man identified as Creole who spent his entire professional life more-or-less as white.[1] Roth states there is no connection, as he did not know Broyard had any black ancestry until an article published months after he had started writing his novel.[2]

The IP was reverted within a minute, with the edit summary "Can you verify that?" Nineteen minutes after the revert, the IP removed the paragraph again, saying "the reference to Anatole Broyard ... is wholly inaccurate and therefore pointless. I am Roth's biographer, and have removed it at his request." The article was edited again six minutes later by Parkwells (talk · contribs), who over the next two hours added a significant amount of content to the article. The entire process was seemingly concluded within three total hours, and the article remained in this state until Roth’s open letter was published on 7 September. The new content relevant to Roth’s complaint read:

Kakutani and other critics were struck by the parallels to the life of Anatole Broyard, a writer and the New York Times literary critic in the 1950s and 1960s who was of Louisiana Creole mixed-race descent and passed for white.[3][4][5]

Roth said that he had not learned about Broyard's ancestry until after starting to write this novel.[6]

In this open letter, which was first brought to the community's attention by a Wikimedia Foundation employee, Roth was highly critical of Wikipedia. Interestingly, the ‘interlocutor’, most likely Roth's biographer, either emailed or was emailed by an English Wikipedia administrator, who said that removing the claim would require "secondary sources", even though they acknowledged that "the author is the greatest authority on their own work." This email is what led Roth to publish in the New Yorker, giving the real inspiration for the novel and its protagonist, Coleman Silk, in great detail: the experience of Melvin Tumin, a long-tenured professor of sociology at Princeton, with a seemingly innocuous question which turned into multiple major accusations of racism.

The question, "Does anyone know these people? Do they exist or are they spooks?", was prompted by the constant absence of two students from his class. Unfortunately for Tumin, 'spooks' happened to be an old derogatory term for African Americans, and both students turned out to be from that race. It was only several months later that Tumin could clear his name, after "several lengthy depositions" and what Roth described as a "witch hunt".

It appears that while Wikipedia was correct both before and after the removals—the article versions noted that the claim was a literary reviewer's opinion, and that Roth had rebutted the claim—the article never stated the genesis of the book, leaving the Wikipedia article with phrases Roth called "not from the world of truthfulness but from the babble of literary gossip".

However, it also appears that the open letter was the first time Roth has identified Tumin's story as the basis for The Human Stain.

References

  1. ^ Taylor, Charles (April 24, 2000). "Life and life only". Salon.com.
  2. ^ Philip Roth interview at bloomberg.com
  3. ^ Taylor (2000), "Lie and Life Only, Salon, Quote: "The thrill of gossip become literature hovers over “The Human Stain”: There’s no way Roth could have tackled this subject without thinking of Anatole Broyard, the late literary critic who passed as white for many years. But Coleman Silk is a singularly conceived and realized character, and his hidden racial past is a trap Roth has laid for his readers..."
  4. ^ Lorrie Moore, "The Wrath of Athena", New York Times, 7 May 2000, accessed 20 August 2012. Quote: "In addition to the hypnotic creation of Coleman Silk -- whom many readers will feel, correctly or not, to be partly inspired by the late Anatole Broyard -- Roth has brought Nathan Zuckerman into old age, continuing what he began in American Pastoral."
  5. ^ Brent Staples, "Editorial Observer; Back When Skin Color Was Destiny, Unless You Passed for White", New York Times, 7 September 2003, accessed 25 January 2011. Quote: "This was raw meat for Philip Roth, who may have known the outlines of the story even before Henry Louis Gates Jr. told it in detail in 'The New Yorker' in 1996. When Mr. Roth's novel about passing -- The Human Stain -- appeared in 2000, the character who jettisons his black family to live as white was strongly reminiscent of Mr. Broyard."
  6. ^ Robert Hilferty (2008-09-16). "Philip Roth Serves Up Blood and Guts in 'Indignation' (Update1)". Bloomberg. I knew Anatole slightly, and I didn't know he was black. Eventually there was a New Yorker article describing Anatole's life written months and months after I had begun my book.

In brief


2012-09-10

Mmmm, milkshake...

August engineering report published

In August 2012:
  • 97 unique committers contributed patchsets of code to MediaWiki (up by five from June)
  • The total number of unreviewed commits steady at 360.
  • About 35 shell requests were processed (no change).
  • 25 developers received developer access to Git and Wikimedia Labs (down by 55).
  • Wikimedia Labs now hosts 120 projects (up by six), 214 instances (up by three) and 587 users (up by 28).

Engineering metrics, Wikimedia blog

A slide outlining Echo, a new notifications system that would replace watchlists and is already in development

The Wikimedia Foundation's engineering report for August 2012 was published this week on the Wikimedia Techblog and on the MediaWiki wiki, giving an overview of all Foundation-sponsored technical operations in that month (as well as brief coverage of progress on Wikimedia Deutschland's Wikidata project, phase 1 of which is edging its way towards its first deployment). Three of the four headline items in the report have already been covered in the Signpost: the site outage caused by a fibre cut in early August, refinements to the active editors metric, and major work on the Wiki Loves Monuments app, launched last week. The report drew attention to the work of the WMF's Internationalization team on the Universal Language Selector (ULS; see previous Signpost coverage), Project Milkshake to create "generic jQuery components for commonly needed internationalisation features" and WebFonts.

Other items covered in the report include work by WMF Performance Engineer Asher Feldman to expand the number of MySQL servers in the Foundation's secondary data centre Ashburn, and by Tim Starling to write a new Redis-based client for session handling. The two developments are linked insofar as both will be needed for the Foundation to meet its target of making the Virginia site Wikimedia's primary data centre within the next quarter, in an attempt to boost performance. Elsewhere, WMF Security Engineer Chris Steipp worked on adding two new major features to the AbuseFilter extension (global rules and global throttling), for improved detection and prevention of cross-wiki spam. It was, however, a slower month for the TimedMediaHandler, Echo, OAuth and ResourceLoader 2.0 projects.

The monthly report is also a good source of tech news that had otherwise slipped under the radar—in this case, that work on Flow (a talk page reform project) will start in January and that a new mirror for Wikimedia data has been found ("network management solutions" firm), who have also agreed to replicate non-essential data such as "page view files, archives, and more, as well as a full copy of our media files". This month saw the creation of the "Micro Design Improvements" team, an ad-hoc group of staffers who will look at "small but useful design" improvements for MediaWiki, including this week a proposed reworking on the edit window.

Among other news, the first Wikipedia Engineering Meetup (held on 15 August in WMF headquarters in San Francisco), first mentioned in last month's report, attracted approximately 100 developers. The series of two-monthly meetings is intended "to showcase Wikimedia's interesting engineering problems and products to the local developer community"; the inaugural meetup "featured talks about Mobile engineering, Analytics and the VisualEditor".

In brief

Signpost poll
MediaWiki Foundation
You can now give your opinion on next week's poll: Would you be interested in downloading a similar Signpost app onto your iOS device (iPhone, iPad, iPod Touch)?

Not all fixes may have gone live to WMF sites at the time of writing; some may not be scheduled to go live for several weeks.

Wikipedia:Wikipedia Signpost/2012-09-10/Essay Wikipedia:Wikipedia Signpost/2012-09-10/Opinion


2012-09-10

Researchers find that Simple English Wikipedia has "lost its focus"

Readability of Simple English and English Wikipedias called into question

In its September issue, the peer-reviewed journal First Monday published The readability of Wikipedia, reporting research which shows that the English Wikipedia is struggling to meet Flesch reading ease test criteria, while the Simple English Wikipedia has "lost its focus".

The statistical method developed by Flesch (1948) focuses on two core components of the concept of readability: word length and sentence length. The test is widely used in the US, with areas of application ranging from Pentagon files to life insurance policies. The concept has been adapted for other languages, including a German version by Toni Amstad (1978). The Flesch test uses the following formula to indicate the readability of a given text:[1]

Higher scores indicate material that is easier to read, and lower scores that it is more difficult. While in theory the results can vary widely due to the artificial construction of very complex or simple sentences, in practice natural English typically results in a score between 0 and 100, which can be interpreted as shown in the table.

Score Notes
90.0–100.0 Very easy
80.0–90.0 Easy
70.0–80.0 Fairly easy
60.0–70.0 Standard
50.0–60.0 Fairly difficult
30.0–50.0 Difficult
0.0–30.0 Very difficult

The authors assume that the English Wikipedia should score around 60–70 on average ("standard"), and Simple English, which explicitly aims at audiences with less advanced literacy skills, around 80 ("easy"). An older study, Besten and Dalle (2008), had found on the basis of the same test method that the overall readability of Simple had decreased from around 80 in 2003 to just above 70 in 2006.

The 2012 study examined two 2010 database dumps it sampled from English Wikipedia and Simple. For the study, the scientists filtered out lists, redirects, and disambiguation pages, and removed components such as tables, headings, and images. Thus, the study examined 88% of the English and 85% of Simple Wikipedia's articles in the database dump. In a second step, the methodology excluded short articles with fewer than six sentences (due to their likely wide fluctuation in readability).

The analysis found that English Wikipedia articles scored 51 on average ("fairly difficult") with more than 70% of all articles scoring less than the set goal of 60 ("standard"). Simple scored 62 on average ("standard") with 95% of all entries below the set 80 ("easy") goal. In addition, a set of around 9600 respective articles was comparable between both Wikipedia versions; Simple scored 61 on these, while the related English Wikipedia articles scored 49.

The paper argues that the creation of Simple as a solution for readability issues of the English Wikipedia with some audiences has run into difficulties. The average reading ease of Simple, while still above the English Wikipedia, declined compared to the findings of Besten and Dalle in 2008 (2003: 80, 2006: just above 70) to 62 on average. Based on the outlined methodology, the authors conclude that Simple has "lost its focus … this version now seems suitable for the average reader, instead of aiming at those with limited language abilities."

The English Wikipedia findings indicate that the results of another study in 2010, focusing on the readability of English Wikipedia entries on cancer (Signpost coverage), cannot be fully generalized. The paper in 2010 found that articles in the targeted topic area scored about 30 on average.

However, both studies show that the English Wikipedia potentially excludes major segments of the English-speaking world, including (for example) large parts of the US public. According to a major study on literacy in the US in 2002, 21–23% (extrapolated: more than 40 million people) "demonstrated skills in the lowest level of prose, document, and quantitative proficiencies".

The authors of the study on readability of Wikipedia have set up a demo site where users can calculate the readability of English and Simple English Wikipedia pages based on the automatic measure they deployed in the paper.

Brief notes

  • WMF RfC: The ongoing RfC on whether to establish a legal fees assistance program for volunteer role-specific risks that go beyond the contributor defense policy already in place is in full swing.
  • Audit committee call for volunteers: The WMF audit committee, overseeing financial and audit issues on behalf of the WMF board, has published a call for volunteers to serve on the 2012–13 committee.
  • English Wikipedia report
    • Arbitration Committee: the committee took several actions this week, including a desysop and passing two motions regarding prior cases (Falun Gong 2, The Troubles). Four clarification and amendment requests remain open.
    • Main-page redesign competition: 23 proposals have been lodged, and discussion about the issue continues on the competition talk page. Editors are welcome to submit their own proposals until 30 September.
    • Pending Changes update: The Request for Comment on Pending Changes Level Two continues. Currently, there is an 11–2 vote on the following proposition: "... if there's no clear consensus for one particular point of view after a week (which seems likely at the moment), is there any objection to trying to come up with a third option in a 'committee' that anyone could join?" That committee page is at WP:PC2012/Committee.

Wikipedia:Wikipedia Signpost/2012-09-10/Serendipity


2012-09-10

Fixing Wikipedia's help pages one key to editor retention

Peter Coombe is a community fellow with the Wikimedia Foundation who is researching help pages. He is also an editor on English Wikipedia, Wikimedia Commons, and Wikinews under the username the wub.
The views expressed in this op-ed are those of the author only. Readers are invited to respond or offer critical commentary in the comments section, while those wishing to author their own op-ed may use the Signpost's opinion desk.


Peter Coombe, Wikimedia community fellow

Much like article content, the English Wikipedia's help pages have grown organically over the years. Although this has produced a great deal of useful documentation, with time many of the pages have become poorly maintained or have grown overwhelmingly complicated. There are several issues with the current network of help pages:

  • Too long. Wikipedia:Citing sources is one of the most viewed of all help pages, and it's more than 8000 words long.
  • Widely varying complexity. If you're a new user who wants to learn what the "edit summary" box is for, you're probably not interested in precisely which MediaWiki message defines the autogenerated summary for page blanking. But we told you anyway (at least we used to—thankfully someone has removed this now).
  • Accumulated cruft. Early on in my fellowship I found a help page that was just about how to draw chemical structures using ASCII art. So I thought, that's quaint, it's from 2003, obviously things were a bit different then. I checked with the people at WikiProject Chemicals, and it turns out ASCII art was never encouraged. This was just someone's random page, in Wikipedia space, telling people how to do something that had always been against guidelines. (It was deleted).
  • Navigation is poor. One thing people complain about is that after they arrive at a useful page, it's often hard to find it again. A pattern we see over and over is people adding links on their own userpage when they find useful pages, building their own navigation system, so that they don't lose the useful page. People really shouldn't have to resort to this: pages should be findable.
  • Fragmentation and duplication. We have at least four pages about how to make tables in wiki markup, and it's unclear which is most suitable for a given problem. And they all need to be kept up to date.

For some idea of the scale of the problem, the main help landing page – Help:Contents – now gets around 10,000 hits a day. It's my belief that improving Wikipedia's help system is one of the most important steps we can take to improve editor recruitment and retention. That's why for the past few months I've been working as a Wikimedia community fellow to research how we can improve help pages.


How disorganised is this? Partial map of Wikipedia's help pages, created by User:Ironholds. Full version (5 MB png)


Community project

There were already some community attempts to improve the help pages, such as the Help Project, but sadly they had become fairly inactive. Because improving and maintaining the help pages is such a huge ongoing task, there's no way I can do it on my own, and I really don't want the efforts to end with my fellowship. Therefore I've worked to revive the project: the homepage received a major overhaul, and there's now a monthly newsletter for members and a regularly updated statistics page covering all the pages within scope. (If you want to get involved, or just keep up with the latest developments, do sign up!)

Learning more

The next major step was a large survey, taking in both new and experienced users, to find out what they are looking for help on, how they find it, and what they think of the existing pages. The full results and conclusions are available. Unsurprisingly, "how to use wiki markup" and "how to start a new page" are the most popular topics among new users. What is surprising is that people rate the help on these topics as ok. It's still not great and could certainly use improvement, but it's better than the others. The help topics people really didn't like are how to add references and how to add images. This is fairly consistent across all experience ranges, from newly registered users with no edits to old hands with thousands of edits.

Recently we've also been able to deploy the article feedback tool to help pages. This should allow us to get extremely valuable feedback: until this deployment and the survey I conducted, we really had very little evidence on what users thought of them.

Tutorials

One thing the Help Project created in the past were a couple of "Introduction to ..." tutorials: Introduction to policies and guidelines and Introduction to talk pages. These focused tutorials have a friendly tone and don't overwhelm new users with details. They've been very well received by the new users who find them, so I decided to make the tutorials we do have more prominent, and developed new tutorials in the same vein on topics that the survey suggested would be valuable: Referencing, uploading images, and navigating Wikipedia. These are brand new, so please edit and improve them! But do try to avoid making them too long and detailed, or adding too many links.

The new tutorials make use of vertical tabs, which were well received in usability tests.


"Helped by people"

Probably the clearest finding in the survey is that experienced editors love the results they get from asking questions on another user's talk page, but new users aren't really aware of that as an option. The same is true to a lesser extent of asking questions at the Help Desk, or in IRC, as clinched by one respondent's comment: "I was helped by people, not help pages." The personal touch certainly seems to ease things along, and that's why it's great that we have new initiatives like the Teahouse, and more friendly warning messages that explicitly invite questions on the warner's talk page. Part of my work to redesign the navigation will try to make these question pages more visible to those who could benefit from them. This is particularly true of the Reference Desk, even though the article feedback tool has only been deployed for a few days we have already seen many factual questions appearing in the feedback for help pages.

Making help findable

Over the next month I'll be focusing on the final part of my project: redesigning Help:Contents, the main entry point into our help system. At the moment this page is a mess, with too many subpages, too many links, and not enough explanation. Many of the links that do exist are misleadingly labelled. This was borne out by usability tests I conducted, where people found it difficult to navigate and find the help they were looking for.

A serious problem with Help:Contents is that it has to speak to many different audiences:

  • Readers looking for info ("-1" edits). The biggest volume. Not a primary target of the fellowship, but we certainly need to better support them, help them find places like the Reference desk, and maybe we could entice them to edit with calls to action.
  • Readers who want editors to fix something. Similar to the above.
  • Brand new editors (0–1 edits) looking to get started, most likely with step-by-step tutorials and hand-holding.
  • New editors (10–100 edits) looking for specific help on a topic.
  • Experienced editors and admins (100+ edits) looking for more advanced help.

At the moment links relevant to these different groups are all mixed together. The aims of the redesign are to better funnel these different users to where they can get the right kind of help, and to better expose the personal help mentioned previously for those who want it—and, one hopes, to make the page look a bit more attractive too! Again I'll be doing usability tests to try to identify any potential problems with the new design, and to confirm that it's better than the existing one.

How to get involved

If you're interested in improving help pages, please do join the Help Project and the discussions on its talk page. There are also some open tasks you could get started on. It's going to be a long haul, but this work is something that could really make a big difference to the future of Wikipedia. Wikipedia:Wikipedia Signpost/2012-09-10/In focus Wikipedia:Wikipedia Signpost/2012-09-10/Arbitration report Wikipedia:Wikipedia Signpost/2012-09-10/Humour

If articles have been updated, you may need to refresh the single-page edition.



       

The Signpost · written by many · served by Sinepost V0.9 · 🄯 CC-BY-SA 4.0