The Signpost

In the media

The world's Wikipedia gaps; Google and Wikipedia accused of tying Ben Carson to NAMBLA

A decade ago, Nkandla was the setting of an award-winning documentary, The Orphans of Nkandla, which resulted in the creation of The Africa Project. It is a matter of record that AIDS and poverty have ravaged many children's lives in KwaZulu-Natal. But Hay's observation about minor language versions of Wikipedia remains broadly correct. Indeed, a slide shown at Wikimania 2014 indicated that of Wikipedia's then-284 (today: 291) language versions,

The implications for quality are obvious.

Deploring Wikipedia's "cumbersome self-created bureaucracy and inter-editor sniping", Hay suggests that these global imbalances are unlikely to right themselves: while it may be tempting to think that the more established Wikipedias are bigger and more developed merely because they had several years' head start on smaller language versions, the smaller language versions show no sign of replicating the extraordinary boom the English Wikipedia underwent in its early years. In fact, Hay argues, the global volunteer base shrank by a third between 2007 and 2013.

Hay then proceeds to place his hopes in auto-translation apps, and reviews two multilingual projects:

  1. Omnipedia, a project being developed by researchers at Northwestern University, "capable of culling, comparing, and automatically translating data from 25 different Wikipedia language editions simultaneously, presenting them in simplified form", and
  2. Manypedia, an Italian project, online today, that "can automatically translate two Wikipedia articles side by side and point out incongruous information between them – or just translate an existing article into a different language".

Hay suggests that "complementary data from across all the world's Wikipedias" could be mined and translated "back to your native language site, thus attaining the online encyclopedia's egalitarian ideal". This is an overly optimistic view, given the present day's appalling, practically unreadable quality of many machine translations, which would leave prospective readers of Wikipedias stocked with machine translations profoundly frustrated – a point that can be verified by looking at some of Manypedia's article translations.

The English translation of the Persian article on "Third World" for example (enter http://www.manypedia.com/#!|en|Third_World|fa as the URL and click "Translate" in the right-hand panel) includes gems like

Imagine a Zulu reader trying to learn about physics or chemistry from a text that is as proficiently authored in Zulu as the above passage is clear and concise English.

There is little reason to argue with Hay's conclusion, however:

AK

Google and Wikipedia accused of tying Ben Carson to NAMBLA

United States presidential candidate Ben Carson

Breitbart accuses (Oct. 27) Wikipedia and Google of having prominently linked the name of Ben Carson, an acclaimed pediatric neurosurgeon and a Republican candidate for President of the United States in the 2016 presidential election, to a pedophile advocacy group, the North American Man-Boy Love Association (NAMBLA).

As evidence Breitbart shows a screenshot of a Google search results page, which lists Carson's Wikipedia biography as the top result (below the sponsored link and the "In the news" section), with "North American Man-Boy Love", "Seventh-day Adventist Church" and "Craniopagus twins" highlighted as hyperlinked key points in blue.

A Carson campaign spokesperson told Breitbart,

The spokesman blamed "pranksters" for the inappropriate highlight.

NAMBLA is mentioned in Wikipedia's biography of Carson because the term occurs in a 2013 comment of Carson's that is quoted verbatim in the article, and in which Carson said, "Marriage is between a man and a woman. No group, be they gays, be they NAMBLA, be they people who believe in bestiality, it doesn’t matter what they are. They don’t get to change the definition." (Carson subsequently apologized for the remark.) The acronym NAMBLA in the quotation has from time to time been hyperlinked in the Wikipedia article.

While the Carson team's frustration with the Google entry is understandable, it seems speculative to suggest that the hyperlink must have been placed so as to increase the term's chances of appearing in the Google snippet, or that Google staff specifically selected the term to appear in its snippet from the many available.

It bears mention though that according to Wikipedia's manual of style, quotations should generally remain free of hyperlinks. At the time of writing, the Google snippet no longer references NAMBLA. AK



Do you want to contribute to "In the media" by writing a story or even just an "in brief" item? Edit next week's edition in the Newsroom or contact the editor.
+ Add a comment

Discuss this story

These comments are automatically transcluded from this article's talk page. To follow comments, add the page to your watchlist. If your comment has not appeared here, you can try purging the cache.

WP:MOS says that linking from within the quote should be avoided "as much as possible". However, looking at the present version of the article I don't see a link to NAMBLA at all, and I imagine that even among Americans there are many who have absolutely no idea what that acronym actually refers to, so a link is appropriate. There will be no difference if we link to it inside or outside the quote so far as Google's rankings are concerned. And whatever their problem is, I doubt that one link in our article is what is to blame for it.

We should not be rearranging articles trying to decide on search rank. It's not our problem, and even if it were, maybe a lot of people actually search Google for Carson NAMBLA because they want to see this quote and use it for some purpose (whether anti gay marriage or anti Carson or whatever, none of our business). Or maybe there are just a lot of news sites and opinion pieces that talk about Carson and NAMBLA within a headline --- your own included!

Let's not focus on trying to guess what our article might make happen, but instead focus on having it be as informative as possible. He said this thing, it had a certain resonance in a certain segment of society, history needs to know about it. Wnt (talk) 18:26, 1 November 2015 (UTC)[reply]

  • Regarding the story about The world's Wikipedia gaps, as a contributor to the Wikipedia version in Norwegian Bokmål/Riksmål I don't think machine translation is the way, its just not good enough. The problem is our biggest competitor: The English language Wikipedia. A well-known Norwegian journalist said some years ago that media use Wikipedia all the time, the English language version, that is. Why someone would contribute to the version in Norwegian Bokmål/Riksmål was beyond his understanding.
What we need to do, both the WMF and local chapters is to highlight the inequality of Wikipedia. After all its not that bad in Europe, quite a few are able to read some English. But in places like Sri Lanka, where the local languages are next to non-existent on Wikipedia, it means that the local elites have all they need, while the rest is as bad off as ever. Ulflarsen (talk) 20:53, 1 November 2015 (UTC)[reply]

The problem with keeping quotes free of hyperlinks is that frequently without them the reader would be clueless as to what the quoted speaker is talking about. Hawkeye7 (talk) 21:15, 1 November 2015 (UTC)[reply]

Hawkeye7 I agree. I forget how I learned I'm not supposed to hyperlink in quotes, but there was one I included in an article and I knew people would be likely to want to learn the definition or significance of some term that was used. I didn't think there was any other way. While I've forgotten exact details, I'm pretty sure the quote would have been hard to understand without the hyperlink.— Vchimpanzee • talk • contributions • 19:43, 6 November 2015 (UTC)[reply]



       

The Signpost · written by many · served by Sinepost V0.9 · 🄯 CC-BY-SA 4.0