The Signpost

Recent research

Wikipedia can increase local tourism by 9%; predicting article quality with deep learning; recent behavior predicts quality

Contribute   —  
Share this
By Marco Chemello, Federico Leva, Morten Warncke-Wang, and Tilman Bayer

A monthly overview of recent academic research about Wikipedia and other Wikimedia projects, also published as the Wikimedia Research Newsletter.

"Wikipedia matters": a significant impact of user-generated content on real-life choices

Reviewed by Marco Chemello and Federico Leva

Improving Wikipedia articles may contribute to increasing local tourism. That's the result of a study[1] published as preprint a few weeks ago by M. Hinnosaar, T. Hinnosaar, M. Kummer and O. Slivko. This group of scholars from various universities – including Collegio Carlo Alberto, the Center for European Economic Research (ZEW) and Georgia Institute of Technology – led a field experiment in 2014: they expanded 120 Wikipedia articles regarding 60 Spanish cities and checked the impact on local tourism, by measuring the increased number of hotel stays in the same cities from each country. The result was an average +9 % (up to 28 % in best cases). Random city articles were expanded mainly by translating contents taken from the Spanish or the English edition of Wikipedia into other languages, and by adding some photos. The authors wrote: "We found a significant causal impact of user-generated content in Wikipedia on real-life choices. The impact is large. A well-targeted two-paragraph improvement may lead to a 9 % increase in the visits by tourists. This has significant implications both in macroeconomic and microeconomic scale."

The study revises an earlier version[supp 1] which declared the data was inconclusive (not statistically relevant yet) although there were hints of a positive effect. It's not entirely clear to this reviewer how the statistical significance was ascertained, but the method used to collect data was sound:

Curiously, while the authors had no problems adding their translations and images to French, German and Italian Wikipedia, all their edits were reverted on the Dutch Wikipedia. Local editors may want to investigate what made the edits unacceptable: perhaps the translator was not as good as those in the other languages, or the local community is prejudicially hostile to new users editing a mid-sized group of pages at once, or some rogue user reverted edits which the larger community would accept? [PS: One of our readers from the Dutch Wikipedia has provided some explanations.]

Assuming that expanding 120 stubs by translating existing articles in other languages takes few hundreds hours of work and actually produces about 160,000 € in additional revenue per year as estimated by the authors, it seems that it would be a bargain for the tourism minister of every country to expand Wikipedia stubs in as many tourist languages as possible, also making sure they have at least one image, by hiring experienced translators with basic wiki editing skills. Given that providing basic information is sufficient and neutral text is generally available in the source/local language's Wikipedia, complying with neutral point of view and other content standards seems to be sufficiently easy.

Improved article quality predictions with deep learning

Reviewed by Morten Warncke-Wang

A paper at the upcoming OpenSym conference titled "An end-to-end learning solution for assessing the quality of Wikipedia articles"[2] combines the popular deep learning approaches of recurrent neural networks (RNN) and long short-term memory (LSTM) to make substantial improvements in our ability to automatically predict the quality of Wikipedia's articles.

The two researchers from Université de Lorraine in France first published on using deep learning for this task a year ago (see our coverage in the June 2016 newsletter), where their performance was comparable to the state-of-the-art at the time, the WMF's own Objective Revision Evaluation Service (ORES) (disclaimer: the reviewer is the primary author of the research upon which ORES' article quality classifier is built). Their latest paper substantially improves the classifier's performance to the point where it clearly outperforms ORES. Additionally, using RNNs and LSTM means the classifier can be trained on any language Wikipedia, which the paper demonstrates by outperforming ORES in all three of the languages where it's available: English, French, and Russian.

The paper also contains a solid discussion of some of the current limitations of the RNN+LSTM approach. For example, the time it takes to make a prediction is too slow to deploy in a setting such as ORES where quick predictions are required. Also, the custom feature sets that ORES has allow for explanations on how to improve article quality (e.g. "this article can be improved by adding more sources"). Both are areas where we expect to see improvements in the near future, making this deep learning approach even more applicable to Wikipedia.

Recent behavior has a strong impact on content quality

Reviewed by Morten Warncke-Wang

A recently published journal paper by Michail Tsikerdekis titled "Cumulative Experience and Recent Behavior and their Relation to Content Quality on Wikipedia"[3] studies how factors like an editor's recent behavior, their editing experience, experience diversity, and implicit coordination relate to improvements in article quality in the English Wikipedia.

The paper builds upon previous work by Kittur and Kraut that studied implicit coordination,[supp 2] where they found that having a small group of contributors doing the majority of the work was most effective. It also builds upon work by Arazy and Nov on experience diversity,[supp 3] which found that the diversity of experience in the group was more important.

Arguing that it is not clear which of these factors is the dominant one, Tsikerdekis further extends these models in two key areas. First, experience diversity is refined by measuring accumulated editor experience in three key areas: high quality articles, the User and User talk namespaces, and the Wikipedia namespace. Secondly, editor behavior is refined by measuring recent participation in the same three key areas. Lastly he adds interaction effects, for example between these two new refinements and implicit coordination.

Using the more refined model of experience diversity results in a significant improvement over baseline models, and an interaction effect shows that high coordination inequality (few editors doing most of the work) is only effective when contributors have low experience editing the User and User talk namespaces. However, the models that incorporate recent behavior are substantial improvements, indicating that recent behavior has a much stronger impact on quality than overall editor experience and experience diversity. Again studying the interaction effects, the findings are that implicit coordination is most effective when contributors have not recently participated in high quality articles, and that contributors make a stronger impact on content quality when they edit articles that match their experience levels.

These findings ask important questions about how groups of contributors in Wikipedia can most effectively work together to improve article quality. Future work is needed to understand more about when explicit coordination is most useful, and the paper points to the possibility of using recommender systems to route contributors to groups where their experience level can make a difference.

Briefly

Predicting book categories for Wikipedia articles

Reviewed by Morten Warncke-Wang

"Automatic Classification of Wikipedia Articles by Using Convolutional Neural Network"[4] is the title of a paper published at this year's Qualitative and Quantitative Methods in Libraries conference. As the title describes, the paper applies convolutional neural networks (CNN) to the task of predicting the Nippon Decimal Classification (NDC) category that a Japanese Wikipedia article belongs to. This NDC category can then be used for example to suggest further reading, providing a bridge between the online content of Wikipedia and the books that are available in Japan's libraries.

In the paper, a Wikipedia article is represented as a combination of Word2vec vectors: one vector for the article's title, one each for the categories it belongs to, and one for the entire article text. These vectors combine to form a two-dimensional matrix, which the CNN is trained on. Combining the title and category vectors results in the highest performance, with 87.7% accuracy in predicting the top-level category and 74.7% accuracy for the second-level category. The results are promising enough that future work is suggested where these will be used for book recommendations.

The work was motivated by "recent research findings [indicating] that relatively few students actually search and read books," and "aims to encourage students to read library books as a more reliable source of information rather than relying on Wikipedia article."

Conferences and events

See the research events page on Meta-wiki for upcoming conferences and events, including submission deadlines.

Other recent publications

Other recent publications that could not be covered in time for this issue include the items listed below. contributions are always welcome for reviewing or summarizing newly published research.

Compiled by Tilman Bayer
Ephraim Chambers' Cyclopaedia (1728)


References

  1. ^ Hinnosaar, Marit; Hinnosaar, Toomas; Kummer, Michael; Slivko, Olga (2017-07-17). "Wikipedia Matters" (PDF): 22. {{cite journal}}: Cite journal requires |journal= (help)
  2. ^ Dang, Quang-Vinh; Ignat, Claudia-Lavinia (2017-08-23). An end-to-end learning solution for assessing the quality of Wikipedia articles. OpenSym 2017 - International Symposium on Open Collaboration. doi:10.1145/3125433.3125448.
  3. ^ Tsikerdekis, Michail. "Cumulative Experience and Recent Behavior and their Relation to Content Quality on Wikipedia". Interacting with Computers: 1–18. doi:10.1093/iwc/iwx010. Retrieved 2017-08-01. Closed access icon / author's pre-print
  4. ^ Tsuji, Keita (2017-05-26). Automatic Classification of Wikipedia Articles by Using Convolutional Neural Network (PDF). QQML 2017 - 9th International Conference on Qualitative and Quantitative Methods in Libraries.
  5. ^ Heracleous, Loizos; Gößwein, Julia; Beaudette, Philippe (2017-06-09). "Open strategy-making at the Wikimedia Foundation: A dialogic perspective = The Journal of Applied Behavioral Science": 0021886317712665. doi:10.1177/0021886317712665. ISSN 0021-8863. {{cite journal}}: Cite journal requires |journal= (help) Closed access icon author's preprint
  6. ^ Tinati, Ramine; Luczak-Roesch, Markus (2017). "Wikipedia: a complex social machine". ACM SIGWEB Newsletter: 1–10. ISSN 1931-1745. Closed access icon
  7. ^ Dolmaya, Julie McDonough (2017-04-03). "Expanding the sum of all human knowledge: Wikipedia, translation and linguistic justice". The Translator. 23 (2): 143–157. doi:10.1080/13556509.2017.1321519. ISSN 1355-6509. Closed access icon
  8. ^ Youngwhan Lee; Heuiju Chun (2017-04-03). "Nation image and its dynamic changes in Wikipedia". Asia Pacific Journal of Innovation and Entrepreneurship. 11 (1): 38–49. doi:10.1108/APJIE-04-2017-020. ISSN 2071-1395. Retrieved 2017-08-01.
  9. ^ Brendan Luyt (2017-05-25). ""A wound that has been festering since 2007": The Burma/Myanmar naming controversy and the problem of rarely challenged assumptions on Wikipedia". Journal of Documentation. 73 (4): 689–699. doi:10.1108/JD-09-2016-0109. ISSN 0022-0418. Closed access icon
  10. ^ Kwon, Okyu; Son, Woo-Sik; Jung, Woo-Sung (2016-11-01). "The double power law in human collaboration behavior: The case of Wikipedia". Physica A: Statistical Mechanics and its Applications. 461: 85–91. doi:10.1016/j.physa.2016.05.010. ISSN 0378-4371. Closed access icon
  11. ^ Martinelli, Luca (2016-03-02). "Wikidata: la soluzione wikimediana ai linked open data". AIB studi. 56 (1). ISSN 2239-6152.
  12. ^ Ameen, Saleem; Chung, Hyunsuk; Han, Soyeon Caren; Kang, Byeong Ho (2016-12-05). Byeong Ho Kang; Quan Bai (eds.). Open-domain question answering framework using Wikipedia = AI 2016: Advances in Artificial Intelligence. Australasian Joint Conference on Artificial Intelligence. Lecture Notes in Computer Science. Springer International Publishing. pp. 623–635. ISBN 9783319501260. Closed access icon
  13. ^ Kennedy, Krista (2016). Textual curation: Authorship, agency, and technology in Wikipedia and Chambers's Cyclopædia. The University of South Carolina Press. ISBN 978-1-61117-710-7. Closed access icon
Supplementary references:
  1. ^ Hinnosaar, Marit; Hinnosaar, Toomas; Kummer, Michael; Slivko, Olga (2015). Does Wikipedia matter? The effect of Wikipedia on tourist choices. ZEW Discussion Papers.
  2. ^ Kittur, Aniket; Kraut, Robert E. (2008). Harnessing the Wisdom of Crowds in Wikipedia : Quality Through Coordination. Computer-Supported Cooperative Work. doi:10.1145/1460563.1460572.
  3. ^ Arazy, Ofer; Nov, Oded (2010). Determinants of Wikipedia Quality : The Roles of Global and Local Contribution Inequality. Computer-Supported Cooperative Work. doi:10.1145/1718918.1718963.
S
In this issue
+ Add a comment

Discuss this story

These comments are automatically transcluded from this article's talk page. To follow comments, add the page to your watchlist. If your comment has not appeared here, you can try purging the cache.
Without having read the full paper myself yet: The review already says that there was indeed a control group. That said, keep in mind that this is a preprint and not yet peer-reviewed, as also already pointed out in the review, which furthermore states that "it's not entirely clear to this reviewer how the statistical significance was ascertained".
BTW I took a slightly closer look at the statistical methods of a somewhat similar discussion paper involving two of the same authors (Slivko and Kummer) in this review two issues ago: "How does unemployment affect reading and editing Wikipedia ? The impact of the Great Recession", and also briefly compared them to that of another author in the subsequent issue.
Regards, Tbayer (WMF) (talk) 05:35, 6 August 2017 (UTC)[reply]
Very illuminating, thanks! Regards, Tbayer (WMF) (talk) 05:35, 6 August 2017 (UTC)[reply]
I was pointed elsewhere to this explanation by the research leader: in this talkpage archive. You can read it, it's in English. Basically they admit that the translator added copyrighted text to the Dutch articles (which implies that they didn't do this on purpose, and that this only happened in Dutch), translated from tourism brochures. This text shows that the research leader was aware of this at least after the fact, and I'm quite surprised this is not disclosed in the publication. effeietsanders 12:46, 6 August 2017 (UTC)[reply]
And on the German Wikipedia (edits included [1], [2], [3], [4]), there was some milder pushback regarding e.g. lack of references, travel guide language, and unencyclopedic illustrations, to which one of the researchers responded, stating later that there had been an attempt to address the issues. See de:Benutzer_Diskussion:Oltau/Archiv/2014#Spanischkurs and de:Benutzer_Diskussion:Mefk81#Spanischkurs. Regards, Tbayer (WMF) (talk) 22:06, 6 August 2017 (UTC)[reply]
Oh, once you hit a wall on nlwiki, it is typically much harder to recover. I won't say it's one of the more welcoming projects. But the way this was conveniently left out of the article is striking - especially as it could signal impacts on other languages where the same method was used. effeietsanders 14:07, 7 August 2017 (UTC)[reply]
Well, the authors didn't in any way imply that it was the Dutch Wikipedia's fault. One of my hypotheses was that their translator to Dutch had done a significantly worse job than the others, and I see this is indeed the case. Thanks! --Nemo 06:44, 15 August 2017 (UTC)[reply]



       

The Signpost · written by many · served by Sinepost V0.9 · 🄯 CC-BY-SA 4.0