The Signpost

Analysis

Uncovering scientific plagiarism

By Debora Weber-Wulff and Graf Isolan
Debora Weber-Wulff (User:WiseWoman) is a professor at Hochschule für Technik und Wirtschaft in Berlin. Both authors are active on the VroniPlag Wiki, WiseWoman on the German Wikipedia.

Have you ever found yourself sitting with some text, thinking: "Where have I read this before?" Wikipedians face this question every day, when they have to deal with plagiarized content. But plagiarism does not just affect the quality and credibility of articles; nor is it just an issue for university professors and school-teachers marking their students' assignments. It is found at all levels of university research, right up to the writing of scientific papers and doctoral theses.

Over the past year and a half, the German academic community has been rocked by continual plagiarism scandals. Two wiki-based groups have been instrumental in uncovering "text parallels" in doctoral theses by jurists, scientists, industry managers, and politicians. The latest plagiarism to have been exposed was a textbook warning about taking material from the German Wikipedia – while itself plagiarizing Wikipedia in at least 18 places.

Karl-Theodor zu Guttenberg, German minister of defence 2009–11. He was derided at the time as "Baron cut and paste", and "zu Googleberg".
On 16 February 2011 the daily newspaper Süddeutsche Zeitung published the suspicions of a law professor from Bremen, Germany, that the doctoral thesis of the minister of defence, Karl-Theodor zu Guttenberg, contained extensive plagiarism; zu Guttenberg called the accusations "absurd", insisting he would fix the odd erroneous footnote in a second edition.

This angered a number of scientists who had found blatant plagiarism just by googling pieces of text from the thesis. They tried documenting the plagiarism collaboratively using Google Docs, but the platform could not support the more than one hundred people who wanted to edit the document simultaneously. Some computer scientists in the group decided that a wiki would solve the problem, so they moved to the Wikia platform, founded by Jimmy Wales in 2004.

As one of the initiators, User:PlagDoc, describes in an essay recently co-authored with a journalist and published in German and in English, the choice of a wiki enabled an investigative crowdsourcing effort of tremendous proportions: GuttenPlag Wiki. When the dust settled, zu Guttenberg had his doctorate revoked (plagiarism was documented on 94% of the pages and in 63% of the lines of the submitted thesis) and stepped down as a government minister, moving to the US to escape the heat. The GuttenPlag Wiki received the Grimme Online Award in the "Special" category in 2011; a representative of Wikia accepted the prize on behalf of the more than 20,000 occasional and daily editors on the site (press release).

It didn't stop there. In April 2011, large amounts of plagiarism were found in a PhD thesis by the daughter of a high-ranking former Bavarian politician. Those interested in investigating this decided to set up a new wiki for the documentation, VroniPlag Wiki (website). In quick succession, more and more plagiarised theses were documented on the same platform, because people did not want to have to set up a new wiki for each case. Although far fewer contributors are working on this wiki than on the GuttenPlag Wiki, they have continually documented plagiarism since the site's inception.

The wiki has an anonymous drop-box where people suggest theses that should be scrutinised, but many tips come in by email, either to the anonymous email addresses set up for the purpose, or to the few people who are reachable by their real name. People come and go, often working intensively on a particular case. Some have stayed and been active on all of the new cases. The group has coalesced into a team of around ten administrators and a handful of sympathetic onlookers, along with the obligatory trolls. A workflow has been set up for collaboratively and transparently documenting plagiarism, announcing the name of the author only when it's clear that a document contains a significant number of text parallels.

Currently 26 cases are documented on the site. Of these, eight doctorates have been rescinded (with several lawsuits pending); three have been declared to be within the bounds of acceptability by the awarding universities, although those institutions have provided no explanations for the substantial numbers of text parallels. The extensive documentation has demonstrated that plagiarism is not just an occasional incident, but something that the German university system must now get serious about. Case 25, unusually not a thesis but a textbook for law students on scientific methods in the age of the Internet, would be humorous if it were not so serious: not only was the chapter on plagiarism itself plagiarized, but the book warned of the dire consequences of taking material from Wikipedia while lifting at least 18 passages from it. The book was promptly withdrawn by the publisher after it was outed on VroniPlag Wiki.

Massive text parallels have been documented on VroniPlag Wiki in two dissertations from Poland and Denmark, suggesting that plagiarism in university research degrees is widespread. The Danish case is also interesting, as the plagiarist is a Pakistani citizen who published many papers as well as his dissertation on "terrorist" networks, partly by taking text blocks, often word-for-word, from older papers about criminal networks and just replacing the word "criminal" with "terrorist". Other cases not on VroniPlag Wiki have involved the Romanian minister of education, the Romanian prime minister, the Hungarian president, an official in Thailand, and a parliamentarian in South Korea. Documentation is also underway in Russia concerning the dissertation of the country's new education minister.

In Germany, many universities seem unable to come to terms with the ethics of Internet-based research and publishing methods. The administrations have tended to react to revelations of plagiarism among their graduates in a way that might be labeled Kafkaesque; and there is no real in-university support for plagiarism education or detection, no training for tutors or teachers, and no procedure for dealing with lower levels of plagiarism.

The work at these wikis shows how urgent it is to educate people about plagiarism and how to avoid it. Scientific online publishing would also contribute to reducing the amount of plagiarism: if it can be indexed by a search engine, it can more easily be found by software or a simple search on three to five terms from a paragraph.
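The "simple search on three to five terms from a paragraph" can be sketched in a few lines of Python. This is only an illustration, not a method used by either wiki: the function names `query_terms` and `build_query`, the tiny stopword list, and the heuristic that rarer and longer words are more distinctive are all assumptions made for the example.

```python
import re
from collections import Counter

# Tiny illustrative stopword list (an assumption; a real one would be far longer).
STOPWORDS = {
    "the", "a", "an", "and", "or", "of", "to", "in", "on", "is", "are",
    "was", "were", "it", "that", "this", "for", "with", "as", "by", "be",
    "can", "at", "from",
}


def query_terms(paragraph, k=5):
    """Pick up to k candidate search terms from a paragraph.

    Heuristic (an assumption, not an established method): words that occur
    rarely in the paragraph and are longer are treated as more distinctive.
    Ties are broken alphabetically so the result is deterministic.
    """
    words = re.findall(r"[^\W\d_]+", paragraph.lower())
    candidates = [w for w in words if w not in STOPWORDS and len(w) > 3]
    counts = Counter(candidates)
    ranked = sorted(counts, key=lambda w: (counts[w], -len(w), w))
    return ranked[:k]


def build_query(paragraph, k=5):
    """Join the selected terms into a string that can be pasted into a search engine."""
    return " ".join(query_terms(paragraph, k))
```

Pasting the result of `build_query(paragraph, k=4)` into a search engine then mimics the manual procedure; whether the source is actually found still depends on it being indexed, as the paragraph above notes.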

GuttenPlag Wiki and VroniPlag Wiki are now taken seriously and have contributed to accelerating the otherwise glacial progress in this area in the (German) university system. The writing is on the wall now, with public reaction on-side, although there are significant pockets of resistance; for example, an open letter penned by eight high-ranking former heads of German universities and research organizations and published on 14 June in the Süddeutsche Zeitung requested that this "undignified spectacle" [of published evidence of plagiarism] cease immediately and that the universities be left to their own devices to carry on as before. Public discussion like this about scientific matters does not happen often in Germany.

The experiences of the past year and a half have shown that plagiarism is a widespread phenomenon, and not only in Germany. It affects universities large and small, in many fields of study and at all levels. Plagiarists may think they are being smart in reusing electronically available materials for their own texts, but they forget that there are people well versed in online research instruments and scientific texts who are no longer willing to let others achieve scientific merit by illegitimate means. Using wiki technology to fight plagiarism collaboratively, these people have joined forces and become major new players in the scientific community.


Discuss this story

The Internet shortens distances and raises awareness, but in some cases it also gives students and researchers possibilities that lower the quality of education and research (plagiarism, copy & paste, …). Plagiarism is “at home” everywhere. I would like to inform you about the positive effects of the nationwide Central Repository of Theses and Dissertations and the nationwide Plagiarism Detection System in Slovakia (we use the name ANTIPLAG for both systems together).

All Slovak higher education institutions have been obliged to use ANTIPLAG since 2010. Since 1 September 2011 the central repository (CR) has been open to the public (open access). In two months the CR will hold more than 300,000 theses and dissertations.

The independent international research project “Impact of Policies for Plagiarism in Higher Education Across Europe” (IPPHEAE, EU funded, 2010-2013, Project Lead Partner: Coventry University, United Kingdom) carried out a survey in all EU countries and prepared country reports for 27 EU countries (http://ippheae.eu/project-results). The report “Plagiarism Policies in Slovakia” says:

“There were some notable differences between the Slovak surveys and the EU average. Almost all Slovak students (99%!) become aware of plagiarism before or during their bachelor studies. The EU average shows that 20% of students become aware of plagiarism during their masters/PhD degree or are still not sure about it.”

" ... Slovak students are the most aware of plagiarism among all EU countries"

"The most outstanding example of good practice is definitely the existence of national repository of theses. As it is run centrally and universities are obliged to upload their theses, students from all institutions have theoretically the same conditions. The other aspect is that the software tool provides just a protocol for matching with other sources. The decision about whether a given case is plagiarism or not lies with teachers and/or the examination committee and these may not always follow the same procedures."

"Compared to other countries, Slovakia should be praised for its achievements. And it already was: The European Commission has awarded the Slovak Centre of Scientific and Technical Information the European Prize for Innovation in Public Administration." More: http://ec.europa.eu/research/innovation-union/index_en.cfm?section=admin-innovators

"The responses from Slovak students demonstrated the highest level of understanding about plagiarism within the whole Europe. Their unwillingness (in comparison with other countries) to receive more training on plagiarism is therefore understandable. The research team of the IPPHEAE project would also like to praise Slovakia for existence of national repository of theses and built-in plagiarism detection tools."

If you are interested, please contact us; your questions and feedback are welcome. You can also read these papers:

Barrier to thriving plagiarism. Conference paper - The 5th International Plagiarism Conference, 2012. Available at: http://archive.plagiarismadvice.org/documents/conference2012/finalpapers/Kravjar_fullpaper.pdf

Strategies and responses to plagiarism in Slovakia. PLAGIARISM ACROSS EUROPE AND BEYOND - Conference Proceedings, 2013, pp. 201-215 Available at: http://ippheae.pefka.mendelu.cz/files/proceedings.pdf.

The Occurrence of the Terms akademická etika and akademická integrita in Texts on the Internet and in the Media, 2013. Available at: https://www.vedatechnika.sk/Blog/Lists/Posts/Post.aspx?ID=89 or http://ncpvat.cvtisr.sk/buxus/generate_page.php?page_id=938.


  • Yes, plagiarism is a huge and, at the same time, global problem. It is interesting that it is not being tackled at the country level. The only exception, in my opinion, is Slovakia, which has unique experience in fighting plagiarism at higher education institutions on a national level. Two cooperating systems have been in operation since May 2010: the Nationwide Plagiarism Detection System and the Nationwide Repository of Theses and Dissertations. You can find more details here:[[1]].

Kravjar (talk) 09:16, 6 November 2012 (UTC)

  • Great article about a great solution to a huge problem. It should be clear that the problem of plagiarism starts at the top. University administrators need to take action to prevent plagiarism at all levels, not fight against the exposure of plagiarism as an "undignified spectacle." Plagiarism likely occurs at all university levels in all countries, simply because administrators do very little to stop it. No one country should be singled out, but those countries where a doctorate is often used as a resume-polisher for politicians are particularly at risk. Based on my experiences in Russia and Eastern Europe in general, there could be huge problems there. I also expect that there are similar problems in the UK and the US. No country is immune, as far as I can tell. Smallbones (talk) 14:16, 3 July 2012 (UTC)
    • I wonder whether the next move is to boost participation at those wiki sites, perhaps both from the ranks of the movement and by attracting outsiders in. More participants means more can be scrutinised, and a greater chance the process will eventually force the universities and supervisors/instructors to take the matter more seriously. In my view, supervisors need a good boot up the ... on this. It's the cost of doing business, and they need to be professionally thorough in conveying the ethics to their charges. Nothing less will do. Tony (talk) 04:34, 4 July 2012 (UTC)
      • I have not seen any, but I would love to be directed toward, or have someone create, an automated plagiarism-checking tool for Wikipedia, and perhaps it could be used beyond Wikipedia. Do we have any such tools? Judgesurreal777 (talk) 00:53, 7 July 2012 (UTC)
      • I have been testing plagiarism detection software since 2004 [2], but the software does not live up to our expectations. If you do have two texts you want to compare, VroniPlag Wiki has implemented Dick Grune's sim_text in JavaScript for coloring similarities in texts: [3]. You put one text in the left box, one in the right and press "Texte vergleichen!". --WiseWoman (talk) 15:36, 7 July 2012 (UTC)
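The kind of two-text comparison described in the last comment can be approximated with shared word n-grams. The Python sketch below is not sim_text and not the VroniPlag Wiki tool; the function names, the default run length of four words, and the optional `aliases` mapping are assumptions made for illustration. The `aliases` mapping shows how a one-word swap such as "criminal" to "terrorist", as in the Danish case mentioned in the article, could be neutralised before comparing.

```python
import re


def tokens(text, aliases=None):
    """Lower-case word tokens; `aliases` maps a word to a canonical replacement."""
    aliases = aliases or {}
    return [aliases.get(w, w) for w in re.findall(r"\w+", text.lower())]


def shared_runs(left, right, n=4, aliases=None):
    """Return the word n-grams that occur in both texts.

    Results are listed in order of first appearance in `left`,
    each n-gram reported once.
    """
    lt = tokens(left, aliases)
    rt = tokens(right, aliases)
    right_grams = {tuple(rt[i:i + n]) for i in range(len(rt) - n + 1)}
    seen = set()
    runs = []
    for i in range(len(lt) - n + 1):
        gram = tuple(lt[i:i + n])
        if gram in right_grams and gram not in seen:
            seen.add(gram)
            runs.append(" ".join(gram))
    return runs
```

A long list of shared runs is only a hint that two passages deserve a closer look; deciding whether it is plagiarism remains a human judgement, as the comments above stress.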



       

The Signpost · written by many · served by Sinepost V0.9 · 🄯 CC-BY-SA 4.0