Plagiarism, as Wikipedia's article on the topic explains, "is the use or close imitation of the language and ideas of another author and representation of them as one's own original work." At best it is intellectual sloppiness and at worst outright theft.[1] As Robin Levin Penslar notes in Research Ethics: Cases and Materials, "The real penalty for plagiarism is the abhorrence of the community of scholars."[2] It can bring a community into disrepute. Wikipedia's editors should create their own articles, not adopt the work of others. But while this is an easy approach to recommend, plagiarism may not be as simple as it first seems—it is often committed inadvertently. The best way to prevent plagiarism is to understand clearly what it is, how to avoid it, and how to address it when it appears.
Wikipedia is not a primary source and contains no original research; therefore, everything that appears on Wikipedia should be rooted in a reliable source. The problem with plagiarism is not that it involves the use of other people's ideas, but rather that other people's words or ideas are misrepresented—specifically that they are presented as though they were "an editor's own original work". Even if contributors provide a citation for a sentence, it may still be plagiarism if they do not clearly indicate with quotation marks the duplication of the source's wording. Citations are universally understood as indicating a source for information, not as a license to copy the original wording.
There are three major ways to plagiarize:
Plagiarism is not the same as copyright infringement: material can be plagiarized from both copyrighted and public domain sources.[4] One report about a plagiarism scandal on Wikipedia claimed that "Wikipedia editors ... declared a handful [of the allegedly plagiarized articles] to be OK because copied passages came from the public domain."[5] If this was indeed the reaction of Wikipedia editors, they were mistaken. To clarify this, think of the famous opening line of Jane Austen's novel Pride and Prejudice (1813): "It is a truth universally acknowledged, that a single man in possession of a good fortune, must be in want of a wife."[6] The text of this novel, like the text of the 1911 Encyclopedia Britannica, is in the public domain. However, these are Austen's words and even though no one owns the copyright to them any longer, we need to acknowledge that the wording is hers. By inserting this sentence without quotation marks into an article, Wikipedia editors would be plagiarizing Austen.[7] Apart from the ethical need to credit her for her words, Wikipedia has a scholarly duty to inform its readers of the source of such a sentence, including the page number where the sentence can be found in the source.
Wikipedia policies say much about copyright violation, but far less about plagiarism. The guideline on the topic was written in 2008. Wikipedia's co-founder Jimmy Wales took a clear stand on the issue in 2005: "Let me say quite firmly that for me, the legal issues [surrounding plagiarism] are important, but far far far more important are the moral issues. We want to be able, all of us, to point at Wikipedia and say: we made it ourselves, fair and square."[8]
Not every fact contained in a Wikipedia article requires attribution. When a fact is "common knowledge"—that is, generally known—it is not plagiarism to repeat it, even if contributors learned it from a specific reference. For example, it is commonly known that Emily Dickinson published very few poems during her lifetime.[9] Generally, if information is mentioned in many sources, especially general reference sources, and easily found, it is considered common knowledge. It is also acceptable to reproduce non-creative lists of basic information, such as an alphabetical directory of actors appearing in a film. While Wikipedia's verifiability policy encourages the citing of such information, a failure to do so is not plagiarism.
Although common knowledge and non-creative lists of basic facts do not "belong" to a source and do not require attribution to avoid plagiarism, less commonly known information, opinions and creative text do. Likewise, the creative presentation even of common knowledge, belongs to its original author. Contributors can safely re-use the fact, but not the language unless it is a title, as for a job or a creative work, or utterly devoid of creativity, such as a common phrase. From a copyright standpoint, the level of creativity required to claim ownership is minimal. The United States Supreme Court has indicated that under US copyright law, which governs copyright matters on Wikipedia, "[t]he vast majority of works make the grade quite easily, as they possess some creative spark, 'no matter how crude, humble or obvious' it might be."[10] Similarly, most text will be creative enough that its replication will be plagiarism. Accordingly, while text such as "Dickinson was born on December 10, 1830" can be copied without quotation marks, care must be taken not to rely too much on the presumption that text is not creative. Further, one cannot copy an entire source in this way, claiming that it is "common knowledge" or uncreative text. In such cases, it can come down to the length of a string of exactly copied words; good editors get a feel for where it's starting to be dishonest not to attribute.
Less commonly known facts or interpretations of facts must be cited to avoid plagiarism, and creative text must either be quoted or properly revised.
To construct articles that read smoothly while still remaining faithful to their sources, it is essential to learn how to properly use other people's ideas and words. Wikipedia contributors need to know when to give credit, how to adapt source material so that it can be used in an article, and when to use quotations.
When editors want to use verbatim excerpts of a source, there is one simple way to avoid plagiarism: use direct quotations. The words from the source should be reproduced exactly as they appear in the original, enclosed within quotation marks, and identified by an inline citation after the quotation. However, direct quotations should not be overused. They run the risk of copyright infringement if the sources used are not free. Wikipedia's non-free content guidelines offer some guidance on when to use direct quotations and remind us that the "[e]xtensive quotation of copyrighted text is prohibited." But even when free sources are used, the overuse of direct quotation produces articles that are simply collections of quotations. The risk is a fragmentary effect in which the broader context of the quoted material is unclear, and readers are left to piece together the information, which often involves shifts in writing style.
Quotations should generally be used in the following situations:
Source text is usually adapted using a combination of paraphrase and summary. These two styles generally differ in their level of detail. A summary is more likely to be used for longer expanses of text and to cover only the major points in a passage, omitting or touching lightly on examples or definitions; a summary is generally expected to be considerably shorter than the original source. By contrast, paraphrasing is more likely to be closer to the original and may be nearly as long as or even longer than the source.
Adapting source text, whether by paraphrasing or summarizing, is a valuable skill, and contributors to Wikipedia need to be alert to the potential for inadvertent plagiarism. Many editors believe that by changing a few words here or there—or even by changing a great number of the words found in the original source—they have avoided plagiarism. This is not necessarily the case. Nor does the mere rearrangement of clauses, sentences, or paragraphs avoid the problem.
In this example, Wikipedia's article text is an attempt at paraphrasing the source. However, almost all of the original word choice, word order and sentence structure is retained.
Analysis:
In terms of both plagiarism and copyright, the author of a text not only "owns" the precise, creative language he or she uses, but less tangible creative features of presentation, which may incorporate the structure of the piece and the choice of facts. In terms of plagiarism, but not copyright, the author also "owns" the facts or his or her interpretation of them, unless these are, as mentioned above, common knowledge. Revising to avoid plagiarism means completely restructuring a source in word choice and arrangement while giving due credit for the ideas and information taken from it.
In this paraphrase, the language and structure of the passage has been significantly altered, making it an original expression of the ideas. The ideas have, of course, been properly credited.
Source:
Paraphrase:
This adaptation, from the featured article about Thomas Eakins' The Swimming Hole, displays attribution of opinion and uses a combination of paraphrase and quotation:
Unfortunately, there is no hard and fast rule for how much revision is necessary to avoid plagiarizing. In evaluating copyright concerns, the United States courts adopt a "substantial similarity" test that compares the pattern and sequence of two works, finding such similarity where "the ordinary observer [reading two works], unless he set out to detect the disparities, would be disposed to overlook them, and regard their aesthetic appeal as the same."[17] Even if all of the language is revised, a court may find copyright infringement under the doctrine of "comprehensive non-literal similarity" if "the pattern or sequence of the two works is similar".[18] Likewise, plagiarism may exist if readers comparing the two works would come away with a sense that one is copied from or too heavily based on another.
Editors should always compare their final drafts with the sources they have used to make sure that they have not accidentally come too close in language and structure or failed to attribute when necessary.
One way editors can minimize the tendency to reuse text is to not copy and paste text into their working drafts. Instead, editors should assemble and organize their notes, excerpts, and other source materials by topic. This can be done either in hard copy or by using an electronic filing system. Editors should then read and absorb what the sources say and proceed to writing a draft version, in their own words, of each topic. These drafts can be assembled according to the editor's own organizational schema. There are a number of ways to organize material; editors should not slavishly follow a source's structure, either in overall organization, or in the composition and arrangement of sentences and paragraphs within each section. This method reduces the temptation (and makes it harder) to adopt verbatim language and organization from the sources.
At the same time, when taking notes from a source for their own use, editors may find it useful to take them verbatim, with quotation marks, if they will not have access to that source as they are writing their final draft. If a different language is used in note-taking, an editor may find him or herself accidentally restoring some of the author's original words when constructing a draft. Being able to see at a glance exactly how the source was written can help avoid this.
Use multiple sources, if possible. Editors may find it more difficult to avoid following that text too closely if they rely on only one source, as they will necessarily be limited to those details selected by the author of that original source. It is not impossible to revise and reorganize a single source sufficiently to avoid plagiarism or copyright infringement, but it is more difficult.
Editors should be careful not to add plagiarized material to Wikipedia, and can help to protect the integrity of the project by spotting plagiarism and helping to correct it. When large sections of a source are copied word-for-word into an article, it is often easy to spot and repair. The use of ideas or uncommon facts without credit, possibly the most common form of plagiarism, can be repaired by sourcing. Detecting and dealing with subtler forms of plagiarism may be more challenging, but is usually possible.
Red flags for plagiarism include:
If you suspect plagiarism, you may wish to start by checking the article's history. If the article has a multi-authored feel but appears to be largely single-authored, there could be reason for concern, as this may suggest a contributor has borrowed too heavily from the diction of multiple sources. It may be worth checking the contribution history of an editor across a number of articles, to see if there is a discernible authorial voice or if there is a pattern of such inconsistency. There may be a history of such issues on the editor's talk page.
Another good starting point is to review the article's sources. Particularly when plagiarism results from misunderstanding—rather than intent to deceive—a contributor may clearly identify the sources from which s/he has plagiarized, and even link to them. If the source is in another language, for instance, the contributor may be under the mistaken belief that the act of translation is a sufficient revision to eliminate concerns of plagiarism. On the contrary, whether or not the work is free, the obligation remains to give credit to authors of foreign language texts for their creative expression, information and ideas, and, if the work is unfree, direct translation is likely to be a copyright violation as well.[19][20] Concerned readers can also use search engines and automated plagiarism detection. When searching manually, it is helpful to isolate small sections of text from an article. However, some results found this way may be from mirrors and forks of Wikipedia itself, particularly if the article is not newly created.
There are templates such as {{Copypaste}} or {{Close paraphrase}} that are added to the top of a suspect section or article and may draw attention to the problem; concerns might be noted at an appropriate WikiProject or forum. Just as Wikipedia currently has no clear guideline or policy on plagiarism, it has no clear forum for addressing plagiarism concerns. However, Wikipedia:WikiProject Copyright Cleanup stands to assist where plagiarism may co-exist with copyright infringement, and, even where it doesn't, project members may be able to assist with plagiarism.
This article or section appears to have been copied and pasted from a source, possibly in violation of a copyright. Please edit this article to remove any non-free copyrighted content and attribute free content correctly. Follow the Guide to layout and the Manual of Style. Remove this template after editing. |
If an article seems to follow the language and structure of another work too closely, first consider whether it is a matter of copyright infringement or plagiarism. If the source is not free and the text may represent a legal concern for Wikipedia, follow the procedures set out at Wikipedia's copyright violations policy. If the source is free, steps should be taken to remedy plagiarism. Wikipedia's proposed guideline on plagiarism suggests politely discussing concerns with the contributor. Further steps may need to be taken to address contributors who persist in plagiarism after being made aware of the problem, through Requests for comment or—if the contributor proves disruptive—through a report at the administrator's incidents noticeboard. The plagiarism will also need to be repaired as soon as possible. If it can be attributed, revised or turned into a usable quotation, it should be. If the editor who discovers the problem is unable to repair it or uncertain of how it should be addressed, it should be brought to the attention of other contributors.
As the main page says, Wikipedia is "the free encyclopedia that anyone can edit". Anyone can, and should, repair plagiarism.
To avoid charges of plagiarism, authors of scholarly works ... always give proper credit to the sources of their ideas and facts, as well as any words they borrow. This is so even if the work borrowed from is in the public domain.
A "derivative work" is a work based upon one or more preexisting works, such as a translation, musical arrangement, dramatization, fictionalization, motion picture version, sound recording, art reproduction, abridgment, condensation, or any other form in which a work may be recast, transformed, or adapted.... Subject to sections 107 through 122, the owner of copyright under this title has the exclusive rights to do and to authorize any of the following:...(2) to prepare derivative works based upon the copyrighted work....
... large-scale cribbing of foreign-language texts might occur during the process of translation.... The practice persists even though the most flagrant violators are eventually accused and dismissed from their posts.
Discuss this story