Under the heading rerum causas cognescere, the first ever Wikidata conference got under way in the Tagesspiegel building with two keynotes. One was on YAGO, a knowledge base conceived ten years ago featuring automatic compilation from Wikipedia. The other keynote was from manager Lydia Pintscher, on the "state of the data". Interesting rumours flourished around Magnus Manske's mix'n'match tool and its 600+ datasets, mostly in digital humanities, and its adoption by the WMF. One also heard of an imminent Wikibase incubator site. Announcements came in talks: structured data on Wikimedia Commons is scheduled to make substantive progress by 2019. The lexeme development effort on Wikidata is not expected to make the Wiktionary sites redundant, but may facilitate future automated compilation of dictionaries.
And so it went, with five strands of talks and workshops running until 11 pm on Saturday. Wikidata applies to GLAM work via metadata. It may be used in education, raises issues such as author disambiguation, and lends itself to different types of graphical display and reuse. Many millions of SPARQL queries are run on the site every day. Over the summer a large open science bibliography has come into existence there.
Wikidata's fifth birthday party on the Sunday brought WikidataCon to a close. See a dozen and more reports by other hands. CM
Two incidents recently occurred by individuals with a special position of trust in the English Wikipedia. Both involved paid editing; one resulted in a community ban, and the other is undergoing discussion at Arbcom and noticeboards with at least three administrators asking for revocation or resignation of the sysop bit.
The first case involves KDS4444 who was granted OTRS access as part of the volunteer response team in September 2015. OTRS is the ticketing system across all Wikimedia projects and is used, among other things, by companies and individuals who ask for changes to be made in their articles without violating the conflict of interest guideline. It was alleged at the Administrators' noticeboard (AN) that KDS4444 had used his OTRS access to identify candidates for paid editing work, then email them with offers in the 300 dollar range to make edits on their articles. KDS4444 had his OTRS access revoked on 21 October and was community banned from English Wikipedia on November 17. At the AN discussion some editors wanted to use less drastic remedies on ENWP for an editor who had contributed over 150 articles. Others harshly criticized his actions "actively soliciting for paid work" in a position of trust as "completely unconscionable" and "[in] defiance of community norms".
The second case involves Mister Wiki, a commercial Wikipedia editing firm, and Salvidrim! who has been an administrator on English Wikipedia since January 2013. The Conflict of interest Noticeboard (COIN) case (permlink) was followed by an Arbcom filing (permlink). Salvidrim! and others (who are not administrators) are declared paid editors for Mister Wiki. Salvidrim! had created an alternate account for paid editing, Salvidrim! (paid) and declared that they were working for Mister Wiki. But problematic interactions between the Mister Wiki team members were identified at COIN, including approving one anothers' drafts at Articles for creation (AfC). An administrator (TonyBallioni) reviewing the activity stated that Salvidrim! "as a sysop actively asked an AfC reviewer to move an article you had been paid to edit out of draft space" and that "breach of the trust we place in administrators and is why I think you should resign as a sysop"; another administrator (Doc James) said "These sorts of activities have a significant potential to harm our shared brand" and joined at least one other administrator (JzG) in a request for either de-sysopping or a new Requests for adminship. The request for an Arbcom case was filed on November 21, after Salvidrim! declared he would not voluntarily give up the bit "today, or tomorrow, or this week".
The last well known sysop abuse of position incident was the 2015 Wifione Arbcom case which resulted in de-sysop and ban (see previous Signpost coverage).
A discussion "Should Wikipedians be allowed to use community granted tools in exchange for money?" was begun by Doc James at Village Pump before either of these two incidents was brought to ENWP noticeboards. B
Following a case that closed on 16 October 2017, Arthur Rubin was desysopped "for repeatedly not meeting the community expectations and responsibilities of administrators as outlined in WP:ADMINACCT". Specific incidents included removing permissions from a user during an ANI discussion – which reversed another admin's administrative action without prior notification or discussion – and failing to provide evidence for claims made against another user.
A new search interface is now available as a beta feature on MediaWiki.org. The interface enhances the Special:Search page experience, by changing the way namespaces and other advanced parameters are specified. This makes it easier for users to discover and utilise existing search features, such as intitle:.
When you click on the arrow in the advanced parameters section, it expands to give you a form to set parameters for advanced searches. Explanations on each parameter can be found next to each input field. Namespaces can be selected from the "Search in", where you can either type them in or select from a dropdown list.
The idea came from a workshop series on advanced search held by WMDE's software development department, as part of their Technical Wishes Project. The beta feature was made available to test wikis and Mediawiki.org on 21 November 2017, and will also be available on Arabic and German Wikipedias from 29 November.
Further information is available on MediaWiki.org and Meta-wiki.
The annual Community Tech wishlist survey is underway for 2017. Since early November, 220 proposals in 15 categories have been submitted from 284 contributors. The proposal phase has concluded; voting begins on 27 November at meta:2017 Community Wishlist Survey.
So far this year, the team has completed five of the top ten wishes: Rewrite XTools (#5), Wikitext editor syntax highlighting (#6), Warning on unsuccessful login attempts (#7), Fix Mr. Z-bot’s popular pages bot (#9), and User rights expiration (#10). Other top wishes, as well as anti-harassment tools and projects for smaller groups, are being worked on or investigated.
New user scripts to customise your Wikipedia experience
Latest tech news from the Wikimedia technical community: 2017 #43 – #47. Please tell other users about these changes. Not all changes will affect you. Translations are available on Meta.
ccnorm_contains_any
when you create an abuse filter. This can be used to look for multiple words or phrases within a string. It will find words where some characters have been replaced. You can read more in the documentation. [8]get_matches
. You can use it to store matches from regular expressions – one of each capturing group. You can read more in Phabricator.https://ru.wikipedia.org/wiki/Википедия#.D0.98.D1.81.D1.82.D0.BE.D1.80.D0.B8.D1.8F
instead of https://ru.wikipedia.org/wiki/Википедия#История
. This will soon be fixed. Old links will still work. [10][11]http://-{zh-cn:foo.com; zh-hk:bar.com; zh-tw:baz.com}-
must be replaced. You will have to write -{zh-cn: http://foo.com ; zh-hk: http://bar.com ; zh-tw:http://baz.com }-
instead. This only affects languages with Language Converter enabled. Examples of such languages are Chinese and Serbian. This will happen next week. [13][14]importScript( 'User:Eizzen/SkinSwitcher.js' ); // Backlink: User:Eizzen/SkinSwitcher.js
importScript( 'User:Eizzen/AutoPurge.js' ); // Backlink: User:Eizzen/AutoPurge.js
importScript( 'User:The Voidwalker/alwaysEditSectionLink.js' ); // Backlink: User:The Voidwalker/alwaysEditSectionLink.js
importScript( 'User:Mvolz/displayContributions.js' ); // Backlink: User:Mvolz/displayContributions.js
importScript( 'User:Anne drew Andrew and Drew/PageDetails.js' ); // Backlink: User:Anne drew Andrew and Drew/PageDetails.js
importScript( 'User:Anne drew Andrew and Drew/SetupAutoArchive.js' ); // Backlink: User:Anne drew Andrew and Drew/SetupAutoArchive.js
importScript( 'User:Kku/Scripts/BacklinkTitle.js' ); // Backlink: User:Kku/Scripts/BacklinkTitle.js
importScript( 'User:Eizzen/PageCreator.js' ); // Backlink: User:Eizzen/PageCreator.js
importScript( 'User:Eizzen/LastEditor.js' ); // Backlink: User:Eizzen/LastEditor.js
importScript( 'User:Mr. Stradivarius/gadgets/DiffOnly.js' ); // Backlink: User:Mr. Stradivarius/gadgets/DiffOnly.js
importScript( 'User:The Voidwalker/histFilter.js' ); // Backlink: User:The Voidwalker/histFilter.js
importScript( 'User:Evad37/rater.js' ); // Backlink: User:Evad37/rater.js
With the promotion of S. O. Davies to a featured article (FA) in 2016, Brianboulton became Wikipedia's third featured article centurion. Despite having semi-retired from content creation in recent years, he remains an active user, and still produces the occasional FA.
Thank you so much for your time. I look up to you as someone to aspire to be, and I'm sure the Wikipedia community feels similarly.
Hey folks!
I've been working on some cool technology to help WikiProjects. I've learned that many WikiProject organizers struggle to get members and manage the tasks of maintaining a WikiProject, with the result that lots of WikiProjects become inactive. I made a bot that scans RecentChanges for people who have been working on or near a WikiProject's tagged articles and who are likely to be productive members of the project; it then recommends these editors to project organizers who can choose whether to invite them. So far, six WikiProjects have signed up and have been using the bot. Here's what some of them have to say:
I'm now ready to make the tool available to more WikiProject organizers. If you're interested and want to try it for yourself, contact me on my talk page and I'll help you get set up!
In the meantime, keep reading to see an example of what the recommendations look like and read more quotes from project organizers who've already tried the tool.
Username | Why we recommend this editor | First Edit Date | Total Edits in ENWP | Recent Activity Level | Invite |
---|---|---|---|---|---|
Deadpool | Deadpool just joined Wikipedia and made the first edit on an article within the scope of your project, Article:Agent X. | 2017-9-14 | 1 | New Editor | invite |
Spider-Man | Spider-Man just joined Wikipedia and made the first edit on an article within the scope of your project, Article:Marvel Team-Up. | 2017-9-11 | 4 | New Editor | invite |
Hulk | Hulk made 77 out of their most recent 500 edits to articles within the scope of your project. | 2010-5-24 | 606 | Very Active | invite |
Wolverine | Wolverine made 49 out of their most recent 500 edits to articles within the scope of your project. | 2006-1-23 | 2132 | Active | invite |
Iron Man | Iron Man edited articles similar to articles your project members edited. For example, Iron Man and your project member Superman edited 9 of the same articles in their most recent 500 edits. | 2017-7-26 | 191 | Active | invite |
Thor | Thor edited articles similar to articles your project members edited. For example, Thor and your project member Captain America edited 8 of the same articles in their most recent 500 edits. | 2009-5-19 | 28552 | Very Active | invite |
More thoughts from our interviews with some Wikipedians who already have used our system:
How important it is for your project to recruit new members?
What is your experience using our tool?
What would you say to other WikiProject organizers who are considering signing up to get recommendations?
Some other comments from Wikipedians:
Again, if you like what you see and you want to test it out, contact me on my talk page! Bobo.03 (talk) 20:57, 4 October 2017 (UTC)
Media reported on some errors in, or "hacking" of, Wikipedia, however short term. Maybe this is a good sign of the work's importance as a global media institution but maybe not a good sign of assumption of correctness for realtime events like a coup d'état or a televised beauty contest.
Vandalism of Phineas Gage was labeled "hacking" by the International Business Times and attributed to GNAA trolls in a campaign to smear BuzzFeed reporter Joseph Bernstein for reporting on the alt-right. (Jain, Rishabh (15 November 2017). "Wikipedia Hack Targets BuzzFeed Reporter Who Exposed Hedge Fund Billionaire's Alt-Right Connection". International Business Times.) The 2017 Zimbabwean coup d'état left the Wikipedia biography of Robert Mugabe in disarray, indicating for a period that he was no longer president, when (according to the current article revision) he was only under house arrest. (Austin, Winifred (16 November 2017). "Mugabe's Wikipedia profile updated to former president". Daily Post. Nigeria.) The International Business Times scorned Wikipedians again, holding up a revision of Miss World 2017 that listed the wrong winner – before results were actually announced, and which was reverted after seven minutes – as proof that "Wikipedia page [sic] can be edited by anyone and you cannot trust the platform." (Sharma, Dishya (18 November 2017). "Miss World 2017 winner: Miss Indonesia Achintya Holte Nilsen is the winner? Wikipedia says so!". International Business Times. India.)
And in the feature-not-a-bug category: According to a Washington Times op-ed by Robert H. Knight, "Wikipedia is Britannica — but without factual safeguards" because his edits to American Civil Rights Union with the self-identified account Truthwins09 were reverted and COI identified as a senior fellow employed by the group. Interestingly, for readers interested in finding out more about American Civil Rights Union, the Washington Times displays a synopsis of the Wikipedia article (with credit to Wikipedia). (Knight, Robert (29 October 2017). "'Whackapedia' and its error fest". The Washington Times.)
Wired magazine published another prediction of Wikipedia's end in "How Social Media Endangers Knowledge". The good news: "Trump's rise ... kicked in a significant flow of funds that has stabilized the nonprofit's balance sheet." The bad news: too many people are Amusing Ourselves to Death and not enough of us turning away from television-like media streams, reflected even in popular Wikipedia content which "tend[s] to revolve around television series or their cast".
Concerns about Wikipedia's accuracy and relevance go back to its very beginning; see for instance the December 2006 Signpost analysis of the previous year's Wikipedia Seigenthaler biography incident.
"People are strange when you're in Stranger Things ... " Netflix's acclaimed sci-fi horror series tops the list for a second week, and brings in eight of its cast members to the top 25 articles for this week, more entries than It back in September and slightly less than the deaths of Carrie Fisher and Debbie Reynolds last December.
"Faces look happy when you have won..." Also heavily on the list is baseball, as the Houston Astros (#8) win their first World Series and bring along three of their players and the wife of one of them.
"Men seem wicked when you're unwanted..." Right behind Stranger Things is another sexual scandal in Hollywood (Harvey Weinstein wasn't enough!), with Kevin Spacey (#2) accused of sexual advances on a 14-year-old fellow actor (#3), leading many other men to reveal the acclaimed performer had done the same with them.
"Streets are uneven when you're down" The rest of the list has four holdovers (two Hollywood movies, a Netflix series, and the week's holiday), the Day of the Dead along with the deceased themselves (the yearly list and a Korean actor), and Reddit discovering how to literally make jewellery out of waste (#9).
For the week of October 29 to November 4, the most popular articles on Wikipedia, as determined from the WP:5000 report were:
Rank | Article | Class | Views | Image | Notes |
---|---|---|---|---|---|
1 | Stranger Things | 2,431,740 | The second season of Netflix's critically acclaimed sci-fi horror series was released on October 27, returning viewers to a small town in 1980s Indiana. One year after an interdimensional monster kidnapped and/or killed a few people before being destroyed by a psychokinetic girl, the characters try to return their lives to normal. But no, it will be a scary Halloween for them. | ||
2 | Kevin Spacey | 2,254,711 | Kevin Spacey is a great actor, but it's sad to see that off the screen he was capable of being a sleazeball on par with Keyser Söze, Lex Luthor, and Frank Underwood. After several men accused Spacey of sexual harrassment/assault, Hollywood turned on him, and even worse, Spacey didn't realize outing himself as a gay man as everyone is calling you a pervert is insensitive. It all started with ... | ||
3 | Anthony Rapp | 1,264,121 | ... a reveal by Anthony Rapp - an actor best known for the play Rent who's currently playing an officer in Star Trek: Discovery – that Spacey had tried to molest him at the age of 14. | ||
4 | Thor: Ragnarok | 1,135,260 | The 17th film in the Marvel Cinematic Universe arrived in the remaining international markets this week, including the United States, where it opened atop the box office with a hefty $121 million weekend. Heavy on the comedy and regarded as the best of the three Thor films, Ragnarok was directed by New Zealander Taika Waititi and adding the likes of Jeff Goldblum, Cate Blanchett and Tessa Thompson to previous Thor mainstays like Chris Hemsworth, Tom Hiddleston and Idris Elba, as well as roping in other MCU heroes like Mark Ruffalo's Hulk and Benedict Cumberbatch's Doctor Strange. | ||
5 | Millie Bobby Brown | 1,057,845 | Portrayer of the most memorable character of Stranger Things (#1), a girl with psychokinesis known simply as Eleven. And somehow Brown been named among the sexiest people on TV despite being only 13 (see #3 for a case of how wrong this is). | ||
6 | Halloween | 1,018,599 | In a reversal from last year, the holiday on October 31st got more views than Mexico's carnival of the cadaverous celebrated the following day. | ||
7 | Day of the Dead | 645,380 | |||
8 | Houston Astros | 632,813 | Following a long history of either mediocrity or suffering, and a season where their hometown was hit by a hurricane, the Houston Astros finally could celebrate being the best team in baseball by winning the 2017 World Series in seven games over the LA Dodgers | ||
9 | Fordite | 604,699 | Reddit learned about this jewel made out of hardened car paint. | ||
10 | Deaths in 2017 | 586,760 | November has arrived, marking eleven months straight without the death list leaving, even in a week light on high profile departures. |
It's curious how the current trend for "binge watching" works, isn't it? For a conventionally broadcast series, you would expect it to stay at the top of popularity charts as viewers watched it every week, while for a series which all launches at once like how Netflix does it, you'd expect an initial burst as everyone watched at once, before a quick fall away. And yet, Stranger Things remains at number one for a third consecutive week, with star Millie Bobby Brown (#6) also making it into the list.
The only article other than ST to break a million views was baseball player Roy Halladay, who sadly died this week. Thor: Ragnarok is third. Allegations against actor Kevin Spacey (#4) make the list, as do allegations made on behalf of late actor Corey Haim (#10). The Paradise Papers (#9) was a big story making the news this week; while there's also the usual entries provided by Reddit and UFC.
For the week of November 5 to 11, the most popular articles on Wikipedia, as determined from the WP:5000 report were:
Rank | Article | Class | Views | Image | Notes |
---|---|---|---|---|---|
1 | Stranger Things | 1,619,424 | A third week at number one for Netflix's sci-fi-horror extravaganza, thusly becoming the second thing to threepeat in 2017. The previous was It, also a horror set in the 1980s. There's your key to success, Hollywood. Anyone wants to buy my script for a horror movie set at the 1985 World Snooker Championship final, my talk page is unprotected. | ||
2 | Roy Halladay | 1,359,778 | Roy Halladay, American baseball player from 1998 to 2013 who played for the Toronto Blue Jays and Philadelphia Phillies in his career, died on November 7 after his ICON A5 aircraft crashed into the Gulf of Mexico. In his highly-successful career, Halladay was called up for the Major League Baseball All-Star Game eight times and twice won the prestigious Cy Young Award for best pitcher. | ||
3 | Thor: Ragnarok | 1,350,754 | The seventeenth Marvel Cinematic Universe film is still number one at the US box office. Directed by Taika Waititi, the film brings back many of the much-loved characters from past MCU films, and includes Doctor Strange. The film has thus far made $215m in the US, making it the highest grossing Thor film thus far. Does this bode well for our chances of Thor 4? That's difficult to say. | ||
4 | Kevin Spacey | 882,622 | A series of allegations of sexual impropriety made against the actor Kevin Spacey are continuing to hit the headlines. Big news this week was the announcement that Spacey had been sacked from upcoming movie All the Money in the World, with Christopher Plummer being drafted in to refilm all his scenes. The film is still scheduled for release on December 22. | ||
5 | UFC 217 | 767,172 | November 4 saw the 217th Ultimate Fighting Championship major event, the second to be held at Madison Square Garden in New York. Three title fights were held as part of the eleven-fight card, with all three champions surrendering their titles, with Rose Namajunas taking the UFC Women's Strawweight Championship and T.J. Dillashaw (pictured) recapturing the UFC Bantamweight Championship, before in the headline fight Georges St. Pierre defeated Michael Bisping to become the UFC Middleweight Championship holder, incidentally becoming the fourth fighter to win titles in two different UFC weight classes. | ||
6 | Millie Bobby Brown | 756,870 | The first of four Stranger Things stars, young English actress Brown stars as psychokinetic young girl Eleven. Brown's performance in the role in last year's season saw her nominated for a Primetime Emmy Award for Outstanding Supporting Actress in a Drama Series. | ||
7 | Christopher Paul Neil | 745,098 | Reddit TIL, discussing the convicted Canadian child molester Neil, who appeared in over 200 photographs depicting child abuse. His face was obscured by a swirl effect, which was reversed by computer experts at the German Federal Criminal Police Office. Neil was arrested in Thailand in 2007. | ||
8 | Deaths in 2017 | 722,052 | The list of those we have lost continues to be in the top 10. | ||
9 | Paradise Papers | 638,995 | Anything popular these days gets a sequel, so here is the follow-up to last year's Panama Papers. A number of big names have been implicated to have kept their money offshore to avoid tax, including Queen Elizabeth II (pictured). Although since she is the Queen of the Bahamas, is it really offshore to have money there? Other big names implicated in the leak include Ellen Johnson Sirleaf, President of Liberia; Yukio Hatoyama, former Prime Minister of Japan; and Paul Hewson, current Bono of U2. | ||
10 | Corey Haim | 623,374 | Deceased actor Corey Haim was allegedly sexually assaulted by co-star Charlie Sheen during the filming of 1986's Lucas. Actor Dominick Brascia made these allegations, which Sheen denies. Haim's mother has claimed that Sheen had not assaulted her son, and rather, Brascia had abused Corey. |
32 featured articles were promoted.
24 featured lists were promoted.
10 featured pictures were promoted.
A monthly overview of recent academic research about Wikipedia and other Wikimedia projects, also published as the Wikimedia Research Newsletter.
Much of the existing Wikipedia research is based on the freely licensed datasets published by the Wikimedia Foundation: Content dumps, pageview numbers, Clickstream samples, etc. But some individual researchers are giving back too. An example for this is the TokTrack dataset, described in an accompanying paper[1] as
Tracking authorship and provenance of Wikipedia article text is by no means a new topic (see e.g. meta:Research:Content persistence). However, the paper's authors assert that their method provides much higher accuracy than earlier efforts such as Wikitrust. One of them, Fabian Flöck, has been studying this problem with other researchers for years (cf. our coverage from 2012 and 2014: "Precise and efficient attribution of authorship of revisioned content", "Better authorship detection, and measuring inequality", "New algorithm provides better revert detection"; the present dataset is generated by their "Wikiwho" algorithm, which also underlies a browser extension called "Whocolor").
What's more, the paper points out that "this data would be exceedingly hard to create by an average potential user" for the entire English Wikipedia due to the computational effort involved ("around 25 days on a dedicated Ubuntu Server [...] with 122 GB RAM and 20 cores"; for comparison, a community-created tool, "WikiBlame", which is linked from every revision history page on English Wikipedia, can take several minutes to find the provenance of an individual token in a single article).
After describing the dataset and the underlying methodology, the paper also briefly presents some insights that can be derived from it about the history of English Wikipedia. First, it looks at the number of added and surviving tokens over time, observing that
It highlights "a surprising spike in Oct. 2002 (also in absolute additions)". Although not mentioned in the paper, this is very likely the effect of bot contributions by User:Ram-Man of US geographical content. Figure 2(b) in the paper also seems to indicate that more than half of these October 2002 additions were still live 14 years later.
Analyzing the "persisting" tokens (that had not been removed within 48 hours) by user group, the authors observe:
The remainder of the paper uses the dataset to study editing controversies. First, the authors define two measures of how controversial an article is, both yielding evolution, Mustafa Kemal Atatürk and Bob Dylan (in that order) as the three most controversial articles as of October 2016 (based on the surviving content at that time only). They also find that "barneys" was the top most conflicted string token.
Lastly, they examine the frequency of edits that undo other edits partially or totally, where the token-based data enables a more sophisticated approach than simpler types of revert analysis. They find that
However, they caution that since "content added by one revision can over (a long) time be corroded by many small changes [...] 'revert' cannot per se be equated with antagonism here, as these numbers include the complete spectrum from minor corrections to full-on opinion clashes and vandal fighting."
See the research events page on Meta-wiki for upcoming conferences and events, including submission deadlines.
Other recent publications that could not be covered in time for this issue include the items listed below. contributions are always welcome for reviewing or summarizing newly published research.
Please assume that all this content was created and edited in good faith. There is no need to disparage the efforts of well-meaning contributors to the encyclopedia. And some editors do not have a full grasp of the English language. You can't read the following content without multiple question marks appearing over your head (assuming that you are a cartoon character) and we often take ourselves too seriously. Some readers may be appalled that such content exists for the whole world to see on their cell phones, but so it goes. Reading Wikipedia for its entertainment value is a hobby of some (author included) and for the rest of you, well, you just don't get it. See if you can guess the article.
Almost all content in this article is taken from the linked articles; see their page histories for attribution.