The Signpost
Single-page Edition
WP:POST/1
4 September 2024

News and notes
WikiCup enters final round, MCDC wraps up activities, 17-year-old hoax article unmasked
In the media
AI is not playing games anymore. Is Wikipedia ready?
Recent research
Simulated Wikipedia seen as less credible than ChatGPT and Alexa in experiment
News from the WMF
Meet the 12 candidates running in the WMF Board of Trustees election
Wikimania
A month after Wikimania 2024
Serendipity
What it's like to be Wikimedian of the Year
Traffic report
After the gold rush
Humour
Local man halfway through rude reply no longer able to recall why he hates other editor
 

Wikipedia:Wikipedia Signpost/2024-09-04/From the editors


File:Recibimiento Francisca Crovetto y Yasmani Acosta (Gbf0662).jpg
Alex Ibañez
CC
0
0
450
2024-09-04

After the gold rush

Contribute   —  
Share this
By Igordebraga, Vestrian24Bio, ltbdl, Marinette2356, CAWylie, Alexysun. Ollieisanerd
This traffic report is adapted from the Top 25 Report, prepared with commentary by Igordebraga, Vestrian24Bio, ltbdl, Marinette2356, Alexysun. Ollieisanerd, and CAWylie.

We were staying in Paris (July 28 to August 3)

Rank Article Class Views Image Notes/about
1 Imane Khelif 6,746,991 Would be nice if the Olympics (#3) propelled an athlete to the top of this list simply for excelling in sport. Instead, the gender controversies that are all the rage nowadays manifested once Imane Khelif, an Algerian boxer competing in the women's 66 kg division, won her opening bout in less than a minute, with just one punch.
2 Simone Biles 4,580,661 Three years after a much hyped appearance in the Tokyo Olympics that didn't pan out because she felt ill during the initial competitions, the most decorated gymnast in history is dominating the gymnastics competitions in Paris like she did in Rio, having won three golds in team, individual all-around and vault, and has become the most successful U.S. gymnast in the Olympics (and third overall) with ten medals, seven of which are golden. Her closest competitor is another Black gymnast from the Americas, Brazilian Rebeca Andrade, who won a gold in Biles' absence in Tokyo, but so far only managed to gain two silvers and a bronze.
3 2024 Summer Olympics 3,486,142 France receives the biggest multi-event sport in the world, mostly in host city Paris, but with some sports being held in 15 other Metropolitan France cities, and going as far as Tahiti for the surfing competitions. 32 sports are being contested, including the debut of breakdancing, and for the third time a controversy made Russian athletes compete with a different collective name. After the Russian doping scandal led to them being the Olympic Athletes from Russia and the Russian Olympic Committee, this time the Russian invasion of Ukraine propelled a ban of just about every Russian and Belarusian athlete, and the select few who could enter are competing as Individual Neutral Athletes.
4 Deadpool & Wolverine 3,467,395 Again the Marvel Cinematic Universe provides a movie full of nostalgia, fanservice and multiversal shenanigans. Only this time it's far from family-friendly entertainment, as the transition of the X-Men from the Fox film series to the Marvel Studios stable is led by the ultraviolent and potty-mouthed anti-hero Deadpool, who tries to prevent the destruction of his universe by teaming up with the most famous of the Mutants, Wolverine, who in spite of being another Canadian fond of bloodshed, is not as welcoming to the buffoonery of the "Merc with a Mouth". The combined power of Ryan Reynolds and Hugh Jackman in their signature roles, along with the usual action and comedy (only more graphic this time around - there is a man getting his skin ripped off and over 100 F-bombs!) and added tributes to many past Marvel movies, led to Deadpool & Wolverine becoming a smash hit, with positive reviews and massive box office intakes - the $200 million budget alone was covered by the North American opening weekend, and analysts think a billion dollars worldwide is very possible, in spite of high content ratings.
5 Kamala Harris 2,522,226 The American vice-president is now officially the Democrats' candidate for the 2024 United States presidential election, quite progressive to rely on a Black woman (no matter if the competition questions her ethnicity). Expect the next edition to have high views for her and the guy Harris chose as her running mate.
6 Katie Ledecky 1,951,185 Two American women returning to Olympic glory. Ledecky is the most decorated female swimmer ever, and in her fourth Olympic appearance reached 14 medals with the four she got at #3, including gold in both the 800m and 1500m freestyle races. Lee was the gymnastics team standout in Tokyo once #2 bailed out, winning the all-around competition, whereas this time she shared the team gold with Biles and was behind her and Andrade in the all-around podium.
7 Sunisa Lee 1,604,952
8 Michael Phelps 1,590,737 Two athletes not competing at #3 but present in Paris for other reasons. Phelps is the male equivalent of #6, who became the most decorated Olympian ever by dominating the pools in four different games (this after not winning anything in his debut!), with 28 medals and only 5 of them not being gold; this year, though, his appearance rather included a video with Snoop Dogg, who is showing up in a lot of competitions. Owens will never be an Olympian, given American football is far from entering the programme, but was cheering on wife #2, which always leads to amusing pictures, since he's one head taller than her.
9 Jonathan Owens 1,382,247
10 India at the 2024 Summer Olympics 1,295,100 No surprise in seeing this here, or that the country did not perform well in spite of its huge population. Still, the first week of #3 had three bronze medals from shooting, two with air pistols and one with rifles. Near medaling was achieved with fourth places in both shooting and archery. As a sidenote, the Indian flag bearers at the opening ceremony were two people good with rackets: shuttler P. V. Sindhu (who didn't get her third Olympic medal due to falling in the first round of the playoffs) and Sharath Kamal of table tennis.

Let's show them we are better (August 4 to 10)

Rank Article Class Views Image Notes/about
1 Tim Walz 6,659,696 Harris (#8), the vice-president and Democratic candidate for the upcoming United States election, has picked the progressive Governor of Minnesota to be her running mate.
2 2024 Summer Olympics 3,234,409 The Games of the 33rd Olympiad hosted in Paris is reaching its conclusion this Sunday, with only one question remaining, whether US or China will finish atop the medal table. As much as the competitions were entertaining, the Games saw their fair share of problems, like the Olympic Village having no air conditioning and insufficient food, the Seine not being clean enough yet still serving as swimming venue for two sports, and the surfing competition held in Tahiti having an unfortunate lack of waves during its decisive semifinals and finals.
3 Deadpool & Wolverine 2,452,770 X(-Men) gon' give it to ya! The team-up of Ryan Reynolds and Hugh Jackman as two anti-heroic Mutants fond of slashing people in their debut at the Marvel Cinematic Universe is wrecking the box office, and should soon join another comic book-based movie, Joker, in making a billion dollars in spite of an R-rating.
4 Simone Biles 1,740,385 The most decorated gymnast ever had a dominating performance at #2, winning golds with the U.S. team, the individual all-around, and the vault. Only the last day of competition had Biles being surpassed, as she first missed the podium altogether with a fifth place in the balance beam, and then getting the silver at the floor, beaten by her Brazilian friendly rival Rebeca Andrade, who Biles made sure to bow to in the medal presentation (the other woman paying respects, Jordan Chiles, is currently threatened to lose her bronze).
5 Imane Khelif 1,726,957 #2 could simply be the pinnacle of this Algerian boxer's career, having won the gold medal. Yet Khelif earned a lot of attention for less flattering reasons, given that after quickly winning her first fight, she was subject to accusations of the most outrageous sort, leading to her opening a criminal complaint against the transvestigation full of cyberbullying that in her words, "harms human dignity".
6 Armand Duplantis 1,622,327 Still in #2, this Swedish pole vaulter successfully defended the gold medal he had first earned in Tokyo 2020, breaking his own world record in the process. The record-setting jump of 6.25 m earned extra attention for its heartwarming follow-up, as Duplantis rushed to the stands to kiss his girlfriend.
7 Sheikh Hasina 1,483,799

The daughter of the "founder of Bangladesh", Sheikh Mujibur Rahman, Hasina served as the prime minister of Bangladesh from January 2009 to August 2024. Her reign was marked by government corruption, democratic backsliding, enforced disapperances, and extrajudicial killings. Domestically, she was criticised as being too close to India.

Protests and riots broke out in June. Initially, they were meant to reform the quota system, which prescribes quotas to government jobs, but evolved into anti-government protests. At least a thousand protesters died, with many more injured. The movement eventually demanded Hasina's resignation on 4 August. She resigned on 5 August, and has fled to India.

In the meantime, Nobel Peace Prize winner Muhammad Yunus has taken over the seat as a caretaker leader.

8 Kamala Harris 1,470,024 I can't wait for this election to end.
9 Noah Lyles 1,363,713 Last year, this American sprinter earned attention as his response to winning three golds in the 2023 World Athletics Championships was complaining about a habit of the American major leagues: "You know the thing that hurts me the most? I have to watch the NBA Finals and they have 'world champion' on their head. World champion of WHAT? The United States?" #2 made Lyles become both world and Olympic champion by winning by a chin the most prestigious race, the 100 metres. He contracted COVID-19 in the days before the 200m race, but decided to compete in the final regardless: eventually, despite winning the bronze, he was so exhausted that he left the track in a wheelchair. And the basketballers decided to remember Lyles' swipe by celebrating their gold medal by posting on social media "Are we world champs now?" (to which the response was "No, you're Olympic champions, the basketball world champions are those who win the World Cup, and you didn't.")
10 Vinesh Phogat 1,359,234 This Indian freestyle wrestler competed under the 50kg women's category at #2 and had qualified for the Final, even defeating the reigning Olympic and world champion Yui Susaki in the first round. But during the weigh-in on the morning of the finals, she was disqualified for being above the stipulated weight by 100 g (3.5 oz) and was relegated to last place in the classification. Although she had appealed against the decision to the Court of Arbitration for Sport, she was ultimately declared as "Lost by forfeit", breaking the hearts of many Indians in the process.

Just like a prayer, you know I'll take you there (August 11 to 17)

Rank Article Class Views Image Notes/about
1 Deadpool & Wolverine 1,865,839 A few weeks after its release, the sole Marvel Cinematic Universe movie of the year managed to top the Report. It's no surprise, as it was a sure fire way to get a hit by teaming up the two X-Men that earned solo movies, the overtly irreverent Deadpool and the embittered and grumpy Wolverine, and fans also liked to see along with the expected action and comedy the unexpected return of characters from non-MCU Marvel adaptations (including from a movie that never came out). Deadpool & Wolverine made over a billion dollars and surpassed Joker as the highest-grossing R-rated movie, although the clown from the Distinguished Competition will have a chance to earn its belt back in October, when Joker: Folie à Deux will probably make some people go gaga.
2 Alien: Romulus 1,299,377 Like the Predator two years ago, the Alien got another chance at the movies. Set between the first and second installment of the series, Alien: Romulus has a group of scavengers raiding an abandoned space station, only to discover the place was used to study a particularly vicious alien creature who is subsequently out to get them. Reviewers and fans alike were impressed at how director Fede Alvarez made Alien: Romulus both frightening and stylistically faithful to the earlier Alien movies, and already made back its budget in a single weekend with $108 million worldwide.
3 2024 Summer Olympics 1,252,858 The Games of the XXXIII Olympiad hosted by Paris concluded last Sunday, with US finishing atop the table for the fourth consecutive time and overall 19th time - it was a tight affair, though, given the US had the same number of golds as China (not helped by Russia's absence). The event called it a day with the Olympic flag being handed over to Tom Cruise, who then carried it to Los Angeles, the host city of the next Olympics.
4 It Ends with Us 1,032,327 This 2016 romance novel, about dealing with domestic violence and emotional abuse, nearly spawned a coloring book in 2023, until author Colleen Hoover wisely changed her mind. Instead, it was adapted into a film (#8) that released last week.
5 Deaths in 2024 981,509 They say an end can be a start
Feels like I've been buried, yet I'm still alive...
6 Stree 2 893,252 This Bollywood sequel to the 2018 film was released last Friday coinciding with the Indian Independence day and opened to positive reviews from critics. The film has already emerged as the sixth highest-grossing Indian film of 2024 and third highest-grossing Hindi film of 2024.
7 Rachael Gunn 842,250 "Raygun" had a rough week. She entered the Olympics as a breakdancer with her Australian team (albeit not in the proper attire), scored zeroes in the first round against three competitors, and quickly became the target of online bullies, to the point that a petition on Change.org was made regarding her "unethical conduct" and whether or not she should have even been on an Olympic team. AOC executive Matt Carroll saw the veiled bullying of an entry and called for its subsequent removal. Gunn herself has lashed out at the internet trolls.
8 It Ends with Us (film) 824,641 The Justin Baldoni-directed adaptation of #4 opened second at the box office, right behind #1. The competition between husband and wife Ryan Reynolds and #10's latest cinema releases over the top spot certainly does resemble last year's unforgettable battle between a doll and an atomic bomb.
9 Kamala Harris 762,832 Americans don't really know what Harris stands for, apparently, so they go to Wikipedia.
10 Blake Lively 701,559 The wife of #1 star Ryan Reynolds plays the lead character in #8. Though the film did come in second at the box office, right behind Marvel's latest release, Lively's presence on this list is most likely enhanced due to the feud with her co-star and director Justin Baldoni and the unusual press tour of It Ends with Us, which had the two lead stars promoting the film separately (unlike the currently inseparable Reynolds and Hugh Jackman), as well as Lively framing her movie like a celebratory girls' night, despite its heavy subject on domestic violence and physical abuse, and promoting her new haircare line.

I close my eyes, Heaven help me (August 18 to 24)

Rank Article Class Views Image Notes/about
1 Kamala Harris 2,232,813 The 2024 Democratic National Convention was held from August 19 to 22 in Chicago: a loud, boisterous convention with lots and lots of speeches. It's also where delegates selected the presidential nominee. Harris, the current Vice-President who was literally the only candidate standing, was selected.
2 Mike Lynch (businessman) 2,079,337 The British tech tycoon died by drowning on August 19, aged 59, after his superyacht Bayesian sank off the coast of Sicily during a violent storm. He had just been fully acquitted of fraud during an American criminal trial in June, a case where he had just a 0.5% chance of acquittal. Lynch's co-defendant in the trial, Stephen Chamberlain, had just been killed after being hit by a car whilst running on August 17.
3 Stree 2 1,468,938 This Bollywood comedy horror film released last week, has made ₹505 crore at the box office (against a budget of ₹50 crore) and already is the second highest-grossing Indian film of 2024, behind only Kalki 2898 AD.
4 Alien: Romulus 1,294,533 The latest installment in the 45-year old franchise about slimy and particularly invasive extraterrestrials, announced at the 2019 CinemaCon and taking place between the first two films of the franchise, opened last week to positive reviews from critics and has grossed $129 million worldwide so far.
5 Alain Delon 1,269,691 An icon of French cinema, who worked for at least six decades (which included forays into Hollywood like Lost Command and Red Sun, plus playing Julius Caesar in Asterix at the Olympic Games), actor Alain Delon died at the age of 88 of B-cell lymphoma.
6 Tim Walz 1,187,274 #20 on last week’s report. #1 on the week before last week. You probably already know who he is: The Democrats' VP pick. A slight boost in page views this week can be attributed to the DNC occurring this week, where Harris and Walz were official locked in as the Democratic Party's nominees for the presidential election in November.
7 Robert F. Kennedy Jr. 1,175,853 The son of Robert F. Kennedy, Kennedy Jr. is an environmental lawyer turned anti-vaxxer who ran for US president, but gave up on August 23, subsequently endorsing Donald Trump, due to dismal polling and campaign funds running out. He blamed his failed campaign on Democrats, and he could potentially have a position in Trump's administration if he wins.
8 Deadpool & Wolverine 1,111,265 The savior of the MCU, released a month ago has made $1.16 billion worldwide, and became the second-highest-grossing film of 2024 behind another Disney movie. It has now surpassed the Civil War to become the 8th highest grossing film in the franchise and is expected to surpass the original Avengers film soon.
9 Donald J. Harris 1,038,889 Yes, he is #1's father, and his English Wikipedia page most likely rose to #9 because Elon Musk claimed that the Stanford University emeritus professor is a "Marxist economist" (did you mean: Marxian economist) in his live conversation with Donald Trump on X on August 12, telling people to look it up if they didn't believe him.
10 Deaths in 2024 998,439 Limitless undying love which shines around me like a million suns
It calls me on and on across the universe

Exclusions

Most edited articles

For the July 19 – August 19 period, per this database report.

Title Revisions Notes
List of Kamala Harris 2024 presidential campaign endorsements 2678 A laundry list of people supporting the Vice-President in the upcoming election. These include even Republicans and conservatives, showing how controversial her opposition is.
Deaths in 2024 2153 Our version of the obituary, and the period had among its deceased actress Gena Rowlands, executive Susan Wojcicki, voice actress Rachael Lillis and musician Greg Kihn.
2024 Venezuelan presidential election 1974 Hugo Chávez used questionable tactics to remain in power in Venezuela, and his successor Nicolas Maduro is more of the same, as there was strong evidence that opposing candidate Edmundo González Urrutia had more votes in the latest presidential election but the incumbent government insisted they still won through fraudulent claims. The Venezuelans protested, leading to an attempted crackdown by the government, and many countries are questioning the election results.
2024 Wayanad landslides 1842 India is infamous for heavy rain, and a consequence of this was that the Wayanad district of Kerala saw hillsides collapsing in the early hours of July 30, sending torrents of mud, water, and boulders. It is the deadliest tragedy in Kerala history, with reports of over 420 fatalities, 397 injuries, and more than 118 people still missing.
United States at the 2024 Summer Olympics 1785 Most countries treat the United States in the Olympics like the antagonists in sports movies. Even if the hosts have representatives in all sports, the U.S. still are the country with the most athletes (592, as opposed to 573 for France), who seem to win just about every competition – and while Paris was an exception, there are occasions where all three medals go to Americans. And to make matters worse, when Team USA don't have the most gold medals, their media starts counting by total medals so they remain as the top team. With that out of the way, the U.S. was again atop the medal table with 40 gold medals and 126 total. And they are the next hosts, so don't be surprised if the numbers are even bigger in 2028 (even if not as massive as the last time Los Angeles had the Games).
Non-cooperation movement (2024) 1627 As mentioned above, a protest against the government of Bangladesh that eventually led to its Prime Minister, Sheikh Hasina, resigning.
Deadpool & Wolverine 1517 After a bumpy 2023 for Disney, the company is making all the money in 2024 with two billion dollar movies, Inside Out 2 and the sole Marvel Cinematic Universe release of the year, featuring two Mutant heroes and a cluster of cameos and role reprisals. Given how full the Marvel Studios schedule is, no word on when a proper X-Men movie will be made.
2024 Summer Olympics opening ceremony 1465 A rainy affair that included the Parade of Nations being boats sailing down the Seine, things like a masked torchbearer and Gojira playing a heavy metal version of "Ça Ira" in front of decapitated Marie Antoinettes (here's hoping Australia copies that by doing a Mad Max tribute, complete with flaming guitar, in Brisbane 2032!), and a (supposed) recreation of The Last Supper that made conservatives angry.
2024 Bangladesh quota reform movement 1436 Bangladesh has a quota system for government jobs. A movement initially focused on restructuring it eventually expanded against what many perceive as an authoritarian government, and the hundreds of protestors and civilians, most of whom were students, were often met with armed resistance by the police and other government forces, leading to 354 dead and thousands injured, including children. Once the movement refused negotiations with Prime Minister Sheikh Hasina due to the violence, it evolved into the aforementioned non-cooperation movement, who proceeded to basically take over capital Dhaka, leading Hasina to resign and flee to India.
India at the 2024 Summer Olympics 1409 Another instance of India not being a sports potency like its neighbor that also has over a billion people, with six medals (China usually gets that in a single day... or sport), none golden – at most Neeraj Chopra tried to defend his Tokyo title and got the silver, with the remaining five bronzes being three in shooting, and one each in field hockey and wrestling. And that's not counting all the close calls (4th places at archery, badminton, shooting and weightlifting) and Vinesh Phogat being disqualified just when she was guaranteed at least silver. In any case, Los Angeles 2028 has a possible podium, as Twenty20 cricket will be one of the competitions.
Bigg Boss OTT (Hindi Digital series) season 3 1399 One of the Indian versions of Big Brother has a streaming spin-off, with the "OTT" standing for "over-the-top".
Great Britain at the 2024 Summer Olympics 1397 The UK remain reaping the sports investments made for London 2012, as they matched the 65 medals won as hosts, albeit the 14 golds were the lowest amount since the 9 of Athens 2004.
2024 United Kingdom riots 1330 Shortly after the 2024 France railway arson attacks, things got even worse across the English Channel, with looting and hate crimes along with the fires. It started with a mass stabbing in Southport on July 29, and misinformation was spread that the attacker was a Muslim migrant or asylum seeker (the one arrested suspect is a British citizen of Rwandan descent), leading to an attack to a mosque the following day, followed by many oft-violent far-right, anti-immigration protests until August 5, leading to over a thousand arrests.
Chronological summary of the 2024 Summer Olympics 1286 How high can I jump
How high can I throw
How high can I run
How long can I hold my breath and stay underwater and wave my legs around in perfect unison with my partner who really doesn't understand me
Or my Olympic dream...
China at the 2024 Summer Olympics 1229 With Russia banned (aside from a small contingent of athletes) due to that awful thing that doesn't end, it was a tighter race between the U.S. and the last team to beat them at the medal table. China had 40 of their 91 medals be golden (including all in table tennis!), and given the Americans had the same amount, the Asians only got down to second place due to tiebreaker by number of silvers. No word if they repeated the 'laughable sore loser excuse to claim the top spot' – just like the U.S. shifts to total medals, after Tokyo 2020 China tried to say they were #1 by counting the medals of Taiwan and Hong Kong.



File:Shizuki and Jun Kasai on June 3, 2009.jpg
DPS-fan Magyar
CC BY-SA 3.0
70
450
2024-09-04

AI is not playing games anymore. Is Wikipedia ready?

Contribute   —  
Share this
By Bri, Jayen466, Oltrepier, Smallbones and HaeB

Portland pol's publicly-paid profile: Part II

See previous coverage: "Portland politician spends $6,400 in taxpayer dollars to 'spruce up his profile on Wikipedia'" about the article Rene Gonzalez (politician)

The 2020 Oregon Ballot Measure 107 allows campaign finance disclosure regulations in the state of Oregon, which may have been violated by the Gonzalez campaign, in addition to Gonzalez authorizing irregular expenditures of taxpayer funds not allocated to campaigning. Alt-weekly Portland Mercury said "It's unclear which fund the money for the Wikipedia edits came from, and why the money didn't instead come from Gonzalez's mayoral campaign funds."

Two Portland-based television stations had stories on an investigation into the expenditures. KOIN, the CBS affiliate, said that Gonzalez claims "the money went to train staff on how to follow Wikipedia standards", not to conduct impermissible campaigning; KGW, the NBC affiliate, also carried a full story about the case, titled "Commissioner Rene Gonzalez now the subject of Portland campaign finance investigation". – B

Is Wikipedia ready to play the game of Jum-AInji?

A transformer might think this image depicts "The Transformer", but it does not (it is, however, depicting an instance of Japanese hardcore)

In a recent article for The New Yorker, titled Was Linguistic A.I. Created by Accident? (paywalled), Stephen Marche focuses on the role of chance and good luck in the research that led to the landmark 2017 AI paper "Attention Is All You Need", which introduced the transformer architecture. The paper was originally supposed to focus on using the transformer to make English-to-German translations.

Instead, as part of the AI model's training process, the Google team asked the transformer to read Wikipedia entries for two days, covering almost half of the platform's pages. The model was then asked to create five new Wikipedia-style articles from scratch, all about made-up subjects called "The Transformer": a fictitious Japanese hardcore punk band formed in 1968, a fictitious video game, a fictitious 2013 Australian sitcom, a fictitious studio album by an alternative metal group called Acoustic, and even a fictitious science-fiction novel. At first reading, the articles produced by Transformer on the made-up topics all looked like real Wikipedia articles: they were almost too good, "filled with inconsistencies, but [...] also strikingly detailed", suggesting that AI had made a jump of twenty or more years of progress:

Why was a neural network designed for translating text capable of writing imaginative prose from scratch? "I was shocked, blown away," (researcher Aidan) Gomez recalled. "I thought we would get to something like this in twenty years, twenty-five years, and then it just showed up." The entries were a kind of magic, and it was unclear how that magic was performed.
— Was Linguistic A.I. Created by Accident?, Stephen Marche

The historical bond between Wikipedia and machine-learning based natural language processing goes back even further. The first attempts to provide the encyclopedia with text generated using artificial neural networks trace back to at least 2009.

But artificial intelligence and large language models are not just derived from Wikipedia; they are important topics for discussion and policy about the platform's future.

The rapid rise of ChatGPT has raised the most interest and sparked dozens of research efforts towards the implementation of LLMs in the creation and improvement of Wikipedia articles, among other tasks, with the STORM system prototype being the latest example. The Wikimedia Foundation has taken note of AI's progress, for example, by expanding its Machine Learning team and even testing an experimental ChatGPT plugin between July 2023 and February 2024. The Signpost itself has included DALL-E-generated images in various articles. On the other hand, in somewhat Jumanji style, the more we get invested in the AI game, the more traps we discover: without proper checks and balances, machine-generated content can pose a threat to the integrity of Wikipedia, should the number of unsourced and fictitious articles keep increasing and causing more problems with COI-related material and disinformation.

The Spanish newspaper El País recently interviewed Wikimedian and Wikimedia España member Miguel Ángel García, along with the WMF's Director of Machine Learning, Chris Albon (in Spanish, free registration might be required). García, who joined Wikipedia in 2006, noted how many newly-registered users introduce themselves by "[pasting] a giant text, apparently well-structured and well-developed", which turns out to be poorly-written and redundant after a closer look. Luckily, the platform is usually able to handle this material through mechanisms such as speedy or proposed deletion, as well as the continuous efforts of its volunteers, which have also been acknowledged by Albon. (Everyone interested can give a helping hand by joining initiatives such as the WikiProject AI Cleanup.)

However, both expressed concerns over the long-term impact of automatic content on the encyclopedia: while García is mainly worried about the incorporation of "pseudo-media" hosting bot-generated articles as sources on Wikipedia - a phenomenon that could actually be mitigated through reports at the noticeboard - Albon took a brief detour from his usually optimistic view on AI tools, explaining that "if there's a detachment between the places where knowledge is created, like Wikipedia, and the places where it is accessed, like ChatGPT, we're at risk of losing a generation of volunteers". He also said that LLMs providing the platform with poorly-sourced or unreferenced content could "introduce an unprecedented amount of disinformation" on the Internet, since "users will not be able to easily distinguish accurate information from [AI] hallucinations"; quite an ironic situation to find ourselves in, considering that chatbots such as ChatGPT and Google Gemini are being fed with thousands of Wikipedia articles as part of their training schedules.

Titled "ENC-AI-CLOPEDIA. AI is mining the sum of human knowledge from Wikipedia. What does that mean for its future?", a separate interview by Sherwood News (the media arm of trading platform Robinhood Markets) also featured Albon, together with his colleague Lane Becker, Senior Director of Earned Revenue at the Wikimedia Foundation and president of its for-profit subsidiary Wikimedia LLC, which runs Wikimedia Enterprise.

The interviewer first confronted them with "Data from Similarweb [which] shows that traffic to Wikipedia has been in decline" since about 2020. In response, Albon pointed to the Foundation's own (presumably more precise) pageview and unique devices data, with Becker asserting that "We have not seen a significant drop in traffic on Wikimedia websites that can directly be attributed to the current surge in AI tools." (This conclusion is somewhat in contrast with two recent academic papers, see our coverage: "ChatGPT did not kill Wikipedia, but might have reduced its growth", "'Impact of Generative AI': A 'significant decrease in Wikipedia page views' after the release of ChatGPT")

However (similar to Albon in the El País interview), Becker voiced "concern [...] about the potential impact that these AI tools could have on the human motivation to continue creating and sharing knowledge. When people visit Wikipedia directly, they are more likely to become volunteer contributors themselves. If there is a disconnect between where knowledge is generated (e.g. Wikipedia) and where it is consumed (e.g. ChatGPT or Google AI Overview), we run the risk of losing a generation of volunteers." (Not mentioned, but presumably on Becker's mind as well, was the fact that these visitors are also, via Wikipedia's well-known donation banners, the Foundation's most important source of revenue by far.)

Asked "How do you feel about practically every LLM being trained on Wikipedia content?", Becker stressed that "we welcome people and organizations to extend the reach of Wikipedia's knowledge. Wikipedia is freely licensed and its APIs are available for free to everyone, so that people all over the world can use, share, add to, and remix Wikipedia content." However, "We urge AI companies to use Wikimedia's free APIs responsibly and include recognition and reciprocity for the human contributions that they are built on, through clear and consistent attribution. They should also provide pathways for continued growth and maintenance of the human-created knowledge that is used to train them" - such as "Clearly attributing knowledge back to Wikipedia", but also, for "high-volume commercial reusers of Wikipedia content to use our opt-in paid for product, Wikimedia Enterprise." Becker shared that its total revenue (i.e. not accounting for the staffing and other costs of Wikimedia Enterprise itself) "for FY 2022-23 was $3.2 million - representing 1.8% of the Wikimedia Foundation's total revenue for the period." However, he declined to disclose how much of that came from Google (one of the few publicly known customers, another one being yep.com).

S, O, H

See also in this issue's News and notes: "AI policy positions of the Wikimedia Foundation"

In brief

Red clover for Clovermoss
See previous Signpost coverage about the controversy surrounding this article, as well as the discussion about the reliability of the Anti-Defamation League on the Israeli-Palestinian conflict, here and here.



Do you want to contribute to "In the media" by writing a story or even just an "in brief" item? Edit our next edition in the Newsroom or leave a tip on the suggestions page.


Wikipedia:Wikipedia Signpost/2024-09-04/Technology report Wikipedia:Wikipedia Signpost/2024-09-04/Essay Wikipedia:Wikipedia Signpost/2024-09-04/Opinion


File:Giza Pyramids during "Forever is Now" exhibition.jpg
Mona Hassan Abo-Abda
CC BY-SA 4.0
75
0
450
2024-09-04

WikiCup enters final round, MCDC wraps up activities, 17-year-old hoax article unmasked

Contribute   —  
Share this
By Bri, Ciell, Headbomb, Andreas Kolbe, Oltrepier, and HaeB

The WikiCup gears up for its final round

TKTK

The 2024 WikiCup, hosted by users Cwmhiraeth, Epicgenius and Frostly, is entering its final phase, after Round 4 ended on 29 August. A total number of 135 users, including the late Vami IV, joined the contest at the start of this year; however, just eight of them have made it to the ultimate showdown. Here are the finalists, ranked from first to last as per their scores in the latest round:

Since its creation back in 2007, the WikiCup has strived to "encourage content creation and improvement and make editing on Wikipedia more fun", and this year's edition is no exception: according to the official data, competitors have so far contributed to 44 featured articles, 72 featured lists, 385 good articles, 94 In the News credits, and over 300 Did You Know credits; thanks to their efforts, 38 articles were also added to featured topics and good topics.

On behalf of The Signpost, we would like to thank the judges and every participant in the 2024 WikiCup, and wish good luck to the eight finalists.

O

Journals cited by Wikipedia compilation now tracks free DOIs

TKTK
Tired of running into paywalls as you try to find new information? Look for the green free-access lock () next to DOIs and other identifiers in citations!
Related articles
Open Access

Tens of thousands of freely available sources flagged
4 December 2023

Top scholarly citers, lack of open access references, predicting editor departures
27 March 2022

The Wikipedia SourceWatch
31 March 2019

New guideline for technical collaboration
4 November 2016

Wikimedia Foundation adopts open-access research policy
25 March 2015


More articles

As of 18 August, the Journals cited by Wikipedia (JCW) compilation (see previous Signpost coverage) now tracks the number of distinct DOIs present on Wikipedia, and how many are flagged with |doi-access=free. Several of these are automatically tracked and tagged as free to read by templates and bots (see previous Signpost coverage). As of the 1 August dump, the compilation kept track of 3.70M citations, of which 2.41M had DOIs. Of the citations that had DOIs, 661,103 were identified as free to read, or about 27.44%.

The 17–18 August 2024 update of the CS1/CS2 modules further identified the Leibniz International Proceedings in Informatics (doi prefix 10.4230) and the Living Reviews journal series (doi prefix 10.12942) as free-to-read registrants, as well as 11 individual journals that can be identified by the starting pattern of DOIs (like 10.1046/j.1365-8711..., 10.1093/mnras.., and 10.1111/j.1365-2966... for the Monthly Notices of the Royal Astronomical Society). Citation bot will automatically flag those with |doi-access=free when it runs on the article (see our guide on how to use Citation bot yourself).

If you notice a DOI link that takes you to a free-to-read article that wasn't flagged by the bot, you can flag the citation manually with |doi-access=free. You can also try to use WP:OABOT (see our guide on how to use OAbot yourself). If you are aware of fully free-to-read journals/publishers that aren't already kept track of by the CS1/CS2 templates (see CS1/2 FAQ), leave a note at Help talk:CS1 and User talk:Citation bot.

Following the 20 August dump, the compilation kept track of 3.72M citations, of which 2.42M had DOIs. Of the citations that had DOIs, 663,976 were identified as free to read, or about 27.46% (up from 27.44%). It took a few days for the server cache to clear and tracking categories to be populated. I estimate that the 'true' count should have been about 666K, mostly due to MNRAS and MNRAS Letters being identified as free to read.[a]

Related to the JCW update, all CS1/2 templates (like {{cite journal}} and {{citation}}), and the standalone templates {{doi}} and {{doi-inline}}, now support the flagging of free-to-read DOIs with |doi-access=free. The standalone versions, however, are not currently supported by any bot, nor do they have tracking categories.

Thanks to Trappist the monk for their efforts on templates and the identification of free-to-read publishers/journals (I was also involved), as well as the maintainers of Citation bot, JL-Bot, and OAbot (particularly AManWithNoPlan, JLaTondre and Nemo bis) for facilitating the mass-tagging of free-to-read articles.

  1. ^ Update: Following the 1 September dump, most of the caching issues were resolved, and we have a count of 3.73M citations, of which 2.42M had DOIs (an increase of 15,261 since 1 August). Of the citations that had DOIs, 668,036 were identified as free to read, or about 27.56%. An increase of 6,933 free DOIs (both new and newly-identified), representing 0.11% of all DOI citations, since 1 August.

H

AI policy positions of the Wikimedia Foundation

In a blog post, the Wikimedia Foundation provides an overview of several statements it has submitted since last year in response to

[...] governments and international organizations [...] seeking stakeholder feedback about how [AI] policies should be formulated in order to best serve the public interest. [...] The Foundation’s comments have fallen into two categories. Some are directly relevant to the work being done by volunteer Wikipedia editors around the world, such as on copyright and openness of foundational AI models. Others applied our values and the valuable lessons we have learned from our AI/ML work to benefit public interest projects focused on free knowledge and the online information ecosystem—i.e., decentralized community-led decision-making, privacy, stakeholder inclusion, and internet commons.
— "AI for the people: How machines can help humans improve Wikipedia" (Wikimedia Foundation)

For example, in a response to the US Copyright Office's Request for Comments on AI and Copyright, the Foundation states that it "generally supports uses of Wikipedia content for purposes including AI model development", but (as summarized in the blog post) argues that

At a minimum, AI developers who include Wikipedia in the training data used to create large language models (LLMs) should publicly acknowledge that use and give credit to Wikipedia and the volunteer editors who made this rich source of raw materials for LLMs.

At the same time, the Foundation's statement indicates that this attribution might not always be legally required, depending on whether courts decide that the unauthorized use of copyrighted content in training of such AI models is covered by fair use (in which case the attribution requirements of Wikipedia's CC BY-SA 4.0 license would be moot). The Foundation refrains from taking a categorical position on this legal question: "Based on our analysis, we do not believe that training AI models should either be categorically fair use or categorically not fair use. Rather, the particulars of the training process and the way courts view the purposes of a use should inform whether a particular training process is fair or not." The analysis does however offer some detailed if speculative observations on how courts might evaluate the four fair use factors in this context. For example, it is argued that because "the vastness of the datasets used in training mean that any single copy [of a copyrighted work] is barely a drop in the ocean of the whole", judges may want to focus on "the extent to which a work is weighted in the development of a model": "Hypothetically, if a copyright protected work was manually weighted to have an outsized impact in model development, then one could argue that although the uses of other full works may be fair, the amplification of one particular work in the training set is not." (Various LLMs are known to have weighted Wikipedia more highly than other parts of their training dataset, for example GPT-3.)

On the other hand, the Wikimedia Foundation's statement also urged the Copyright Office to take not only the perspective of copyright owners into account, but also that of the users of copyrighted works and of AI-based tools – noting that "The Foundation is somewhat uniquely positioned as both the host of a primary source of training material for generative AI and a user of many AI and ML tools that aid human editors with the creation of free knowledge." In particular, it cautions to keep public interest in mind in possible future changes to copyright laws and AI regulations, e.g.

On the use of data, specifically, we encourage regulators and legislators to align their approaches with existing models, such as the European Union’s inclusion of an exemption for text and data mining in the Directive on Copyright in the Digital Single Market, that enable public interest research and other beneficial uses of protected works.

[...] we encourage the Office to consider the potential impacts that changes to copyright law could have on competition among AI developers. If copyright law changes are enacted such that the acquisition and use of training materials becomes more expensive or difficult, there is a risk that dominant firms with greater resources will become further entrenched while smaller companies, including nonprofit organizations, struggle to keep up with mounting development costs.

H

The Farewell of the MCDC

MCDC group photo 2024

Chosen by communities, selected by affiliates, and appointed by the WMF, the Movement Charter Drafting Committee (MCDC), a committee of 15 Wikimedians, first took on the job of drafting a Charter for the Wikimedia movement in November 2021.

There were multiple feedback rounds, a lot of conversations, more discussions and a final ratification vote where the community and affiliate support was overwhelming (albeit with a low turnout in both cases due to the voter eligibility criteria), but the WMF's Board of Trustees decided the draft was not good enough (not safe to try). As reported in the previous issue of The Signpost, the Foundation published three pilot projects to take the work forward.

In August 2024, the committee (which still included 11 people), shared their process and ratification reflections pre-Wikimania. Before dissolving on 30 August, they also published their recommendations for next steps, including a response to the three pilots proposed by the WMF.

Ciell, former MCDC member

Brief notes

WLM 2023 winner from Egypt, Giza Pyramids during "Forever is Now" exhibition by Mona Hassan Abo-Abda.



File:Hannah Clover at Wikimania 2024 (cropped).jpg
Ahmad Ali Karim
CC0 1.0
300
2024-09-04

What it's like to be Wikimedian of the Year

Contribute   —  
Share this
By Clovermoss
This is a photograph of me at the opening ceremony.
This is a photograph of me at the opening ceremony.

I attended my first Wikimania this year in Katowice, Poland. I thought about applying for a scholarship when the process was open, but ultimately decided against it. I figured that attending WikiConference North America was enough for one year; obviously, I changed my mind once I was chosen as the Wikimedian of the Year. I had never been outside of North America before this event, so this experience was a lot of firsts for me. If I had told younger me that my first trip to Europe would be in Poland, she would have been very confused.

In late May, I received an email telling me that I was one of the five people shortlisted for the award. I tried not to think about it too much: I didn't think I'd actually be the winner and that one of the other four editors would be chosen. I didn't consider my accomplishments to be even remotely comparable to those of Rosie Stephenson-Goodknight or Emily Temple-Wood, so why would it be me? I was told to expect a response within three weeks, but it ended up taking longer than that (apparently, there were unexpected challenges internally, and I was told it wasn't my fault). I found out that it was actually me on July 4, which gave me about a month to come to terms with my upcoming fame. I was excited for the most part, but I was also terrified; sometimes it felt like a countdown of doom, where my life would never be normal again.

August 6 – Tuesday

This was a pre-conference culture crawl day, so there were no sessions to attend. Katowice is six hours ahead of Niagara Falls, where I live, so I was also trying to recover from jet lag. I didn't really see much of the city other than getting a super secret tour of the venue and hanging out with some staff members in the attached café. We had some interesting conversations, though: I found out that the Wikimedia Foundation owns their data centres for privacy reasons, that this practice is incredibly expensive, and that it's unusual for tech companies to do this. A new data centre was recently built in Brazil, and this took a lot of work: you can read about it here. I was also told that the codebase for MediaWiki is incredibly old: as a result, this presents unique challenges and a lot of things are "hacks on top of hacks". I was encouraged to attend a session where this topic was featured, which can be watched here. Unfortunately, I did not manage to do so.

August 7 – Wednesday

I had breakfast in the hotel lobby and talked to New Zealand user Giantflightlessbirds, who told me about some interesting work he does as a Wikipedian at Large (an alternative name for a Wikipedian in Residence) in his home-country. I also talked to a few other Wikimedians... but did not get their usernames. Finally, I showed one young woman my knitting and we took a selfie together.

Sessions

Opening ceremony

Preparation for the opening ceremony started at 1 pm. I was one of two recipients who misunderstood that I was supposed to have lunch before meeting Jimmy Wales; luckily, Vermont saved the day by finding us meals and beverages. Apart from that, my introduction to Wales and the rest of the recipients went smoothly. We sat next to each other in one big circle and shared who we are and which category we were chosen for. Then, we rehearsed the ceremony itself.

After the rehearsal finished, I spent time with a bunch of friends behind a staircase (we had a table and it's way less gloomy than it sounds). Some plans were made for after the opening ceremony, because "it's not like any of us will have anything to do". It was incredibly difficult to keep a straight face and not give the secret away at that point. When we all sat down at a table in the room for the opening ceremony, at 5 pm, my heart was pounding, but I tried my best to remain calm and just act like everything was normal, and I think I did a good job acting the part. On the inside, I felt like I was experiencing something akin to an adrenaline rush: it's difficult to explain precisely what I was feeling, but it was incredibly intense. I was sitting next to Seddon, and he was determined to update all the award recipients as they were announced. However, he had no idea that I was going to be one of them, and his laptop died, so he switched to his tablet to edit through the app when my time came. It was oddly fitting, given that I'm known for mobile editing... The secret was out once Natalia started describing me; Seddon suddenly looked up from his tablet and literally blurted out, "It's you!" We shared a knowing look: sure enough, it was me. My name was announced, the lights that gave everyone a headache went crazy, and I forced myself to walk onto the stage.

I admit I have very limited experience with public speaking: I had never been on a stage before, and I had a thousand people watching me for the first time in my life. I could literally feel my legs shake, and I spent a lot of my mental effort just trying to stay still and not fall. I was told by a few people afterwards that I did look a little nervous, but the situation didn't look as dire as it felt. If you wish to watch it, you can do so here. In retrospect, I'd empty my pockets beforehand (my wallet and passport are bulky)... I would also have spoken more slowly, deliberately, and with less filler words. After the ceremony ended, I mingled with the other conference participants, because I'm a social butterfly. A bunch of people congratulated me and asked for a selfie, and one person even asked me to sign their copy of All the Knowledge in the World.

August 8 – Thursday

Sessions

"You mentioned you were very pro-student editing and how you think everyone should do it, right? Obviously, I'm cool with young people editing, because I'm 21 and if I was against that, I wouldn't be editing at all. But I think maybe there are more factors to consider than just seeing if some articles stay. From the newcomer's perspective, you don't want to be setting people up to fail. Then, from the community outreach perspective, [...] yes, people will clean up after the people who are doing things that they aren't supposed to be, but it kind of diminishes the volunteer morale a bit? [If] they're constantly flooded with content that they need to clean up, then it can be a bit of a vicious cycle where they're less welcoming to student editors. So, I was just wondering if you've ever considered that, and if you had any thoughts on how you might want to mitigate factors like that?"

"I think it's a very good argument that you're making, but there's two things that I wanted to add to that. First of all, editors are already flooded with bad quality edits. [I interrupted them to clarify that my concerns were related to the scale in which these issues can arise. Then they said:] I would still argue that the average quality of professor-supervised class editing will be higher than the average quality of a newcomer edit. Mainly because students have access to all those journal libraries and are, by design, probably the top 1% of knowledge-privileged people. By design, their edits will most likely not be horrible, although probably not great, either. Second, I think the problem you're raising is super important, that we do not discourage people by hanging them out to dry, go out and edit Wikipedia and of course, prepare them. I think you're very right that, first of all, we need to let people know what the rules are, maybe get them familiar with the format, but isn't that true of academic writing in general? You do not ask people to start writing journals."

"[How did] you [come] to the conclusion that there are less younger editors that are interested in contributing? I think I actually had a conversation with Selena about this briefly at WikiConference North America, [where] I talked a bit about how I know lots of people my age that edit. [Obviously,] anecdotal experience isn't everything, but I assume you have pretty good reasons for coming to that conclusion?"

"I think there's editing and there's readers, so I can talk about the editing piece of it. With editors, it's complex. There are things that have shifted over time, and I actually have this really promising report the Community Metrics team put together, that says we're starting to see a rise in younger editors overall. That doesn't [necessarily] translate to functionaries, but I don't have as good data on [them] overall. They're a crucial population of people that make the whole system work, so for me there's data that shows that younger editors are kind of turning in a different direction, and if you dig in and look at each region, you start to see different stories. So it's quite a complex picture. Overall, I would say I get a lot of feedback from the administrators, in particular, that they're just seeing their numbers drop, that we're not getting enough new people into that system, so those are the factors and data that I have about admins and I'm really interested in more."

Conversations

August 9 – Friday

Sessions

Conversations

August 10 – Saturday

Sessions

Conversations

August 11 – Sunday

I woke up early to check out of the hotel, because my shuttle back to the Katowice Airport would leave at 9 am. It was about a half-hour drive, and I had a fun time talking with several other editors on the bus.

When we arrived at the airport, I said an official goodbye to some editors, and we arranged a group photo where we all showed our passports. However, plenty of us didn't have flights for hours, so we organized an impromptu edit-a-thon in the airport café. I unpacked my backpack to show Kingoflettuce the books I had brought to the conference, and he did start reading one of these books: Jehovah's Witnesses: A New Introduction by George Chryssides. He got about halfway through it, and then we talked a bit about the lack of active editors in the topic area, and how I've been trying to reduce the reliance on primary sources; he told me what he knew about the group's history in Singapore. On a side note, Chlod said that he was going to try to nominate an article for good article status for the first time, so we all encouraged him to go for it!

Finally, I learned a little bit about how Malaysian names worked from Taufik Rosman, and he also told me about the work he does across projects. It was really cool to have an extended conversation with the previous Wikimedian of the Year!

Wikipedia:Wikipedia Signpost/2024-09-04/Op-ed Wikipedia:Wikipedia Signpost/2024-09-04/In focus Wikipedia:Wikipedia Signpost/2024-09-04/Arbitration report


File:Signpost column image for dumb reply.png
300
2024-09-04

Local man halfway through rude reply no longer able to recall why he hates other editor

Contribute   —  
Share this
By JPxG
Screenshot of an open reply-tool posting box.
Well, whatever he did, screw this guy.

VILLAGE PUMP — Lamenting his lack of diligence, longtime Wikipedia editor Hubert Glockenspiel, 42, told reporters that halfway through writing a response to a comment, he has completely forgotten why he hated the guy whose signature he recognized.

"Originally I had been planning to oppose whatever stupid proposal he was making, or support his siteban, or whatever," said Glockenspiel. "You know, on account of the fact that he's repeatedly demonstrated himself to be an arrogant incompetent moron, or an incorrigible POV warrior, or a disingenuous cheat who routinely misrepresents both sources and policy. But then I couldn't remember which of these things he was, or what he had done, or why. You know, now that I think of it, maybe he was one of those damn deletionists. Or worse, one of those damn anti-deletionists."

Glockenspiel's attempts to jog his memory proved fruitless, as neither the guy's userpage nor the guy's top hundred or so contributions turned up anything obvious. Even external tools were no help; an Xtools list of his most-edited pages, a Startist list of all the discussion threads he had opened, and an afdstats analysis of his deletion votes all depicted a completely normal editor with no visible agenda or obsession.

"I cannot for the life of me remember why I hate this guy," Glockenspiel said. "I can't open a proposal for a siteban, because someone might ask me to give actual evidence, but I'm sure as heck going to support it if someone else does." He added that he had consulted WP:CONFUSED to make sure he hadn't mixed him up with anyone else having a similar name.

Despite a failure to recall anything about the circumstances that gave rise to his seething disdain, Glockenspiel reiterated a commitment to keep hating.

"Well, I don't decide to hate somebody's guts for no reason. It had to have been something."

At press time, Glockenspiel was trying to find the hard drive with his old IRC logs from during the Esperanza MfD, in the hopes that he might discover a long-forgotten flamewar.


If articles have been updated, you may need to refresh the single-page edition.



       

The Signpost · written by many · served by Sinepost V0.9 · 🄯 CC-BY-SA 4.0