Facebook's Galactica demo provides a case study in large language models for text generation at scale: this one was silly, but we cannot ignore them forever.
citation templates now flag open access content, proposed best practices for communication and community involvement, and an improvement to Wikipedia's citation infrastructure
HTTPS-only rollout completed; proposal to enable VisualEditor for new accounts: The rollout of HTTPS only has now been completed across all Wikimedia wikis.
...allegedly. In a post to wikitech-l, Steven Walling pointed out that the TV show CSI: Cyber had used a screenshot of MediaWiki's HTML output and claimed it was responsible for blowing up printers.
Last month, I wrote an open letter to the Wikimedia Foundation, inviting others to join me in a simple but important request: roll back the recent actions—both technical and social—by which the Wikimedia Foundation has overruled legitimate decisions of several Wikimedia projects.
As the start of Wikimania proper on 8 August approaches, the Signpost looks ahead to what its dozens of presentations might offer the technologically-inclined, whether attending in person or taking advantage of what promises to be a strong digital offering.
In the early hours of Tuesday morning, Wikimedia Deutschland's Toolserver project was switched off, marking the end of one of the Wikimedia movement's longest running Chapter-led projects. The Toolserver, which was in fact a collection of servers, first came online in 2005, hosting hundreds of webpages and scripts ("tools") made available for use by Wikimedia readers, editors and administrators.
As you have probably read on this weeks op-ed, or via various other channels of announcement, 3 April will see the introduction of the Typography refresh (or update) for the Vector skin on all Wikipedias. Other projects like Commons will have this update rolled out a few days prior.
This week we're interviewing Brion Vibber about the then-upcoming Architecture Summit. Brion is a long time Wikipedian, the first employee of the Wikimedia Foundation, and currently the lead software architect working with the mobile team.
The proposed schedule for the MediaWiki Archicture Summit has been published. The two main plenary sessions will be about HTML templating, and Service-oriented architecture.
OAuth: future of user designed tools: Last month, the OAuth extension was deployed to all Wikimedia wikis. OAuth is a standard used for allowing users to authenticate third-party applications, also known as consumers, to take actions on their behalf.
This week, the GLAMWikiToolset, or GWToolset, is being deployed to the Wikimedia Commons. It allows for GLAM organizations to batch upload content based on various metadata stored in an XML schema. In the past this has been done by various bots, but now it will be easier for GLAMs to do it directly.
On 6 December, the latest version of the MediaWiki software was released. In development from March 2013 through October 2013, the release featured anti-spam and counter-vandalism improvements.
In this week's "Technology report", we look at how the growth of Wikidata can benefit Wikipedia. Gerard Meijssen is a highly active contributor and frequent blogger about Wikidata. We asked him to share his thoughts on how the new project benefits Wikipedia.
In this week's "Technology report", we explore ways of making Wikipedia more accessible to users of screen readers. Graham87 is a highly active contributor who is also blind and accesses the site through a screen reader.
Wikipedia's traditional image gallery format, produced by the markup, has remained largely unchanged for years. The resulting layout, seen below, does not adapt well to variations in image size, and has been characterized by some critics as aesthetically unappealing.
The VisualEditor extension has gone live by default to registered users on the English Wikipedia, marking a huge milestone in a project that has taken the best part of a decade to reach fruition. The extension was previously described as "the biggest and most important change to our user experience we’ve ever undertaken" by the WMF team behind it.
May engineering report published: The WMF's engineering report for May was published recently on the Wikimedia blog and on the MediaWiki wiki ("friendly" summary version), giving an overview of all Foundation-sponsored technical operations in that month.
Developers accused of making Toolserver fight 'pointless': Last week, the Signpost reported on a feeling at the Amsterdam hackathon that Toolserver developers were coming round to the idea of migrating to Wikimedia Labs.
Second only to the technical track of Wikimania in terms of numbers, the Berlin Hackathon (2009–2012) provided those with an interest in the software that underpins Wikimedia wikis and supports its editors a place to gather, exchange ideas and learn new skills.
The Wikimedia Foundation will be receiving more than $100,000 worth of free developer time courtesy of internet giant Google, it was announced this week. The funds, allocated as part of Google's Summer of Code programme, will support up to 21 student developers through three months of coding time.
On Monday, the English Wikipedia became the 12th wiki to be able to pull data from the central Wikidata.org repository, with other wikis scheduled to receive the update on Wednesday.
Testing week: The deployment of phase 2 of Wikidata to the English Wikipedia, originally scheduled for 8 April but delayed due to technical problems, may be rescheduled again as the result of community resistance.
Since its inception in May 2011, the Foundation's Visual Editor project has grown to become one of its main focuses. As the project nears its two-year birthday, the Signpost caught up with Visual Editor project manager James Forrester to discuss the progress on the project.
Visual Editor "on schedule": The WMF's engineering report for January was published this week, giving an overview of all Foundation-sponsored technical operations in that month.
Article Feedback reversal: The WMF has aborted a plan to deploy version 5 of the Article Feedback tool (AFTv5) rolled out to all English Wikipedia articles.
As of time of writing, twenty wikis (including the English, French and Hungarian Wikipedias) are in the process of getting access to the Lua scripting language, an optional substitute for the clunky template code that exists at present.
Following the deployment of the Wikidata client to the Hungarian Wikipedia last month, the client was also deployed to the Italian and Hebrew Wikipedias on Wednesday. The next target for the client, which automatically provides phase 1 functionality, is the English Wikipedia, with a deployment date of 11 February already set.
As reported in last week's "Technology Report", the WMF's data centre in Ashburn, Virginia took over responsibility for almost all of the remaining functions that had previously been handled by their old facility in Tampa, Florida on 22 January. The Signpost reported then that few problems had arisen since handover. Unfortunately that was not to remain the case, with reports of caching problems (which typically only affect anonymous users) starting to come in.
Data centre switchover a tentative success: On 22 January, WMF staff and contractors switched incoming, non-cached requests (including edits) to the Foundation's newer data centre in Ashburn, Virginia, making it responsible for handling almost all regular traffic. For the first time since 2004, virtually no traffic will be handled by the WMF's other facility in Tampa, Florida.
The Wikidata client extension was successfully deployed to the Hungarian Wikipedia on 14 January, its team reports. The interwiki language links can now come from wikidata.org, though "manual" interwiki links remain functional, overriding those from the central repository.
Following on from last week's reflections on 2012, this week the Technology report looks ahead to 2013, a year that will almost certainly be dominated by the juggernauts of Wikidata, Lua and the Visual Editor.
In the first of two features, the Signpost this week looks back on 2012, a year when developers finally made inroads into three issues that had been put off for far too long (the need for editors to learn wiki-markup, the lack of a proper template language and the centralisation of data) but left all three projects far from finished.
Our tool- and bot-hosting servers are currently operated by Wikimedia Germany, with assistance from the Foundation and volunteers — but they've failed to see eye-to-eye on the trajectory for the Toolserver, scheduled to be replaced by Wikimedia Labs in late 2013.
MediaWiki users (including Wikimedians) can now organise themselves into groups, receiving recognition and support-in-kind from the Wikimedia Foundation. The project, backed by new Wikimedia technical contributor coordinator Quim Gil, has seen five proposals lodged in its first week of operation. The idea of MediaWiki groups mimics that of Wikimedia User Groups.
Deployments of MediaWiki 1.21wmf5 cause widespread problems for users across wikis when HTML and CSS updates came temporarily out of sync. On the first wikis targeted for deployment, this was caused by the different cache invalidation rates for HTML (typically one month) and CSS (typically five minutes). The retrospective on the problem highlighted the fact that that the test wiki – the WMF's answer to a production environment that individual developers can no longer practically emulate themselves – actually demonstrated the exact problem that would later manifest itself on production wikis. It went unnoticed.
Wikidata, the new "Wikimedia Commons for data" and the first new Wikimedia project since 2006, reached 100,000 entries this week. The project aims to be a single, human- and machine-readable database for common data, spanning across all Wikipedia projects, which will "lead to a higher consistency and quality within Wikipedia articles, as well as increased availability of information in the smaller language editions" while lowering the burden on Wikipedia's volunteer editors—whose numbers have stalled overall, and continue to dwindle on the English Wikipedia.
WMF Executive Director Sue Gardner was forced to clarify this week that proposed structural changes to the Foundation's Engineering and Product Development Department were not a "done deal" and that it was "important that you [particularly affected staff] realise that ... your input is wanted". The reorganisation, announced on November 5 and planned for the middle of next year, will see its two components split off into their own departments.
In late September, the Technology report published its findings about (particularly median) code review times. To the 23,900 changesets analysed the first time (the data for which has been updated), the Signpost added data from the 9,000 or so changesets contributed between September 17 and November 9 to a total of 93,000 reviews across 45,000 patchsets. Bots and self-reviews were also discarded, but reviews made by a different user in the form of a superseding patch were retained. Finally, users were categorised by hand according to whether they would be best regarded as staff or volunteers. The new analyses were consistent with the predictions of the previous analysis.
The Wikimedia Foundation's engineering report for October 2012 was published this week on the Wikimedia Techblog and on the MediaWiki wiki, giving an overview of all Foundation-sponsored technical operations in that month. TimedMediaHandler also went live.
The TimedMediaHandler extension (TMH), which brings dramatic improvements to MediaWiki's video handling capabilities, will go live to the English Wikipedia this week following a long and turbulent development, WMF Director of Platform Engineering Rob Lanphier announced on Monday ... Wikidata.org, a new repository designed to host interwiki links, launched this week and will begin accepting links shortly. The site, which is one half of the forthcoming Wikidata trial (the other half being the Wikidata client, which will be deployed to the Hungarian Wikipedia shortly) will also act as a testing area for phase 2 of Wikidata (centralised data storage). The longer term plan is for Wikidata.org to become a "Wikimedia Commons for data" as phases 2 and 3 (dynamic lists) are developed, project managers say.
Planning for Wikivoyage's migration into the WMF fold built up steam this week following a statement by WMF Deputy Director Erik Möller about what the technical side of the migration will involve. Wikivoyage, which split from sister site Wikitravel in 2006, is hoping to migrate its own not-inconsiderable user base to Wikimedia, as well as much of its content, presenting novel challenges for Wikimedia developers
Wikidata is a go: well, almost: A trial of the first phase of Wikimedia Deutschland's "Wikidata" project–implementing the first ever interwiki repository—may soon get underway following the successful passage of much of its code through MediaWiki's review processes this week.
The Wikimedia Foundation's engineering report for September 2012 was published this week on the Wikimedia Techblog and on the MediaWiki wiki, giving an overview of all Foundation-sponsored technical operations in that month (as well as brief coverage of progress on Wikimedia Deutschland's Wikidata project, phase 1 of which is edging its way towards its first deployment). Three of the seven headline items in the report have already been covered in the Signpost: problems with the corruption of several Gerrit (code) repositories, the introduction of widespread translation memory across Wikimedia wikis, and the launch of the "Page Curation" tool on the English Wikipedia, with development work on that project now winding down. The report also drew attention to the end of Google Summer of Code 2012, the deployment to the English Wikipedia of a new ePUB (electronic book) export feature, and improvements to the WLM app aimed at more serious photographers.
The Toolserver is an external service hosting the hundreds of webpages and scripts (collectively known as "tools") that assist Wikimedia communities in dozens of mostly menial tasks. Few people think that it has been operating well recently; the problems, which include high database replication lag and periods of total downtime, have caused considerable disruption to the Toolserver's usual functions. Those functions are highly valued by many Wikimedia communities ... In 2011, the Foundation announced the creation of Wikimedia Labs, a much better funded project that among other things aimed to mimic the Toolserver's functionality by mid-2013. At the same time, Erik Möller, the WMF's director of engineering, announced that the Foundation would no longer be supporting the Toolserver financially, but would continue to provide the same in-kind support as it had done previously.
Late last month, the "Technology report" included a story using code review backlog figures – the only code review figures then available – to construct a rough narrative about the average experience of code contributors. This week, we hope to go one better, by looking directly at code review wait times, and, in particular, median code review times
1.20wmf12, the 12th release to Wikimedia wikis from the 1.20 branch, was deployed to its first wikis on September 17; if things go well, it will be deployed to all wikis by September 26. Its 200 or so changes – 111 to WMF-deployed extensions plus 98 to core MediaWiki code – include support for links with mixed-case protocols (e.g. Http://example.com) and the removal of the "No higher resolution available" message on the file description pages of SVG images.
The Wikimedia Foundation's engineering report for August 2012 was published this week on the Wikimedia Techblog and on the MediaWiki wiki, giving an overview of all Foundation-sponsored technical operations in that month (as well as brief coverage of progress on Wikimedia Deutschland's Wikidata project, phase 1 of which is edging its way towards its first deployment).
Developers are currently discussing the possibility of a MediaWiki Foundation to oversee those aspects of MediaWiki development that relate to non-Wikimedia wikis. The proposal was generated after a discussion on the wikitech-l mailing list about generalising Wikimedia's CentralAuth system.
Developers were left one step closer to an understanding of the code review outlook this week after the creation of a graph plotting "number changesets awaiting review" over time.
New embeddable scripting ("template replacement") language Lua received considerable scrutiny this week when it began its long road to widespread deployment, landing on the test2wiki test site on Wednesday (wikitech-l mailing list). ... the fourth in our series profiling participants in this year's Google Summer of Code (GSoC) programme.
The Wikimedia Foundation sometimes proposes new features that receive substantive criticism from Wikimedians, yet those criticisms may be dismissed on the basis that people are resistant to change—there's an unjustified view that the wikis have been overrun by vested contributors who hate all change. That view misses a lot of key details and insight because there are good reasons that Wikimedians are suspicious of features development, given past and present development of bad software, growing ties with the problematic Wikia, and a growing belief that it is acceptable to experiment on users.
Three weeks into a month-long evaluation of code review tool Gerrit, a serious alternative has finally gained traction in the review process: Facebook-developed but now independently operated Phabricator and its sister command-line tool Arcanist.
Wikidata nears first deployment but wikis go down in fibre cut calamity: The Wikimedia Foundation's engineering report for July 2012 was published this week on the Wikimedia Techblog and on the MediaWiki wiki, giving an overview of all Foundation-sponsored technical operations in that month (as well as brief coverage of progress on Wikimedia Deutschland's Wikidata project). ... At least one fibre-optic cable was damaged at the WMF's Tampa site on August 6, leading to a sharp downwards spike in traffic lasting over an hour and almost three hours of disruption for readers around the globe.
In the light of recent questions over the long-term reliability of Wikimedia wikis, the Signpost caught up with CT Woo, the Wikimedia Foundation's director of technical operations.
As Wikimania, the annual conference targeted at Wikimedians and often well attended by those with a technical slant, draws to a close, comments have already begun to come in from attendees regarding the many tech-related features of the conference.
The results from last month's trial of the LastModified extension were published this week on the Wikimedia blog. The first analyses have indicated a significant positive impact, suggesting that the extension – which makes the time since a page's last edit much more prominent in the interface – could eventually find its way onto Wikimedia wikis.