![]() | This is a draft of a potential Signpost article, and should not be interpreted as a finished piece. Its content is subject to review by the editorial team and ultimately by JPxG, the editor in chief. Please do not link to this draft as it is unfinished and the URL will change upon publication. If you would like to contribute and are familiar with the requirements of a Signpost article, feel free to be bold in making improvements!
|
Optional: write a lede — not necessarily a WP:LEAD. Interesting > encyclopedic.
Gizmodo [1] and The Verge [2] report that Wikimedia Enterprise and Google's Kaggle are supplying a dataset from Wikipedia formatted for AI companies in order to ease the burden from scraping bots on the WMF's IT infrastructure. Both outlets cite an announcement from Wikimedia Enterprise (a paid service operated by Wikimedia LLC, the Wikimedia Foundation's for-profit subsidiary) that in turn links to the download page on Kaggle. As of April 17 – the date of Gizmodo's report – it had recorded 186 downloads. Google's Blog also reports the news on the dataset.
An earlier version of the same dataset had been published on Hugging Face in September 2024. As summarized in the current Enterprise announcement, the dataset consists of structured Wikipedia content in English and French [...d]esigned with machine learning workflows in mind
, and includes high-utility elements such as abstracts, short descriptions, infobox-style key-value data, image links, and clearly segmented article sections
. It does not include the media files from Wikimedia Commons that the Foundation recently described as the primary target of problematic crawler activity (see last Signpost issue: "Op-ed: How crawlers impact the operations of the Wikimedia projects", "Opinion: Crawlers, hogs and gorillas"). According to an FAQ, Enterprise currently support[s] all text-based Wikimedia projects, but do[es] not currently support Wikidata (besides QIDs) or Wikimedia Commons.
– S, H
Sed quia non numquam eius modi tempora incidunt, ut labore et dolore magnam aliquam quaerat voluptatem. Ut enim ad minima veniam, quis nostrum exercitationem ullam corporis suscipit laboriosam, nisi ut aliquid ex ea commodi consequatur? Quis autem vel eum iure reprehenderit, qui in ea voluptate velit esse, quam nihil molestiae consequatur, vel illum, qui dolorem eum fugiat, quo voluptas nulla pariatur?
This page is a draft for the next issue of the Signpost. Below is some helpful code that will help you write and format a Signpost draft. If it's blank, you can fill out a template by copy-pasting this in and pressing 'publish changes': {{subst:Wikipedia:Wikipedia Signpost/Templates/Story-preload}}
Images and Galleries
|
---|
To put an image in your article, use the following template (link): This will create the file on the right. Keep the 300px in most cases. If writing a 'full width' article, change
Placing (link) will instead create an inline image like below The significant thing is feeling, as such, quite apart from the environment in which it is called forth.
To create a gallery, use the following Each line inside the tags should be formatted like
If you want it centered, remove t |
Quotes
| |||
---|---|---|---|
To insert a framed quote like the one on the right, use this template (link): If writing a 'full width' article, change
To insert a pull quote like
use this template (link):
To insert a long inline quote like
use this template (link): |
Side frames
|
---|
Side frames help put content in sidebar vignettes. For instance, this one (link): gives the frame on the right. This is useful when you want to insert non-standard images, quotes, graphs, and the like.
For example, to insert the {{Graph:Chart}} generated by in a frame, simple put the graph code in to get the framed Graph:Chart on the right. If writing a 'full width' article, change |
Two-column vs full width styles
|
---|
If you keep the 'normal' preloaded draft and work from there, you will be using the two-column style. This is perfectly fine in most cases and you don't need to do anything. However, every time you have a However, you can also fine-tune which style is used at which point in an article. To switch from two-column → full width style midway in an article, insert where you want the switch to happen. To switch from full width → two-column style midway in an article, insert where you want the switch to happen. |
Article series
|
---|
To add a series of 'related articles' your article, use the following code or will create the sidebar on the right. If writing a 'full width' article, change Alternatively, you can use at the end of an article to create
If you think a topic would make a good series, but you don't see a tag for it, or that all the articles in a series seem 'old', ask for help at the WT:NEWSROOM. Many more tags exist, but they haven't been documented yet. |
Links and such
|
---|
By the way, the template that you're reading right now is {{Editnotices/Group/Wikipedia:Wikipedia Signpost/Next issue}} (edit). A list of the preload templates for Signpost articles can be found here. |
Discuss this story