data:image/s3,"s3://crabby-images/b62c9/b62c9351ee459a36b6312002e43a5f2eaa5e9cc1" alt=""
Our APIs include 100M+ pages across 850+ Wikimedia projects and languages
Wikimedia Enterprise APIs
Snapshot API
Retrieve entire Wikimedia projects as a database dump file.
- Download a compressed file containing everything in any project, in any language
- Article body in HTML as well as Wikitext
- Free: New snapshots every two weeks
- Upgrade: Daily snapshots cadence
- Upgrade: Beta Structured Contents Snapshot with parsed infoboxes and article sections
On-demand API
Retrieve single articles from any Wikimedia project at anytime.
- Make standard HTTP requests to retrieve documents by ID or name from all projects and languages, or use filters to limit response
- Request the latest page data anytime to augment your Realtime or Snapshot API data
- Free: 5,000 requests monthly
- Free: Structured Contents beta endpoint with parsed infobox and full article sections
Realtime API
Stream updates in real-time from any Wikimedia project.
- Streaming: Receive streaming updates (firehose) of every change as they occur in real-time
- Batch: Download compressed snapshot files of incremental updates every hour
- Instant updates for new content, any edits, deletions, and breaking news events including community-curated visibility changes
- Contact sales for pricing
Payload Examples: Production response. Beta Structured Contents reponse.
Gain instant access to Snapshot API and On-demand API with a free account.
Wikimedia Enterprise API Payload
All of our APIs return the same structured JSON (or ND-JSON) response format making it easy to augment one API with another.
Three APIs; same data, different retrieval methods.
_ Retrieve bulk data with the Snapshot API
_ Receive changes instantly with Realtime API streaming
_ Retrieve single articles with the On-demand API
API responses include article data such as summary, image, Wikidata QID, license, and more. Also included is data specific to the last revision, such as editor, size of change, and credibility score with revert probability.
Example: Run this cURL command with your access token (see auth docs) to grab the live english squirrel wikipedia article seen here →
curl --location 'https://api.enterprise.wikimedia.com/v2/articles/Squirrel' --header 'Content-Type: application/json' --header 'Authorization: Bearer ACCESS_TOKEN' --data '{"filters":[{"field":"is_part_of.identifier","value":"enwiki"}]}'
For a full breakdown and explanation of the data schema in responses see our Data Dictionary.
More questions? – We’re here to help.
{
"name": "Squirrel",
"identifier": 28492,
"abstract": "...summary of article...",
"date_created": "2001-10-09T17:55:09Z",
"date_modified": "2025-02-05T03:04:17Z",
"date_previously_modified": "2024-12-21T06:38:32Z",
"version": {
"identifier": 1274023344,
"comment": "...version edit comment...",
"tags": [...],
"is_minor_edit": true,
"has_tag_needs_citation": true,
"scores": {
"revertrisk": {
"probability": {...}
}
},
"editor": {...},
"number_of_characters": 33076,
"size": {
"value": 33188,
"unit_text": "B"
},
"event": {...}
},
"previous_version": {...},
"url": "https://en.wikipedia.org/wiki/Squirrel",
"watchers_count": 363,
"image": {
"content_url": "https://upload.wikimedia.org/wikipedia/commons/6/68/Sciuridae.jpg",
"width": 600,
"height": 600
},
"article_body": {
"html": "...html of entire page...",
"wikitext": "...wikitext of entire page..."
},
"namespace": {
"identifier": 0
},
"in_language": {
"identifier": "en"
},
"main_entity": {
"identifier": "Q9482",
"url": "https://www.wikidata.org/entity/Q9482"
},
"protection": [...],
"additional_entities": [...],
"categories": [...],
"templates": [...],
"redirects": [...],
"is_part_of": {...},
"license": [
{
"name": "Creative Commons Attribution-ShareAlike License 4.0",
"identifier": "CC-BY-SA-4.0",
"url": "https://creativecommons.org/licenses/by-sa/4.0/"
}
],
}
Structured Contents BETA Endpoints Payload
Our Structured Contents endpoints have the same familiar structure as our production responses but include beta fields and objects we’re working to parse out of articles. Parsed objects that are unique to Structured Contents are: infobox, sections, description (with more coming). See the beta section of our data dictionary for details.
On-demand API structured contents endpoint is freely available whereas the bulk Snapshot API Structured Contents dumps is available on request.
Example: Run this cURL command with your access token (see auth docs) to get structured contents response from the live english squirrel wikipedia article seen here →
curl --location 'https://api.enterprise.wikimedia.com/v2/structured-contents/Squirrel' --header 'Content-Type: application/json' --header 'Authorization: Bearer ACCESS_TOKEN' --data '{"filters":[{"field":"is_part_of.identifier","value":"enwiki"}]}'
For a full breakdown and explanation of the structured contents data schema in responses see our Data Dictionary: Beta section.
More questions? – We’re here to help.
{
"name": "Squirrel",
"identifier": 28492,
"abstract": "...summary of article...",
"version": {
"identifier": 1274023344,
"comment": "...version edit comment...",
"scores": {
"revertrisk": {
"probability": {...}
}
},
"editor": {...},
"number_of_characters": 33076,
"size": {...},
"maintenance_tags": {}
},
"url": "https://en.wikipedia.org/wiki/Squirrel",
"date_created": "2001-10-09T17:55:09Z",
"date_modified": "2025-02-05T03:04:17Z",
"main_entity": {...},
"is_part_of": {...},
"additional_entities": [...],
"in_language": {...},
"image": {...},
"license": [...],
"description": "...short description...",
"infobox": [
{
"name": "Pp-semi-indef",
"type": "infobox",
"has_parts": [
{
"type": "section",
"has_parts": [
{
"type": "field",
"value": "Squirrels Temporal range",
"values": [...]
},
{
"type": "image",
"values": [...],
"images": [
{
"content_url": "...image",
"height": 250,
"width": 250
}
]
},
{
"type": "field",
"value": "Various members of the family Sciuridae",
"values": [...],
"links": [...]
},
{
"type": "field",
"value": "Scientific classification",
"values": [...],
"images": [...],
"links": [...]
}
]
}
]
}
],
"sections": [
{
"name": "Abstract",
"type": "section",
"has_parts": [
{
"type": "paragraph",
"value": "...section paragraph...",
"links": [...]
}
]
},
{
"name": "Etymology",
"type": "section",
"has_parts": [
{
"type": "paragraph",
"value": "...section paragraph...",
"links": [...]
},
{
"type": "paragraph",
"value": "...section paragraph...",
"links": [...]
}
]
}
]
}
SLA and Support
Contract accounts receive 99% SLA, support response-time guarantees, and feedback collaboration.
All accounts have access to our help center FAQs.
Open Licensing
Benefit from open-licensed, royalty-free data. Every response includes license information in the payload.