Research Dataset
Continuously updated from 7 international fact-checking organizations. Structured, normalized, and encrypted at rest — built for researchers studying misinformation, not for moderation pipelines.
Request AccessOverview
This dataset aggregates fact-check records published by leading international fact-checking organizations. Each record captures the original claim, the organization's verdict, a normalized verdict label, publication metadata, and a content hash for deduplication. Ingestion runs daily via automated RSS and Atom feed parsing, with each record archived in its original form alongside normalized fields.
The dataset is designed for researchers studying misinformation patterns, claim lifecycles, cross-source verdict consistency, and the temporal dynamics of false information. It is not intended as a moderation tool or real-time decision system. Records reflect the judgments of the source organizations and are preserved as-is to support comparative and longitudinal analysis.
Coverage spans seven organizations across multiple geographies and languages, including English-language U.S. sources (PolitiFact, FactCheck.org, Snopes), a viral misinformation tracker (LeadStories), UK-based Full Fact, Africa Check covering sub-Saharan Africa, and AFP Fact Check with multilingual international reach.
Coverage
U.S. political fact-checking from the Poynter Institute. Rates claims on a six-point "Truth-O-Meter" scale ranging from True to Pants on Fire.
One of the oldest fact-checking and rumor-debunking publications. Covers viral claims, urban legends, and political misinformation.
Nonpartisan U.S. political fact-checking operated by the Annenberg Public Policy Center at the University of Pennsylvania.
Focuses on viral misinformation trending on social media platforms. Uses a real-time trending story detection methodology.
UK-based independent fact-checking charity. Covers claims from politicians, media outlets, and public discourse in the United Kingdom.
Africa's first fact-checking organization, covering claims across sub-Saharan Africa in English, French, and Portuguese.
Global fact-checking unit of Agence France-Presse. Covers claims in multiple languages across Europe, Asia, Africa, and the Americas.
Structure
Each record in the BigQuery table corresponds to a single fact-check article. Key fields are listed below. All content fields are nullable — individual feeds may not populate every attribute.
| Field | Type | Description |
|---|---|---|
| title | string | Headline or title of the fact-check article |
| claim | string | The specific claim being evaluated, when extractable from the feed |
| link | string | Canonical URL of the original fact-check article |
| verdict_raw | string | Original verdict label as published by the source organization |
| verdict_normalized | enum | Standardized verdict: true | false | misleading | unsupported | exaggerated |
| published_at_normalized | timestamp | Publication datetime normalized to UTC ISO 8601 |
| source.name | string | Name of the fact-checking organization |
| source.url | string | Base URL of the source organization |
| language | string | BCP-47 language code of the article (e.g., 'en', 'fr') |
| content_sha256 | string | SHA-256 hash of canonical content for deduplication across ingestion runs |
Example
A representative record as it appears in the dataset after normalization. Field values are drawn from a real AFP Fact Check article for illustrative purposes.
{
"title": "No, WHO did not declare a 'global health emergency' over a new mpox strain in January 2026",
"claim": "The WHO declared a global health emergency over a new mpox strain in January 2026.",
"link": "https://factcheck.afp.com/doc.afp.com.36UE3JE",
"verdict_raw": "False",
"verdict_normalized": "false",
"published_at_normalized": "2026-01-14T09:22:00Z",
"source": {
"name": "AFP Fact Check",
"url": "https://factcheck.afp.com"
},
"language": "en",
"content_sha256": "a3f9c2d1e4b8765432fedcba9876543210abcdef0123456789abcdef01234567"
}Security & Compliance
SAFE_PUBLIC for fully processed records, RESTRICTED for records pending manual review, and QUARANTINED for records flagged by automated quality checks. Shared dataset views surface only SAFE_PUBLIC records.Access
This dataset is available to academic researchers, journalists, and data scientists working on misinformation research, computational social science, or related fields. Access is granted on a case-by-case basis after a brief review of the intended use.
Please include the following in your request: