Claims Validation

Scope: This page defines the ideal-state specification for a CSA fact-checking module to be developed. It establishes the required behavior of the module, the editorial workflow for acting on its output, and the infrastructure needed to support it. Sections marked Pending identify capabilities not yet confirmed as implemented. Everything else represents the target standard the module should meet and that editors should apply once it is live.

⬇ Download this section


Content Pipeline Tiers

Not all content in the CSA pipeline requires the same level of fact-checking oversight. The tier a piece belongs to determines whether and how the Claims Validation module applies.

Tier names are provisional—naming conventions are not yet finalized.

Tier Description Raw Module Output in CSA UI Claims Validation
High Touch (HITL) Human-in-the-loop—CSA-generated drafts requiring full human editorial review Yes Applies in full. All verdicts require editor action per this page.
Semi-Automated Everything that is not High Touch or Fully Automated Yes Applies selectively—editor judgment determines which verdicts require action based on content type and risk.
Fully Automated Structured data output with no significant editorial judgment involved—e.g., game scores, weather reports, United Robots automated inbound content No No Claims Validation pass required.

The rules documented on the rest of this page apply in full to High Touch (HITL) content. Semi-Automated content—everything that is not HITL or Fully Automated—applies the same rules with editor judgment on verdict prioritization. Fully Automated content does not surface module output and does not require a Claims Validation pass.

Role-level access: Which specific roles can view raw module output in the CSA UI is pending confirmation with engineering leadership and product leads.


What the Module Produces

The CSA fact-checking module runs automatically as part of the content validation step. After a draft is generated, the module evaluates factual claims against its knowledge base and returns a structured validation output for each flagged claim. The following defines the required output format.

Each flag carries one of five verdicts:

Verdict Meaning
TRUE Claim is accurate and verifiable
FALSE Claim is factually incorrect
MISLEADING Claim is technically true but framed in a way that distorts meaning
INSUFFICIENT_EVIDENCE Claim cannot be verified from available sources
OVERGENERALIZED Claim may be true in some contexts but is stated too broadly

Confidence Level

Pending module capability. The module should return a confidence level alongside each verdict. This field is not yet confirmed as available. The escalation logic below includes confidence-level handling and will apply once the field is live.

When present, the confidence level indicates how certain the module is in its verdict. Treat it as an escalation signal:

Confidence Meaning Editorial Action
High Module is confident in the verdict Apply the verdict’s standard action
Medium Module has moderate certainty Apply standard action; note in draft
Low Module has limited basis for the verdict Treat as INSUFFICIENT_EVIDENCE regardless of the stated verdict; verify independently

Editorial Action Taxonomy

I. Needs Correction

Factually wrong, demonstrably misleading, or unsupported by a verifiable source. Apply when the CSA validation output returns:

II. Needs Clarification

Nuanced, partially true, or contextually dependent; requires rewording rather than removal. Apply when the CSA validation output returns:

TRUE flags require no action. They are informational and confirm the claim passed validation.


Source Authority Tiers

The module weights source authority when evaluating claims. Editors should apply the same weighting when reviewing flagged output. The specific approved outlets within each tier are documented in Acceptable Sources §5.

Tier 1—Authoritative (accept)

Tier 2—Acceptable with verification

Tier 3—Flag for review

If the module returns a MISLEADING or FALSE verdict and the original source is Tier 3, treat it as Needs Correction regardless of the specific verdict.

Source Count Requirements

The number of Tier 1 sources required to verify a claim depends on claim type.

One Tier 1 source is sufficient for:

Two independent Tier 1 sources are required for:

A primary source document is required (mandatory regardless of count—news coverage of the document does not qualify) for:


Content-Type Specific Rules

Some content types carry elevated risk and require stricter treatment of module output.

Health and Medical

Financial and Economic

Real Estate and Local Services

Travel and Scheduling

Entertainment and Celebrity


Escalation Logic

Situation Action
1–2 FALSE or MISLEADING flags Editor resolves; documents change in draft notes
3+ flags in a single piece Escalate to senior editor before publish; consider full manual fact-check
Any health or legal claim flagged FALSE Hard stop; senior editor must sign off on resolution before piece moves to CMS
Flags the editor cannot resolve Do not publish; bring to direct editor with specific claim and verdict noted
Same flag type recurs across multiple runs of the same content type Report as pattern bug; stop using the module for that content type pending acknowledgment
(Pending) Any verdict returned with low confidence Treat as INSUFFICIENT_EVIDENCE regardless of verdict; verify independently before publish
(Pending) 3+ low-confidence verdicts in a single piece Escalate to senior editor; consider full manual fact-check

Audit Trail

Implementation required. Validation output must be stored per-piece in the CMS—not only visible at generation time. This is a stated requirement; confirm implementation with the CSA product/dev team.

The audit trail for each piece must include:

This record must be:

The audit trail is the authoritative record for editorial decisions and pattern analysis. It is what makes override documentation, escalation history, and recurring-bug identification possible across time.


Override Documentation

When an editor overrides a module verdict—publishing a claim the module flagged, or removing one it verified—note:

This documentation travels with the piece through the review process. It is not punitive. Editors who consistently out-perform the module’s flags are providing signal that improves it.

Override documentation also serves as the source for aggregate pattern tracking. Across pieces, the record should capture:

This aggregate view is what enables pattern-bug identification, module improvement, and editorial QA over time. It requires the per-piece audit trail to be stored in the CMS (see Audit Trail).


Acceptance Criteria for Test Article Review

When walking the 15 test articles, apply each of the following: