<!-- AGENT-AUDIENCE: human-only -->

# Claims Validation

> **Scope:** This page defines the ideal-state specification for a CSA fact-checking module to be developed. It establishes the required behavior of the module, the editorial workflow for acting on its output, and the infrastructure needed to support it. Sections marked **Pending** identify capabilities not yet confirmed as implemented. Everything else represents the target standard the module should meet and that editors should apply once it is live.


## Content Pipeline Tiers

Not all content in the CSA pipeline requires the same level of fact-checking oversight. The tier a piece belongs to determines whether and how the Claims Validation module applies.

> **Tier names are provisional**—naming conventions are not yet finalized.

| Tier | Description | Raw Module Output in CSA UI | Claims Validation |
|---|---|---|---|
| **High Touch (HITL)** | Human-in-the-loop—CSA-generated drafts requiring full human editorial review | Yes | Applies in full. All verdicts require editor action per this page. |
| **Semi-Automated** | Everything that is not High Touch or Fully Automated | Yes | Applies selectively—editor judgment determines which verdicts require action based on content type and risk. |
| **Fully Automated** | Structured data output with no significant editorial judgment involved—e.g., game scores, weather reports, United Robots automated inbound content | No | No Claims Validation pass required. |

The rules documented on the rest of this page apply in full to **High Touch (HITL)** content. **Semi-Automated** content—everything that is not HITL or Fully Automated—applies the same rules with editor judgment on verdict prioritization. **Fully Automated** content does not surface module output and does not require a Claims Validation pass.

> **Role-level access:** Which specific roles can view raw module output in the CSA UI is pending confirmation with engineering leadership and product leads.

---

## What the Module Produces

The CSA fact-checking module runs automatically as part of the content validation step. After a draft is generated, the module evaluates factual claims against its knowledge base and returns a structured validation output for each flagged claim. The following defines the required output format.

Each flag carries one of five verdicts:

| Verdict | Meaning |
|---|---|
| `TRUE` | Claim is accurate and verifiable |
| `FALSE` | Claim is factually incorrect |
| `MISLEADING` | Claim is technically true but framed in a way that distorts meaning |
| `INSUFFICIENT_EVIDENCE` | Claim cannot be verified from available sources |
| `OVERGENERALIZED` | Claim may be true in some contexts but is stated too broadly |

### Confidence Level

> **Pending module capability.** The module should return a confidence level alongside each verdict. This field is not yet confirmed as available. The escalation logic below includes confidence-level handling and will apply once the field is live.

When present, the confidence level indicates how certain the module is in its verdict. Treat it as an escalation signal:

| Confidence | Meaning | Editorial Action |
|---|---|---|
| High | Module is confident in the verdict | Apply the verdict's standard action |
| Medium | Module has moderate certainty | Apply standard action; note in draft |
| Low | Module has limited basis for the verdict | Treat as `INSUFFICIENT_EVIDENCE` regardless of the stated verdict; verify independently |

---

## Editorial Action Taxonomy

### I. Needs Correction

Factually wrong, demonstrably misleading, or unsupported by a verifiable source. Apply when the CSA validation output returns:

- **`FALSE`**—rewrite or remove the claim entirely before peer review
- **`MISLEADING`**—reframe the claim; document what the module flagged and what you changed

### II. Needs Clarification

Nuanced, partially true, or contextually dependent; requires rewording rather than removal. Apply when the CSA validation output returns:

- **`OVERGENERALIZED`**—narrow the scope or add qualifying language
- **`INSUFFICIENT_EVIDENCE`**—add a verified source, or remove the claim if no primary source can be found

> **`TRUE` flags require no action.** They are informational and confirm the claim passed validation.

---

## Source Authority Tiers

The module weights source authority when evaluating claims. Editors should apply the same weighting when reviewing flagged output. The specific approved outlets within each tier are documented in [Acceptable Sources §5]({{ "/docs/acceptable-sources" | relative_url }}).

### Tier 1—Authoritative (accept)

- Government and institutional primary sources (.gov, .edu, established regulatory bodies)
- Peer-reviewed publication records
- Official statements from named public figures on verified channels
- Established national news organizations (AP, Reuters, BBC, major metros)
- McClatchy publications

### Tier 2—Acceptable with verification

- Trade and industry publications with editorial standards
- Official company/brand statements via verified press releases
- Court records, police reports, legal filings

### Tier 3—Flag for review

- Aggregator blogs, content farms, unverified third-party roundups
- Anonymous sourcing without corroboration
- Social media posts, even from verified accounts (unless the post itself is the news)

> If the module returns a `MISLEADING` or `FALSE` verdict and the original source is Tier 3, treat it as **Needs Correction** regardless of the specific verdict.

### Source Count Requirements

The number of Tier 1 sources required to verify a claim depends on claim type.

**One Tier 1 source is sufficient for:**
- Straightforward factual claims—dates, confirmed events, official records
- Statistics drawn directly from a government database or institutional publication
- Direct quotes from named public figures via verified channels

**Two independent Tier 1 sources are required for:**
- Contested or sensitive claims where sources could reasonably disagree
- Allegations, legal proceedings, or anything that could harm a named individual
- Health claims not covered by a clear primary source document

**A primary source document is required (mandatory regardless of count—news coverage of the document does not qualify) for:**
- Court outcomes, legal status, and regulatory figures
- Health dosage, drug interaction, and treatment efficacy claims
- Financial statistics cited as authoritative fact

---

## Content-Type Specific Rules

Some content types carry elevated risk and require stricter treatment of module output.

### Health and Medical

- Any flagged health claim is a **hard stop**. Do not publish until the claim is either removed or confirmed against a Tier 1 source.
- `OVERGENERALIZED` flags on health content should be treated as **Needs Correction**, not Needs Clarification.
- Dosage, drug interaction, and treatment efficacy claims require a **primary source document**—two independent Tier 1 sources if no primary source document exists—even if the module returns `TRUE`.

### Legal and Regulatory

- Court outcomes, legal status, and regulatory figures require a **primary source document** (filings, official dockets, agency databases)—news coverage of those documents does not qualify.
- Stale regulatory figures are a known failure mode. If a claim involves a figure with a defined annual update cycle (tax rates, benefit amounts, sentencing guidelines), verify the vintage of the figure the module used.
- Allegations or claims that could harm a named individual require **two independent Tier 1 sources**.

### Financial and Economic

- Dollar amounts, percentage changes, and economic statistics cited as authoritative fact require a **primary source document** even on `TRUE` verdicts.
- Forward-looking claims (projections, forecasts) must be attributed to the source of the forecast—not stated as fact. One Tier 1 source (the originating institution) is sufficient if the claim is attributed.

### Real Estate and Local Services

- Local pricing, service availability, and geographic coverage claims have a known error rate in CSA output. Treat all `OVERGENERALIZED` flags in this category as **Needs Correction**.
- Address-specific or market-specific claims: verify against a current local source before publish.

### Travel and Scheduling

- Operating hours, travel restrictions, visa requirements, and schedules are time-sensitive. Even `TRUE` verdicts may be stale. Verify against the official source at time of publication.

### Entertainment and Celebrity

- Relationship status, pregnancy, and legal proceedings require **two independent Tier 1 sources**—both must be named and on-record. Unconfirmed claims should not appear in the draft regardless of module verdict.
- Historical facts about creative works (release dates, awards, box office): **one Tier 1 source** is sufficient; `TRUE` verdicts can proceed without additional verification unless the piece is a definitive reference.

---

## Escalation Logic

| Situation | Action |
|---|---|
| 1–2 `FALSE` or `MISLEADING` flags | Editor resolves; documents change in draft notes |
| 3+ flags in a single piece | Escalate to senior editor before publish; consider full manual fact-check |
| Any health or legal claim flagged `FALSE` | Hard stop; senior editor must sign off on resolution before piece moves to CMS |
| Flags the editor cannot resolve | Do not publish; bring to direct editor with specific claim and verdict noted |
| Same flag type recurs across multiple runs of the same content type | Report as pattern bug; stop using the module for that content type pending acknowledgment |
| *(Pending)* Any verdict returned with low confidence | Treat as `INSUFFICIENT_EVIDENCE` regardless of verdict; verify independently before publish |
| *(Pending)* 3+ low-confidence verdicts in a single piece | Escalate to senior editor; consider full manual fact-check |

---

## Audit Trail

> **Implementation required.** Validation output must be stored per-piece in the CMS—not only visible at generation time. This is a stated requirement; confirm implementation with the CSA product/dev team.

The audit trail for each piece must include:

- The full module output at the time of generation—all verdicts returned, the claim each verdict was attached to, and the confidence level (once available)
- Any editor actions taken on flagged claims—what was changed, removed, or overridden, and why
- The final publication state relative to the module output—which flags were resolved and how

This record must be:
- **Stored per-piece in the CMS**—accessible after the piece is published, not only during the drafting workflow
- **Attached to the piece record**—not stored in a separate log that requires cross-referencing
- **Readable by editors, senior editors, and content leads**—role-level access pending confirmation with engineering leadership and product leads

The audit trail is the authoritative record for editorial decisions and pattern analysis. It is what makes override documentation, escalation history, and recurring-bug identification possible across time.

---

## Override Documentation

When an editor overrides a module verdict—publishing a claim the module flagged, or removing one it verified—note:

- What the module returned
- What the editor decided
- One-sentence rationale

This documentation travels with the piece through the review process. It is not punitive. Editors who consistently out-perform the module's flags are providing signal that improves it.

Override documentation also serves as the source for aggregate pattern tracking. Across pieces, the record should capture:

- **Flag frequency by type**—how often each verdict (`FALSE`, `MISLEADING`, `OVERGENERALIZED`, `INSUFFICIENT_EVIDENCE`) is being flagged, by content type
- **Remediation type**—what action the editor took on each flagged claim:
  - *Rewrite*—claim was reworked and kept
  - *Removal*—claim was cut entirely
  - *Kept with rationale*—editor judged the module flag incorrect and published as-is
- **Override rate by verdict**—how often each verdict type is being overridden versus acted on, which surfaces where the module is underperforming editorially

This aggregate view is what enables pattern-bug identification, module improvement, and editorial QA over time. It requires the per-piece audit trail to be stored in the CMS (see [Audit Trail](#audit-trail)).

---

## Acceptance Criteria for Test Article Review

When walking the 15 test articles, apply each of the following:

- Does the verdict taxonomy map cleanly to the Needs Clarification / Needs Correction split defined in this document?
- Are there verdicts that don't fit either tier? (These need a third category or a handling note.)
- Does the content-type-specific rule apply correctly in the test case?
- Does the source tier weighting change the editorial action in any test case?
- Is there any verdict the module returned that an editor would have caught differently—and if so, what does that imply for the ruleset?