Home Knowledge Base MultiFC

MultiFC is the large-scale, multi-domain fact-checking dataset aggregated from 26 professional fact-checking websites — providing the most diverse collection of real-world misinformation labels in NLP, spanning politics, health, science, and urban legends from sources like PolitiFact, Snopes, and FactCheck.org.

What Is MultiFC?

The Label Normalization Challenge

The core technical difficulty of MultiFC is that different fact-checking sites use incompatible label vocabularies. A "Misleading" label on Reuters Fact Check is not equivalent to "Misleading" on Snopes — the standards and definitions differ. Models must either:

Why MultiFC Matters

Model Approaches

Text-Only Baselines:

Metadata-Enhanced Models:

Evidence-Retrieval Models:

Comparison to Related Benchmarks

FeatureFEVERClimate-FEVERMultiFC
ClaimsArtificialReal (climate)Real (multi-domain)
Labels3 standard4100+ site-specific
EvidenceWikipediaWikipediaFull fact-check articles
MetadataNoneNoneSpeaker, date, tags
Scale185k1.5k36k

Common Failure Modes

Applications

MultiFC is the professional fact-checker's dataset — training AI on tens of thousands of real expert verdicts to recognize the patterns, contexts, and metadata signals that distinguish reliable information from coordinated misinformation.

multifcevaluation

Explore 500+ Semiconductor & AI Topics

From EUV lithography to CUDA optimization — search the full knowledge base or chat with our AI assistant.