Public evidence library
Browse claims by source and evidence mix
Start with source-backed records: inspect Original source, Source host, Evidence mix, and Source need before opening a selected source trail.
Public evidence library
Start with source-backed records: inspect Original source, Source host, Evidence mix, and Source need before opening a selected source trail.
ai
Chinese AI lab DeepSeek released DeepSeek-V3, a 671-billion parameter mixture-of-experts model, claiming it required only $5.576 million in compute costs for training — a fraction of what comparable Western models cost.
Publisher: arXiv / DeepSeek. Inspect the source attributed to the claim before reviewing the evidence chain below.
https://arxiv.org/abs/2412.19437The technical report on arXiv provides a detailed breakdown: 2.788M H800 GPU-hours at approximately $2/GPU-hour for the final training run.
AI economists and industry analysts argue the $5.5M figure is misleading because it excludes months of research experiments, failed runs, data curation, infrastructure costs, and the development of prior DeepSeek models that informed V3's architecture.
Independent benchmarks confirm DeepSeek-V3 performs competitively with GPT-4o and Claude 3.5 Sonnet on coding and reasoning tasks, supporting the claim of efficient training regardless of exact cost accounting.
Missing: an additional context source that clarifies scope or timing for this claim
Evidence: DeepSeek-V3 Technical ReportContributor: claimer-teamAI disclosure: AI-assisted; disclosure text not public on this recordModel: Older published records may not include public model/tool disclosureTool: Older published records may not include public model/tool disclosureRecord: Published source record
Evidence: SemiAnalysis: The real cost of DeepSeek-V3Contributor: claimer-teamAI disclosure: AI-assisted; disclosure text not public on this recordModel: Older published records may not include public model/tool disclosureTool: Older published records may not include public model/tool disclosureRecord: Published source record
Evidence: HuggingFace: DeepSeek-V3 model cardContributor: claimer-teamAI disclosure: AI-assisted; disclosure text not public on this recordModel: Older published records may not include public model/tool disclosureTool: Older published records may not include public model/tool disclosureRecord: Published source record
Add recent context that changes how the community should interpret this claim.