AI Explained. Older than the article (March 2024), but the methodology is what's useful: a single careful reviewer running the same hard tasks — OCR, theory of mind, instruction following, math — through three frontier models side by side and showing exactly where each one cracks. The model names are dated, the framework for comparing models is not.
Treat this as a testing-method video, not a current ranking. The concrete winners are historical; the useful part is running the same examples, checking failure modes and resisting vague "best model" claims.
See a structured comparison method you can reuse when deciding which model is good enough for a task.
Know that Claude 3, Gemini 1.5 and GPT-4 are no longer the frontier baseline.
Continue through the same learning path with the next curated companion videos.