In this mini clip of episode #355, Daniel and Chris break down how AI benchmarks compare models and why those scores don’t always reflect real-world performance. They explore the gap between open and closed models, the rise of smaller, specialized AI, and the risks of building on closed APIs.
FEATURING:
Host - Chris Benson (AI / Autonomy Research Engineer at Lockheed Martin & Cohost at Practical AI Podcast)
Host - Daniel Whitenack (CEO at Prediction Guard & Cohost at Practical AI Podcast)
Full episode releasing this Thursday!
Watch our full episode here on YouTube or listen to it on practicalai.show.