HealthBench

3 videos across 2 channels

HealthBench looks at how AI models perform in healthcare, from safety and grounding in medical knowledge to data privacy and clinician collaboration. The videos contrast top models like Claude Fable 5 and GPT-5.5 across benchmarks with real-world healthcare tasks, while also exploring governance, compute costs, and the path toward safer, more accessible patient care through practical deployments and multi-modal data integration.

Claude Fable 5 - Full 319 page Breakdown thumbnail

Claude Fable 5 - Full 319 page Breakdown

The video analyzes Anthropic's Claude Fable 5 and Mythos 5 as significant yet guarded advances in AI, highlighting the e

00:33:59
GPT 5.5 Arrives, DeepSeek V4 Drops, and the Compute War Intensifies thumbnail

GPT 5.5 Arrives, DeepSeek V4 Drops, and the Compute War Intensifies

The video surveys the AI landscape through new models like GPT-5.5 and DeepSeek V4, comparing their performance, costs,

00:25:19
Building AI for better healthcare — the OpenAI Podcast Ep. 14 thumbnail

Building AI for better healthcare — the OpenAI Podcast Ep. 14

OpenAI researchers Dr. Nate Gross and Karan Singhal discuss the OpenAI approach to healthcare AI, emphasizing safety, gr

00:30:54