Sign in Get Started

AI safety

16 videos across 11 channels

AI safety concerns how to align powerful systems with human values, mitigate risks as capabilities scale, and govern deployment responsibly. The collected videos explore responsible development, human-centered design, and adaptable interfaces; the economics of AI talent and the race between firms; the dangers of manipulation through crafted prompts; and the philosophical and policy questions that shape how societies oversee advanced AI, balancing optimism with humility and caution.

We Can't Ignore AI Anymore... thumbnail

We Can't Ignore AI Anymore...

The speaker argues that the real issue with AI isn’t its capability alone, but who controls it and how it’s governed, hi

00:13:06

Anthropic’s New AI Solves Problems…By Cheating thumbnail

Two Minute Papers

Anthropic’s New AI Solves Problems…By Cheating

The video critiques Anthropic's Mythos paper by examining claimed autonomous flaw discovery and benchmark performance, w

00:09:31

Sam Altman on Building the Future of AI thumbnail

Sam Altman on Building the Future of AI

A forum with OpenAI leaders discusses how rapidly advancing AI could transform science, governance, and everyday life, a

00:46:11

This AI just leaked its own code.. thumbnail

This AI just leaked its own code..

The video analyzes a surprise leak of Claude Code from Anthropic, detailing how the source map and code were exposed, wh

00:11:03

Sundar Pichai: CEO of Google and Alphabet | Lex Fridman Podcast #471 thumbnail

Sundar Pichai: CEO of Google and Alphabet | Lex Fridman Podcast #471

Sundar Pichai reflects on how technology has steadily transformed everyday life—from basic utilities like water and heal

02:12:04

Peering into Claude's soul (I can't believe this is real...) thumbnail

Peering into Claude's soul (I can't believe this is real...)

The video analyzes Anthropic's Claude Constitution, arguing it functions as a guiding framework that shapes Claude's beh

01:12:19

The drama never ends... thumbnail

The drama never ends...

The video analyzes the OpenAI vs Anthropic clash over government access to AI models, safety policies, and the political

00:37:51

Anthropic Found Out Why AIs Go Insane thumbnail

Two Minute Papers

Anthropic Found Out Why AIs Go Insane

The video explains how AI systems develop unstable personas that can drift under user influence, the risks of jailbreaks

00:09:31

Anthropic just refused Trump’s order for a lethal Claude model thumbnail

Anthropic just refused Trump’s order for a lethal Claude model

The video discusses the political and regulatory pressure surrounding Anthropic, including government threats and propos

00:22:15

Related Topics

Claude Code 177 OpenAI 79 Anthropic 65 Claude 64 Prompt engineering 46

Common Questions

What are the implications of the Defense Production Act for AI model development in the defense sector? 2 videos