AI safety

9 videos across 6 channels

AI safety is concerned with aligning powerful systems with human values, mitigating risks as capabilities scale, and governing deployment responsibly. The collected videos cover responsible development, human-centered design, and adaptable interfaces; the economics of AI talent and the race between firms; the dangers of manipulation through crafted prompts; and the philosophical and policy questions that shape how societies oversee advanced AI. Throughout, they balance optimism with humility and caution.

This AI just leaked its own code..

The video analyzes a surprise leak of Claude Code from Anthropic, detailing how the source map and code were exposed, wh

00:11:03

Sundar Pichai: CEO of Google and Alphabet | Lex Fridman Podcast #471

Sundar Pichai reflects on how technology has steadily transformed everyday life—from basic utilities like water and heal

02:12:04

Peering into Claude's soul (I can't believe this is real...)

The video analyzes Anthropic's Claude Constitution, arguing it functions as a guiding framework that shapes Claude's beh

01:12:19

The drama never ends...

The video analyzes the OpenAI vs Anthropic clash over government access to AI models, safety policies, and the political

00:37:51

Anthropic Found Out Why AIs Go Insane

The video explains how AI systems develop unstable personas that can drift under user influence, the risks of jailbreaks

00:09:31

Anthropic just refused Trump’s order for a lethal Claude model

The video discusses the political and regulatory pressure surrounding Anthropic, including government threats and propos

00:22:15