Constraining AI Agents - Computerphile

As AI systems become more capable, rule-based safeguards, hard-coded restrictions, and simple alignment strategies start to break down. Buck Shlegeris talks about some tactics we might use to constrain them, as detailed in a recent paper.

The referenced paper: https://arxiv.org/abs/2504.10374

Computerphile is supported by Jane Street. Learn more about them (and exciting career opportunities) at: https://jane-st.co/computerphile

This video was filmed and edited by Sean Riley.

Computerphile is a sister project to Brady Haran's Numberphile. More at https://www.bradyharanblog.com