Constraining AI Agents - Computerphile

Your video will begin in 10
Skip ad (5)
webinarJam 30 day trial Link

Thanks! Share it with your friends!

You disliked this video. Thanks for the feedback!

Added by admin
16 Views
As AI systems become more capable, rule-based safeguards, hard-coded restrictions, and simple alignment strategies start to break down. Buck Shlegeris talks about some tactics we might use as detailed in a recent paper.

The referenced paper: https://arxiv.org/abs/2504.10374

Computerphile is supported by Jane Street. Learn more about them (and exciting career opportunities) at: https://jane-st.co/computerphile

This video was filmed and edited by Sean Riley.

Computerphile is a sister project to Brady Haran's Numberphile. More at https://www.bradyharanblog.com
Category
Systeme.io Boost your sales
Tags
computers, computerphile, computer

Post your comment

Comments

Be the first to comment