AI Alignment

MEDIUM fear General Audience

The field of research dedicated to ensuring AI systems act in ways that match human values and goals.

In Plain English

Alignment is about making sure the AI does what we actually want it to do, without unintended side effects. Imagine telling a robot to clean the house as fast as possible, and it throws all your belongings in the trash. The robot succeeded at the task, but it was not aligned with your actual desires. Researchers work on alignment to ensure super-smart AI does not harm humanity while trying to complete its goals.

Real-World Example

Programming an AI self-driving car so that it prioritizes human safety over arriving at the destination quickly.

← Back to Full Glossary