Can you give an AI a goal which involves “minimally impacting the world”?
Penalizing an AI for affecting the world too much is called impact regularization and is an active area of alignment research.
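As a rough illustration of the idea (not any particular published method), impact regularization can be thought of as reward shaping: the agent's task reward is reduced by a penalty that grows with how much the agent has changed the world relative to some baseline, such as what would have happened if it had done nothing. The sketch below is a toy example under that assumption; all names (`impact_measure`, `baseline_state`, `penalty_weight`) are hypothetical.

```python
# Minimal sketch of impact regularization as reward shaping.
# All names here are illustrative, not from any specific library or paper.

def regularized_reward(task_reward, current_state, baseline_state,
                       impact_measure, penalty_weight=1.0):
    """Task reward minus a penalty for deviating from a baseline state.

    impact_measure(current_state, baseline_state) should return a
    non-negative number that is larger the more the agent has changed
    the world compared with the baseline.
    """
    impact = impact_measure(current_state, baseline_state)
    return task_reward - penalty_weight * impact


# Toy usage: states are dicts of features; "impact" is how many features differ.
def count_changed_features(state, baseline):
    return sum(1 for key in baseline if state.get(key) != baseline[key])

baseline = {"vase_intact": True, "door_open": False}
after_action = {"vase_intact": False, "door_open": False}

print(regularized_reward(task_reward=1.0,
                         current_state=after_action,
                         baseline_state=baseline,
                         impact_measure=count_changed_features,
                         penalty_weight=0.5))
# -> 0.5: the agent earned 1.0 but is penalized 0.5 for breaking the vase.
```

Real proposals differ mainly in how they define the impact measure and the baseline, which is a large part of what makes this an open research problem.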