What are some open research questions in AI alignment?

1 min read

Suggest changes in Google Docs

Table Below

This	Is	A
Table	For	Testing

Table above

AI alignment has numerous subdomains and each has many open research questions

Here are some examples:

Agent foundations

Mechanistic interpretability - This is research which aims to understand the inside workings of Neural networks

Honesty

Governance

As a new field, there are many more open problems as well, and new questions are constantly being asked