What are some open research questions in AI alignment?
Table Below
This | Is | A |
---|---|---|
Table | For | Testing |
Table above
Open Problems in AI X-Risk [PAIS #5]
AI alignment has numerous subdomains and each has many open research questions
Here are some examples:
Agent foundations
Mechanistic interpretability - This is research which aims to understand the inside workings of Neural networks
Honesty
Governance
As a new field, there are many more open problems as well, and new questions are constantly being asked