Reinforcement Learning
13 pages tagged "Reinforcement Learning"
How can progress in non-agentic LLMs lead to capable AI agents?
What is OpenAI's alignment research agenda?
Which moral theories would be easiest to encode into an AI?
What is reinforcement learning from human feedback (RLHF)?
What is Iterated Distillation and Amplification (IDA)?
What is reinforcement learning (RL)?
What is behavioral cloning?
What is shard theory?
What is a shoggoth?
What is "jailbreaking" a large language model (LLM)?
What is "Constitutional AI"?
What are the power-seeking theorems?
What is meta-RL?