______________
These are not CGI. Reinforcement learning is so back. When operating on strings, it gives us o3. When operating on physical motors, it gives us a robot creature that out-maneuvers almost every animal on earth. RL is one of the very few learning algorithms that can master both the world of bits and the world of atoms.
Give me a reward function, and I shall move the world.
2024 is the year of prompt engineering.
2025 is the year of reward engineering.