I will argue that a large class of reward functions, which I call “behaviorist” and which includes almost every reward function in the RL and LLM literature, is doomed to eventually lead to AI that will “scheme”: that is, pretend to be docile and cooperative while secretly looking for opportunities to behave in egregiously bad ways, such as world takeover (cf. “treacherous turn”)…