Notes on the Alignment Forum's Value Learning sequence, curated by Rohin Shah. Ambitious value learning: the idea of learning 'the human…
The standard [Markov decision process] formalism includes a reward function; the total (discounted) reward across a trajectory is its…
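For concreteness, a minimal sketch of that quantity under the usual conventions (γ for the discount factor, r_t for the reward received at step t, and τ for a trajectory of length T; indexing conventions vary between sources):

$$
R(\tau) = \sum_{t=0}^{T} \gamma^{t}\, r_{t}
$$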
Silver, Singh, Precup, and Sutton argue that Reward is enough: maximizing a reward signal implies, on its own, a very broad range of…
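Loosely in symbols (my paraphrase, not the paper's own statement): the hypothesis is that an agent doing well on the objective

$$
\max_{\pi} \; \mathbb{E}_{\pi}\!\left[\sum_{t=0}^{\infty} \gamma^{t}\, r_{t}\right]
$$

in a sufficiently rich environment would, as a by-product, exhibit a very broad range of associated abilities.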
Note: see [reinforcement learning notation] for a guide to the notation I'm attempting to use throughout my RL notes. Three paradigmatic…
There are three main approaches to moral philosophy: [utilitarian]ism: you should feed a starving person because it will increase 'global…
AI safety, as a term, is sterile and hard to get excited about. Preventing catastrophe is important, but doesn't motivate me, since [the…