Created:
Modified:

inverse reinforcement learning

This page is from my personal notes, and has not been specifically reviewed for public consumption. It might be incomplete, wrong, outdated, or stupid. Caveat lector.

inverse reinforcement learning

Links to this note

value learning

tractable approximations to utilitarianism

love is value alignment

Meta