stray thoughts about reward functions (probably related to the [agent] abstraction and the [intentional stance]) one can make a…
Tagged with: #ai #reinforcement-learning #alignment