inverse reinforcement learning: Nonlinear Function