A mixture-of-experts model consists of a set of functions , the 'experts', and a gating function that determines how to select which…
Closely related to [ discrete latent variable ]s and to [ reinforcement learning ] with discrete actions. If I do a thing and it goes well…