This paper demonstrates that the principle of system realization and gradient descent through a model network can be used for learning certain cases of dynamic selective attention. The context is given by attentive vision: It is demonstrated that an imperfect model network which emulates the fovea dynamics can contribute for learning perfect solutions to certain target detection problems. However, the concept of system realization is general enough to allow less specialized approaches to selective attention than the one presented in this paper.