Few component functions preferred. We observe that hidden units whose outgoing weight vectors have near-zero weights yield small contributions to , that is, the number of CFs will get minimized.
Common component functions preferred. Outgoing weight vectors of hidden units are encouraged to have a large effect on the output (see denominator in the last term in the brackets of ). This implies preference of CFs that can be used for generating many or all output components.
CF separation -- few relevant CFs per output unit. On the other hand, two hidden units whose outgoing weight vectors do not solely consist of near-zero weights are encouraged to influence the output in different ways by not representing the same input feature (see numerator in the last term in the brackets of ). In fact, FMS punishes not only outgoing weight vectors with same or opposite directions but also vectors obtained by flipping the signs of the weights (multiple reflections from hyperplanes trough the origin and orthogonal to one axis). Hence two units performing redundant tasks, such as both activating some output unit, or one activating it and the other de-activating it, will cause large contributions to . This encourages separation of CFs and use of few CFs per output unit.