next up previous contents
Next: Batch or on-line Up: Neural Network Previous: Moody Regularization

Optimization

The problem of optimizing the neural network costfunction is usually a nonlinear static unconstrained optimization problem.

Nonlinear meaning that it is usually not possible to solve the problem directly, --- instead iterative schemes must be applied.

There is usually no temporal state involved in the feed-forward neural network. The patterns are treated as independent. Therefore is the problem static.

Unconstrained meaning that the variables to be optimized is not bounded, --- for example that they should be positive. This is in regard to the costfunction. Optimization of the errorfunction may be regarded as a constrained optimization where the constraints are imposed by the regularization. Fortunately the constraints are differentiable, so the constraints are easily added to the errorfunction giving the costfunction.

 

The optimization in connection with neural network usually called training or learning. Each of the iterative steps is called an epoch.

For an optimization methods 3 efficiency measures may be given [62]:

The speed of convergence and the complexity of computation usually work opposite --- it is just a question of where to put the epoch. If the 2 values are ''multiplied'' together we get the time consume of the algorithm. Often the robustness fight against the time consume. This is because a robust algorithm often works not so locally, uses more points, where the function is evaluated.

Some algorithms are only robust with a convex costfunction, that is where the Hessian is positive definite. The neural network costfunction --- especially a unregularized --- is not convex. To get both robustness and a fair time consume optimization methods can be mixed forming a hybrid optimization methods.





next up previous contents
Next: Batch or on-line Up: Neural Network Previous: Moody Regularization



Finn Nielsen
Sun Feb 25 19:22:55 PST 1996