The Goal of Learning in Boltzmann Machines
We aim to maximize the product of probabilities that the Boltzmann machine assigns to the binary vectors in the training set. This is equivalent to maximizing the sum of the log probabilities that the Boltzmann machine assigns to the training vectors.
It is also equivalent to maximizing the probability that we would obtain exactly the N training cases if we did the following: Let the network settle to its stationary distribution N different times with no external input, then sample the visible vector once each time.