Clever Algorithms: Nature-Inspired Programming Recipes

A book by Jason Brownlee

Learning Vector Quantization

Learning Vector Quantization, LVQ.

Taxonomy

The Learning Vector Quantization algorithm belongs to the field of Artificial Neural Networks and Neural Computation. More broadly to the field of Computational Intelligence. The Learning Vector Quantization algorithm is a supervised neural network that uses a competitive (winner-take-all) learning strategy. It is related to other supervised neural networks such as the Perceptron and the Back-propagation algorithm. It is related to other competitive learning neural networks such as the the Self-Organizing Map algorithm that is a similar algorithm for unsupervised learning with the addition of connections between the neurons. Additionally, LVQ is a baseline technique that was defined with a few variants LVQ1, LVQ2, LVQ2.1, LVQ3, OLVQ1, and OLVQ3 as well as many third-party extensions and refinements too numerous to list.

Inspiration

The Learning Vector Quantization algorithm is related to the Self-Organizing Map which is in turn inspired by the self-organizing capabilities of neurons in the visual cortex.

Strategy

The information processing objective of the algorithm is to prepare a set of codebook (or prototype) vectors in the domain of the observed input data samples and to use these vectors to classify unseen examples. An initially random pool of vectors is prepared which are then exposed to training samples. A winner-take-all strategy is employed where one or more of the most similar vectors to a given input pattern are selected and adjusted to be closer to the input vector, and in some cases, further away from the winner for runners up. The repetition of this process results in the distribution of codebook vectors in the input space which approximate the underlying distribution of samples from the test dataset.

Procedure

Vector Quantization is a technique from signal processing where density functions are approximated with prototype vectors for applications such as compression. Learning Vector Quantization is similar in principle, although the prototype vectors are learned through a supervised winner-take-all method.

Algorithm (below) provides a high-level pseudocode for preparing codebook vectors using the Learning Vector Quantization method. Codebook vectors are initialized to small floating point values, or sampled from an available dataset. The Best Matching Unit (BMU) is the codebook vector from the pool that has the minimum distance to an input vector. A distance measure between input patterns must be defined. For real-valued vectors, this is commonly the Euclidean distance:

$dist(x,c) = \sum_{i=1}^{n} (x_i - c_i)^2$

where $n$ is the number of attributes, $x$ is the input vector and $c$ is a given codebook vector.

Input: ProblemSize, InputPatterns, $iterations_{max}$, $CodebookVectors_{num}$, $learn_{rate}$
Output: CodebookVectors
CodebookVectors $\leftarrow$ InitializeCodebookVectors{$CodebookVectors_{num}$, ProblemSize}
For ($i=1$ To $iterations_{max}$)
    $Pattern_i$ $\leftarrow$ SelectInputPattern{InputPatterns}
    $Bmu_i$ $\leftarrow$ SelectBestMatchingUnit{$Pattern_i$, CodebookVectors}
    For ($Bmu_{i}^{attribute}$ $\in$ $Bmu_i$)
        If ($Bmu_{i}^{class}$ $\equiv$ $Pattern_{i}^{class}$)
            $Bmu_{i}^{attribute}$ $\leftarrow$ $Bmu_{i}^{attribute}$ $+$ $learn_{rate}$ $\times$ ($Pattern_{i}^{attribute}$ $-$ $Bmu_{i}^{attribute}$)
        Else
            $Bmu_{i}^{attribute}$ $\leftarrow$ $Bmu_{i}^{attribute}$ $-$ $learn_{rate}$ $\times$ ($Pattern_{i}^{attribute}$ $-$ $Bmu_{i}^{attribute}$)
        End
    End
End
Return (CodebookVectors)

Pseudocode for LVQ1.

Heuristics

Learning Vector Quantization was designed for classification problems that have existing data sets that can be used to supervise the learning by the system. The algorithm does not support regression problems.
LVQ is non-parametric, meaning that it does not rely on assumptions about that structure of the function that it is approximating.
Real-values in input vectors should be normalized such that $x \in [0,1)$.
Euclidean distance is commonly used to measure the distance between real-valued vectors, although other distance measures may be used (such as dot product), and data specific distance measures may be required for non-scalar attributes.
There should be sufficient training iterations to expose all the training data to the model multiple times.
The learning rate is typically linearly decayed over the training period from an initial value to close to zero.
The more complex the class distribution, the more codebook vectors that will be required, some problems may need thousands.
Multiple passes of the LVQ training algorithm are suggested for more robust usage, where the first pass has a large learning rate to prepare the codebook vectors and the second pass has a low learning rate and runs for a long time (perhaps 10-times more iterations).

Code Listing

Listing (below) provides an example of the Learning Vector Quantization algorithm implemented in the Ruby Programming Language. The problem is a contrived classification problem in a 2-dimensional domain $x\in[0,1], y\in[0,1]$ with two classes: 'A' ($x\in[0,0.4999999], y\in[0,0.4999999]$) and 'B' ($x\in[0.5,1], y\in[0.5,1]$).

The algorithm was implemented using the LVQ1 variant where the best matching codebook vector is located and moved toward the input vector if it is the same class, or away if the classes differ. A linear decay was used for the learning rate that was updated after each pattern was exposed to the model. The implementation can easily be extended to the other variants of the method.

def random_vector(minmax)
  return Array.new(minmax.size) do |i|
    minmax[i][0] + ((minmax[i][1] - minmax[i][0]) * rand())
  end
end

def generate_random_pattern(domain)
  classes = domain.keys
  selected_class = rand(classes.size)
  pattern = {:label=>classes[selected_class]}
  pattern[:vector] = random_vector(domain[classes[selected_class]])
  return pattern
end

def initialize_vectors(domain, num_vectors)
  classes = domain.keys
  codebook_vectors = []
  num_vectors.times do
    selected_class = rand(classes.size)
    codebook = {}
    codebook[:label] = classes[selected_class]
    codebook[:vector] = random_vector([[0,1],[0,1]])
    codebook_vectors << codebook
  end
  return codebook_vectors
end

def euclidean_distance(c1, c2)
  sum = 0.0
  c1.each_index {|i| sum += (c1[i]-c2[i])**2.0}
  return Math.sqrt(sum)
end

def get_best_matching_unit(codebook_vectors, pattern)
  best, b_dist = nil, nil
  codebook_vectors.each do |codebook|
    dist = euclidean_distance(codebook[:vector], pattern[:vector])
    best,b_dist = codebook,dist if b_dist.nil? or dist<b_dist
  end
  return best
end

def update_codebook_vector(bmu, pattern, lrate)
  bmu[:vector].each_with_index do |v,i|
    error = pattern[:vector][i]-bmu[:vector][i]
    if bmu[:label] == pattern[:label]
      bmu[:vector][i] += lrate * error
    else
      bmu[:vector][i] -= lrate * error
    end
  end
end

def train_network(codebook_vectors, domain, iterations, learning_rate)
  iterations.times do |iter|
    pat = generate_random_pattern(domain)
    bmu = get_best_matching_unit(codebook_vectors, pat)
    lrate = learning_rate * (1.0-(iter.to_f/iterations.to_f))
    if iter.modulo(10)==0
      puts "> iter=#{iter}, got=#{bmu[:label]}, exp=#{pat[:label]}"
    end
    update_codebook_vector(bmu, pat, lrate)
  end
end

def test_network(codebook_vectors, domain, num_trials=100)
  correct = 0
  num_trials.times do
    pattern = generate_random_pattern(domain)
    bmu = get_best_matching_unit(codebook_vectors, pattern)
    correct += 1 if bmu[:label] == pattern[:label]
  end
  puts "Done. Score: #{correct}/#{num_trials}"
  return correct
end

def execute(domain, iterations, num_vectors, learning_rate)
  codebook_vectors = initialize_vectors(domain, num_vectors)
  train_network(codebook_vectors, domain, iterations, learning_rate)
  test_network(codebook_vectors, domain)
  return codebook_vectors
end

if __FILE__ == $0
  # problem configuration
  domain = {"A"=>[[0,0.4999999],[0,0.4999999]],"B"=>[[0.5,1],[0.5,1]]}
  # algorithm configuration
  learning_rate = 0.3
  iterations = 1000
  num_vectors = 20
  # execute the algorithm
  execute(domain, iterations, num_vectors, learning_rate)
end

Learning Vector Quantization in Ruby

Download: lvq.rb.

References

Primary Sources

The Learning Vector Quantization algorithm was described by Kohonen in 1988 [Kohonen1988], and was further described in the same year by Kohonen [Kohonen1988a] and benchmarked by Kohonen, Barna, and Chrisley [Kohonen1988b].

Learn More

Kohonen provides a detailed overview of the state of LVQ algorithms and variants (LVQ1, LVQ2, and LVQ2.1) [Kohonen1990]. The technical report that comes with the LVQ_PAK software (written by Kohonen and his students) provides both an excellent summary of the technique and its main variants, as well as summarizing the important considerations when applying the approach [Kohonen1996]. The seminal book on Learning Vector Quantization and the Self-Organizing Map is "Self-Organizing Maps" by Kohonen, which includes a chapter (Chapter 6) dedicated to LVQ and its variants [Kohonen1995].

Bibliography

[Kohonen1988]	T. Kohonen, "Learning Vector Quantization", Neural Networks, 1988.
[Kohonen1988a]	T. Kohonen, "An introduction to neural computing", Neural Networks, 1988.
[Kohonen1988b]	T. Kohonen and G. Barna and R. Chrisley, "Statistical pattern recognition with neural networks: benchmarking\n\tstudies", in IEEE International Conference on Neural Networks, 1988.
[Kohonen1990]	T. Kohonen, "Improved versions of learning vector quantization", in IJCNN International Joint Conference on Neural Networks, 1990.
[Kohonen1995]	T. Kohonen, "Self-Organizing Maps", Springer, 1995.
[Kohonen1996]	T. Kohonen and J. Hynninen and J. Kangas and J. Laaksonen and K.\n\tTorkkola, "LVQ–PAK: The Learning Vector Quantization Program Package", Technical Report A30, Helsinki University of Technology, Laboratory of Computer and Information\n\tScience, Rakentajanaukio, 1996.

Please Note: This content was automatically generated from the book content and may contain minor differences.