Learning Vector Quantization (LVQ)
The Learning Vector Quantization algorithm belongs to the field of Artificial Neural Networks and Neural Computation, and more broadly to the field of Computational Intelligence. The Learning Vector Quantization algorithm is a supervised neural network that uses a competitive (winner-take-all) learning strategy. It is related to other supervised neural networks such as the Perceptron and the Back-propagation algorithm. It is also related to other competitive learning neural networks such as the Self-Organizing Map algorithm, a similar method for unsupervised learning that adds connections (a topology) between the neurons. Additionally, LVQ is a baseline technique defined with a few variants (LVQ1, LVQ2, LVQ2.1, LVQ3, OLVQ1, and OLVQ3), as well as many third-party extensions and refinements too numerous to list.
The Learning Vector Quantization algorithm is related to the Self-Organizing Map which is in turn inspired by the self-organizing capabilities of neurons in the visual cortex.
The information processing objective of the algorithm is to prepare a set of codebook (or prototype) vectors in the domain of the observed input data samples and to use these vectors to classify unseen examples. An initially random pool of vectors is prepared and then exposed to training samples. A winner-take-all strategy is employed: the vector most similar to a given input pattern is selected and adjusted to be closer to the input vector if their classes match, or further away if they differ; some variants also adjust runner-up vectors. The repetition of this process results in a distribution of codebook vectors in the input space that approximates the underlying distribution of samples in the training dataset.
Vector Quantization is a technique from signal processing where density functions are approximated with prototype vectors for applications such as compression. Learning Vector Quantization is similar in principle, although the prototype vectors are learned through a supervised winner-take-all method.
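To make the compression analogy concrete, the following is a minimal sketch of plain Vector Quantization (the quantize helper, the prototype values, and the sample are illustrative assumptions, not part of the listing later in this section): each sample is encoded as the index of its nearest prototype vector and decoded by looking that prototype back up.
def quantize(sample, prototypes)
  # compute the squared distance from the sample to every prototype
  distances = prototypes.map do |proto|
    proto.zip(sample).inject(0.0) {|sum, (p, s)| sum + (p - s)**2.0}
  end
  # encode the sample as the index of its nearest prototype
  return distances.index(distances.min)
end

prototypes = [[0.1, 0.1], [0.9, 0.9]]      # hypothetical codebook
index = quantize([0.2, 0.25], prototypes)  # => 0
reconstruction = prototypes[index]         # => [0.1, 0.1] approximates the sample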
Algorithm (below) provides a high-level pseudocode for preparing codebook vectors using the Learning Vector Quantization method. Codebook vectors are initialized to small floating point values, or sampled from an available dataset. The Best Matching Unit (BMU) is the codebook vector from the pool that has the minimum distance to an input vector. A distance measure between input patterns must be defined. For real-valued vectors, this is commonly the Euclidean distance:
$dist(x,c) = \sum_{i=1}^{n} (x_i - c_i)^2$, where $n$ is the number of attributes, $x$ is the input vector, and $c$ is a given codebook vector.
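For example, with hypothetical vectors $x=(0.2, 0.3)$ and $c=(0.5, 0.7)$, $dist(x,c) = (0.2-0.5)^2 + (0.3-0.7)^2 = 0.09 + 0.16 = 0.25$. The listing below takes the square root of this quantity; since the square root is monotonic, the same codebook vector is selected as the Best Matching Unit either way.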
Input: ProblemSize, InputPatterns, $iterations_{max}$, $CodebookVectors_{num}$, $learn_{rate}$
Output: CodebookVectors

CodebookVectors $\leftarrow$ InitializeCodebookVectors($CodebookVectors_{num}$, ProblemSize)
For ($i=1$ To $iterations_{max}$)
    $Pattern_i$ $\leftarrow$ SelectInputPattern(InputPatterns)
    $Bmu_i$ $\leftarrow$ SelectBestMatchingUnit($Pattern_i$, CodebookVectors)
    For ($Bmu_{i}^{attribute}$ $\in$ $Bmu_i$)
        If ($Bmu_{i}^{class}$ $\equiv$ $Pattern_{i}^{class}$)
            $Bmu_{i}^{attribute} \leftarrow Bmu_{i}^{attribute} + learn_{rate} \times (Pattern_{i}^{attribute} - Bmu_{i}^{attribute})$
        Else
            $Bmu_{i}^{attribute} \leftarrow Bmu_{i}^{attribute} - learn_{rate} \times (Pattern_{i}^{attribute} - Bmu_{i}^{attribute})$
        End
    End
End
Return (CodebookVectors)

Listing (below) provides an example of the Learning Vector Quantization algorithm implemented in the Ruby Programming Language. The problem is a contrived classification problem in a 2-dimensional domain $x\in[0,1], y\in[0,1]$ with two classes: 'A' ($x\in[0,0.4999999], y\in[0,0.4999999]$) and 'B' ($x\in[0.5,1], y\in[0.5,1]$).
The algorithm was implemented using the LVQ1 variant, where the best matching codebook vector is located and moved toward the input vector if it shares the input's class, or away from it if the classes differ. A linear decay was used for the learning rate, updated after each pattern was exposed to the model. The implementation can easily be extended to the other variants of the method; a sketch of one such extension (LVQ2.1) is provided after the listing.
def random_vector(minmax)
  # draw a uniform random value within each attribute's [min,max] bounds
  return Array.new(minmax.size) do |i|
    minmax[i][0] + ((minmax[i][1] - minmax[i][0]) * rand())
  end
end

def generate_random_pattern(domain)
  # sample a random class, then a random vector from that class's region
  classes = domain.keys
  selected_class = rand(classes.size)
  pattern = {:label=>classes[selected_class]}
  pattern[:vector] = random_vector(domain[classes[selected_class]])
  return pattern
end

def initialize_vectors(domain, num_vectors)
  # create the pool of codebook vectors with random labels and positions
  classes = domain.keys
  codebook_vectors = []
  num_vectors.times do
    selected_class = rand(classes.size)
    codebook = {}
    codebook[:label] = classes[selected_class]
    codebook[:vector] = random_vector([[0,1],[0,1]])
    codebook_vectors << codebook
  end
  return codebook_vectors
end

def euclidean_distance(c1, c2)
  sum = 0.0
  c1.each_index {|i| sum += (c1[i]-c2[i])**2.0}
  return Math.sqrt(sum)
end

def get_best_matching_unit(codebook_vectors, pattern)
  # locate the codebook vector with the minimum distance to the pattern
  best, b_dist = nil, nil
  codebook_vectors.each do |codebook|
    dist = euclidean_distance(codebook[:vector], pattern[:vector])
    best, b_dist = codebook, dist if b_dist.nil? or dist < b_dist
  end
  return best
end

def update_codebook_vector(bmu, pattern, lrate)
  # LVQ1 update: move the BMU toward the pattern if the classes match,
  # otherwise move it away
  bmu[:vector].each_with_index do |v,i|
    error = pattern[:vector][i] - bmu[:vector][i]
    if bmu[:label] == pattern[:label]
      bmu[:vector][i] += lrate * error
    else
      bmu[:vector][i] -= lrate * error
    end
  end
end

def train_network(codebook_vectors, domain, iterations, learning_rate)
  iterations.times do |iter|
    pat = generate_random_pattern(domain)
    bmu = get_best_matching_unit(codebook_vectors, pat)
    # linearly decay the learning rate over the training run
    lrate = learning_rate * (1.0-(iter.to_f/iterations.to_f))
    if iter.modulo(10)==0
      puts "> iter=#{iter}, got=#{bmu[:label]}, exp=#{pat[:label]}"
    end
    update_codebook_vector(bmu, pat, lrate)
  end
end

def test_network(codebook_vectors, domain, num_trials=100)
  # classify random patterns by the label of their BMU and count correct
  correct = 0
  num_trials.times do
    pattern = generate_random_pattern(domain)
    bmu = get_best_matching_unit(codebook_vectors, pattern)
    correct += 1 if bmu[:label] == pattern[:label]
  end
  puts "Done. Score: #{correct}/#{num_trials}"
  return correct
end

def execute(domain, iterations, num_vectors, learning_rate)
  codebook_vectors = initialize_vectors(domain, num_vectors)
  train_network(codebook_vectors, domain, iterations, learning_rate)
  test_network(codebook_vectors, domain)
  return codebook_vectors
end

if __FILE__ == $0
  # problem configuration
  domain = {"A"=>[[0,0.4999999],[0,0.4999999]],"B"=>[[0.5,1],[0.5,1]]}
  # algorithm configuration
  learning_rate = 0.3
  iterations = 1000
  num_vectors = 20
  # execute the algorithm
  execute(domain, iterations, num_vectors, learning_rate)
end
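As noted above, the implementation can be extended to the other variants. The following is a minimal sketch of an LVQ2.1-style update, reusing euclidean_distance from the listing above; the helper names, the window width of 0.3, and the guard conditions are assumptions for illustration rather than a definitive implementation. LVQ2.1 locates the two nearest codebook vectors and updates them only when exactly one matches the pattern's class and the pattern falls within a window around the midplane between them.
def get_two_best_matching_units(codebook_vectors, pattern)
  # sort the pool by distance to the pattern and take the two nearest
  sorted = codebook_vectors.sort_by do |codebook|
    euclidean_distance(codebook[:vector], pattern[:vector])
  end
  return sorted.first(2)
end

def update_codebook_vectors_lvq21(codebook_vectors, pattern, lrate, window=0.3)
  bmu1, bmu2 = get_two_best_matching_units(codebook_vectors, pattern)
  d1 = euclidean_distance(bmu1[:vector], pattern[:vector])
  d2 = euclidean_distance(bmu2[:vector], pattern[:vector])
  # update only when exactly one of the two nearest vectors matches the class
  return unless (bmu1[:label]==pattern[:label]) ^ (bmu2[:label]==pattern[:label])
  # update only when the pattern falls inside a window around the midplane
  s = (1.0 - window) / (1.0 + window)
  return unless [d1/d2, d2/d1].min > s
  correct, incorrect = (bmu1[:label]==pattern[:label]) ? [bmu1,bmu2] : [bmu2,bmu1]
  correct[:vector].each_index do |i|
    # move the correctly labeled vector toward the pattern and the other away
    correct[:vector][i] += lrate * (pattern[:vector][i] - correct[:vector][i])
    incorrect[:vector][i] -= lrate * (pattern[:vector][i] - incorrect[:vector][i])
  end
end
Training with this variant would call update_codebook_vectors_lvq21 in place of update_codebook_vector inside train_network, typically as a short fine-tuning stage after an initial LVQ1 or OLVQ1 pass.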
The Learning Vector Quantization algorithm was described by Kohonen in 1988 [Kohonen1988], and was further described in the same year by Kohonen [Kohonen1988a] and benchmarked by Kohonen, Barna, and Chrisley [Kohonen1988b].
Kohonen provides a detailed overview of the state of LVQ algorithms and variants (LVQ1, LVQ2, and LVQ2.1) [Kohonen1990]. The technical report that comes with the LVQ_PAK software (written by Kohonen and his students) provides both an excellent summary of the technique and its main variants and a discussion of the important considerations when applying the approach [Kohonen1996]. The seminal book on Learning Vector Quantization and the Self-Organizing Map is "Self-Organizing Maps" by Kohonen, which includes a chapter (Chapter 6) dedicated to LVQ and its variants [Kohonen1995].
| [Kohonen1988] | T. Kohonen, "Learning Vector Quantization", Neural Networks, 1988. |
| [Kohonen1988a] | T. Kohonen, "An introduction to neural computing", Neural Networks, 1988. |
| [Kohonen1988b] | T. Kohonen, G. Barna, and R. Chrisley, "Statistical pattern recognition with neural networks: benchmarking studies", in IEEE International Conference on Neural Networks, 1988. |
| [Kohonen1990] | T. Kohonen, "Improved versions of learning vector quantization", in IJCNN International Joint Conference on Neural Networks, 1990. |
| [Kohonen1995] | T. Kohonen, "Self-Organizing Maps", Springer, 1995. |
| [Kohonen1996] | T. Kohonen, J. Hynninen, J. Kangas, J. Laaksonen, and K. Torkkola, "LVQ_PAK: The Learning Vector Quantization Program Package", Technical Report A30, Helsinki University of Technology, Laboratory of Computer and Information Science, Rakentajanaukio, 1996. |