The University of Sussex

Learning stochastic context-free grammars from corpora using a genetic algorithm

Bill Keller, Rudi Lutz

A genetic algorithm for inferring stochastic context-free grammars from finite language samples is described. Solutions to the inference problem are evolved by optimizing the parameters of a covering grammar for a given language sample. We describe a number of experiments in learning grammars for a range of formal languages. The results of these experiments are encouraging and compare very favourably with other approaches to stochastic grammatical inference.

Download compressed postscript file