Raymond describes his independent Lisp research project intended to extract knowledge from the Wikipedia website.
His project combines natural language processing techniques, knowledge representation paradigms and machine learning algorithms to create a semantic model of the information contained in Wikipedia.
Raymond presents an algorithm for the automatic generation of topic taxonomies and suggests how such a model can be used to implement contextually relevant web searches. In doing so, he provides a brief overview of the following topics and algorithms:
- Natural Language Processing
- Semantic Nets
- Similarity Metrics
- Clustering Algorithms