Capstone
Reference Material
- Speech and Language Processing, 3rd Edition: Working draft of the book on natural language processing by Jurafsky and Martin. Its content on n-grams is helpful for the Capstone.
Course Project
- n-gram Computations and Computer Capacity: Explains how much memory is required to convert the course project's text files into n-grams with the quanteda package.
- Capstone Strategy: Describes a general strategy for getting through the Capstone, namely using the simplest approaches possible.
- Choosing a Text Analysis Package: Reviews the pros and cons of various R packages for natural language processing, in the context of the Capstone project's requirements.