Machine clustering and “The Flowing Light of the Godhead”

In Chapter 11 of Text Analysis with R for Students of Literature, Matthew Jockers introduces machine clustering through an authorship attribution problem. The method compares how often high-frequency words are used in each book of an ingested corpus and computes the Euclidean distance between those frequency profiles, which gives a measure of the distance between any two books in the corpus. Books closer…
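To make the distance calculation concrete, here is a minimal sketch in Python rather than the R used in the book. It is not Jockers's code. The three tiny "books", their titles, and the helper names (`relative_frequencies`, `top_words`, `euclidean`) are all hypothetical, chosen only to show the idea: build a relative-frequency vector over the corpus's most common words for each text, then take the Euclidean distance between every pair of vectors.

```python
from collections import Counter
from itertools import combinations
import math

# Hypothetical toy corpus: placeholder texts, not data from the post or the book.
corpus = {
    "book_a": "the light of the soul and the love of the heart",
    "book_b": "the love of the heart and the light of the soul and the mind",
    "book_c": "a ship sailed to a port and a sailor went to a tavern",
}

def relative_frequencies(text):
    """Map each word to its share of all tokens in the text."""
    tokens = text.lower().split()
    counts = Counter(tokens)
    total = len(tokens)
    return {word: count / total for word, count in counts.items()}

def top_words(texts, n):
    """Return the n most frequent words across all texts combined."""
    totals = Counter()
    for text in texts.values():
        totals.update(text.lower().split())
    return [word for word, _ in totals.most_common(n)]

def euclidean(u, v):
    """Euclidean distance between two equal-length numeric vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

# One frequency profile per book, restricted to the shared high-frequency words.
freqs = {title: relative_frequencies(text) for title, text in corpus.items()}
features = top_words(corpus, 5)
vectors = {
    title: [profile.get(word, 0.0) for word in features]
    for title, profile in freqs.items()
}

# Pairwise distances: smaller values mean more similar word usage.
distances = {
    (a, b): euclidean(vectors[a], vectors[b])
    for a, b in combinations(vectors, 2)
}

for pair, d in sorted(distances.items(), key=lambda item: item[1]):
    print(pair, round(d, 3))
```

With these placeholder texts, the two books that share most of their vocabulary end up closer to each other than either is to the third. A clustering step would then group books by these pairwise distances. A real analysis would use full texts and many more feature words than the five used here.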