In Chapter 11 of Text Analysis with R for Students of Literature, Matthew Jockers introduces machine clustering through an authorship attribution problem. That is, by comparing different usages of high frequency words in an ingested corpus and calculating the Euclidean metric between them, the distance between different books in the corpus can be determined. Books closer…Read more Machine clustering and “The Flowing Light of the Godhead”
text mining
Using R to analyze “The Flowing Light of the Godhead”
A few weeks ago I started learning the R programming language for use in quantitative and computational approaches to textual analysis. Although my areas of expertise are in 3D visualization and simulation, it seemed to me that a lot of work in the digital humanities is in textual analysis (e.g., text mining and topic modeling)…Read more Using R to analyze “The Flowing Light of the Godhead”