There is an interesting series of articles in the new york times on the benefits and dangers of using large-scale corpora and statistical methods in the analysis of literary and other texts in the humanities. The first discusses some projects that are part of the digging-into-data challenge. The second article illustrates what race horses with conspicuous names can teach us about the pitfalls of the new windfall of data (hat-tip to Kate McCurdy).
06 Dec 2010
•
discussion