Analyzing Large Collections of Electronic Text Using OLAP

Abstract

Computer-assisted reading and analysis of text has various applications in the humanities and social sciences. The growing size of many electronic text archives allows a more complete analysis, but results take longer to obtain. On-Line Analytical Processing (OLAP) is a method for storing and quickly analyzing multidimensional data. By storing text analysis information in an OLAP system, a user can obtain answers to queries in a matter of seconds rather than minutes, hours, or even days. The analysis is user-driven, giving each user the freedom to pursue their own direction of research.

Keywords

OLAP, Data Warehouse, Warehouse of Words, CARAT, Literary research

Reference

Steven Keith, Owen Kaser, Daniel Lemire, Analyzing Large Collections of Electronic Text Using OLAP, UNBSJ CSAS Technical Report TR-05-001, June 2005.

BibTeX

@TechReport{KeyKaserLemireTR05001,
   author    = {Steven Keith and Owen Kaser and Daniel Lemire},
   title     = {Analyzing Large Collections of Electronic Text Using OLAP},
   institution = {UNBSJ CSAS},
   year      = {2005},
   month     = {June},
   number    = {TR-05-001},
   url       = {http://www.daniel-lemire.com/fr/documents/publications/tr05-001.pdf}
}
