Search
Now showing items 1-2 of 2
Novel document representations based on labels and sequential information
(Georgia Institute of Technology, 2015-07-23)
A wide variety of text analysis applications are based on statistical machine learning techniques. The success of those applications is critically affected by how we represent a document. Learning an efficient document ...
Modeling and visualization of version-controlled documents
(Georgia Institute of Technology, 2011-04-05)
Version-controlled documents, such as Wikipedia or program codes in Subversion, demands a novel methodology to be analyzed efficiently. The documents are continually edited by one or more authors in contrast of the case ...