
Temporal Contexts - Datasets

Bag-of-words representations of documents from the ACM-DL and MEDLINE datasets.

Dataset format: doc_id;year;class;{term_id;term_frequency;}+

Each line corresponds to one document. The first field is the unique document identifier, the second is the year of creation, and the third is the class. The remaining fields are one or more (term identifier; term frequency) pairs. Every field, including the last one, is followed by ';'.
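The line format described above can be read with a few lines of Python. This is a minimal sketch, not an official loader: the names `Document` and `parse_line` are hypothetical, the sample line is invented for illustration, and it assumes that years and term frequencies are integers while identifiers and classes are kept as plain strings.

```python
from typing import Dict, NamedTuple


class Document(NamedTuple):
    """One parsed line of the dataset (hypothetical container name)."""
    doc_id: str
    year: int
    label: str              # the "class" field; renamed because class is a keyword
    terms: Dict[str, int]   # term_id -> term_frequency


def parse_line(line: str) -> Document:
    """Parse 'doc_id;year;class;{term_id;term_frequency;}+' into a Document."""
    fields = line.strip().split(";")
    # Each field is terminated by ';', so the split leaves one empty trailing item.
    if fields and fields[-1] == "":
        fields.pop()
    # Three header fields plus at least one complete (term_id, frequency) pair.
    if len(fields) < 5 or (len(fields) - 3) % 2 != 0:
        raise ValueError(f"malformed line: {line!r}")
    doc_id, year, label = fields[0], int(fields[1]), fields[2]
    pairs = fields[3:]
    terms: Dict[str, int] = {}
    for term_id, freq in zip(pairs[0::2], pairs[1::2]):
        # Assumption: frequencies are integers; repeated term ids are summed.
        terms[term_id] = terms.get(term_id, 0) + int(freq)
    return Document(doc_id, year, label, terms)


# Invented sample line, not taken from the actual files.
doc = parse_line("42;1998;3;7;2;15;1;")
print(doc.year, doc.terms)
```

Keeping identifiers as strings avoids assuming anything about how the files encode them; convert them to integers only after checking the actual data.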

File ACM-DL and MEDLINE Datasets
 