Personal tools
You are here: Home Collections and Tools Temporal Contexts - Datasets

Temporal Contexts - Datasets

Bag of words representation for documents from ACM-DL and MEDLINE datasets. Dataset format: doc_id;year;class;{term_id;term_frequency;}+ That is, each line corresponds to a document. The first field is the unique document identifier. The second field denotes its year of creation, and the third its class. The remaining fields are pairs of (term identifier;term frequency) . Note that each field is separated by ';'.

File ACM-DL and MEDLINE Datasets
 
Document Actions
« December 2014 »
December
MoTuWeThFrSaSu
1234567
891011121314
15161718192021
22232425262728
293031