Personal tools
You are here: Home Collections and Tools Temporal Contexts - Datasets

Temporal Contexts - Datasets

Bag of words representation for documents from ACM-DL and MEDLINE datasets. Dataset format: doc_id;year;class;{term_id;term_frequency;}+ That is, each line corresponds to a document. The first field is the unique document identifier. The second field denotes its year of creation, and the third its class. The remaining fields are pairs of (term identifier;term frequency) . Note that each field is separated by ';'.

File ACM-DL and MEDLINE Datasets
 
Document Actions
« January 2015 »
January
MoTuWeThFrSaSu
1234
567891011
12131415161718
19202122232425
262728293031