To request access to this dataset, you will need to log in with an IMPACT account. Accounts are free. If you don't have one, please register.
The cookies in this data set were gathered from crawls of the top 100K Alexa web sites conducted in November 2013 and April 2015. Due to page request timeouts, our crawler successfully visited 95,220 (95,311) web sites. Note that the set of web sites that caused a timeout is likely an artifact of our crawler; however, even among the top 100K Alexa web sites, downtime is not uncommon. The data set is described in detail in the paper "An Empirical Study of Web Cookies" by Cahn et al., which appeared in WWW '16.