This is a non-IMPACT record, meaning that access to the data is not controlled by IMPACT. For access, see the directions below.

Disclaimer:
This Resource is offered and provided outside of the IMPACT mediation framework. IMPACT and the IMPACT Coordination Council/Blackfire Technology, Inc. expressly disclaim all conditions, representations and warranties including but not limited to Resource availability, quality, accuracy, non-infringement, and non-interference. All Resource information and access is controlled by entities and under terms that are external to the IMPACT legal framework.

Summary

DS-1370
Yahoo Password Frequency Corpus
External Dataset
External Data Source
figshare.com
Unknown
Unknown
56 (lowest rank is 56)

Category & Restrictions

Other
cyber defense
Unrestricted
true

Description


This dataset includes sanitized password frequency lists collected from Yahoo in May 2011.

Each of the 51 .txt files represents one subset of all users' passwords observed during the experiment period. "yahoo-all.txt" includes all users; every other file represents a strict subset of that group.

Each file is a series of lines of the format:

FREQUENCY #OBSERVATIONS
...

with FREQUENCY in descending order. For example, the file:

3 1
2 1
1 3

would represent a the frequency list (3, 2, 1, 1, 1), that is, one password observed 3 times, one observed twice, and three separate passwords observed once each.

Additional Details

127.6KB
false
Unknown
frequency, password, yahoo, yahoo password frequency corpus, 1370, corpus, source, external, corporation, inferlink corporation, external data source, inferlink, includes, sanitized, lists, dataset, collected, 2011, observed, file, passwords, txt, users, subset, represents, files, list, series, other, format, separate, descending, times, experiment, observations, strict, lines, period, represent