This is a non-IMPACT record, meaning that access to the data is not 
												controlled by IMPACT. For access, see the directions below.
												
Disclaimer:
This Resource is offered and provided outside of the IMPACT mediation framework. IMPACT and the IMPACT Coordination Council/Blackfire Technology, Inc. expressly disclaim all conditions, representations and warranties including but not limited to Resource availability, quality, accuracy, non-infringement, and non-interference. All Resource information and access is controlled by entities and under terms that are external to the IMPACT legal framework.
										Disclaimer:
This Resource is offered and provided outside of the IMPACT mediation framework. IMPACT and the IMPACT Coordination Council/Blackfire Technology, Inc. expressly disclaim all conditions, representations and warranties including but not limited to Resource availability, quality, accuracy, non-infringement, and non-interference. All Resource information and access is controlled by entities and under terms that are external to the IMPACT legal framework.
Summary
DS-1263
														Malware Training Sets
														External Dataset
														External Data Source
                                                        GitHub
                                                        Unknown
														Unknown
    														56 (lowest rank is 56)
                                                        Description
Aim of the project is to provide an useful and classified dataset to researchers who want to investigate deeper in malware analysis by using Machine Learning techniques.
One of the most challenging tasks during Machine Learning processing is to define a great training (and possible dynamic) dataset. Assuming a well known learning algorithm and a periodic learning supervised process what you need is a classified dataset to best train your machine. Thousands of training datasets are available out there, but no great classified datasets for malware analyses exist. This dataset was created to share with the scientific community (and everybody interested on it) in order to give to everyone a base point to start with Machine Learning for Malware Analysis.
The collected dataset is composed by the following samples:
APT1 292 Samples
Crypto 2024 Samples
Locker 434 Samples
Zeus 2014 Samples
Additional Details
																31.6MB
															
														false
														Unknown
														malware, training, 1263, malware training sets, sets, corporation, external, inferlink corporation, inferlink, external data source, source, learning, dataset, machine, classified, analysis, aim, investigate, provide, techniques, project, researchers, deeper, samples, datasets, 2014, assuming, supervised, composed, 292, start, define, share, process, base, scientific, periodic, algorithm, tasks, dynamic, 2024, 434, locker, challenging, collected, train, created, other, processing, analyses, zeus, exist, crypto, community, thousands, apt1