Datasets


This website provides access to a variety of datasets suitable for data mining and machine learning experimentation.  Links in the table below link to .zip folders that contain .xls, .csv, and .arff versions of the dataset.  The .zip folder also includes .pdf file descriptions of the dataset, as well as .pdf reference articles, where available.

 

Each .arff file was created from the associated .csv file, which was created from the associated .xls file.  Each .arff dataset has been loaded into Weka Explorer to insure that the file is in a useable format.

 

The datasets have been collected from a number of publically-accessible, including primarily the UC Irvine Machine Learning Repository.

 

Dataset

Folder

CPU Performance

CPUPerformance.zip

Diabetes Diagnosis

DiabetesDiagnosis.zip

Edible Mushrooms

EdibleMushrooms.zip

Fisher's Iris Dataset

FishersIrisDataset.zip

Fractionation Column

FractionationColumn.zip

Gamma Ray Bursts

GammaRayBursts.zip

Landform Identification

LandformIdentification.zip

Sensor Discrimination

SensorDiscrimination.zip

To Play or Not To Play

ToPlayOrNotToPlay.zip

To Play Or Not To Play Numeric

ToPlayOrNotToPlayNumeric.zip

Voting Record Yay Nay

VotingRecordYayNay.zip

Wine Cultivars

WineCultivars.zip

Wine Quality

WineQuality.zip

 


For problems with this web site, contact the webmaster