CPE 466: Knowledge Discovery in Data
Lab 3 materials

Iris Dataset

The Iris dataset is one of the most popular machine learning datasets. We use it to examine how C4.5 handles numeric attributes.
UCI Machine Learning Repository Page for the dataset HTML
Dataset Description iris.names
Dataset iris.data
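
As a rough illustration of how a decision tree learner can treat a numeric attribute, the sketch below picks a binary split threshold by information gain. It is a simplified sketch, not the C4.5 implementation: candidate thresholds are taken as midpoints between consecutive distinct values, and the gain ratio that C4.5 uses when comparing attributes is left out. The function names (entropy, best_threshold) and the six sample values are hypothetical and are not read from iris.data.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def best_threshold(values, labels):
    """Return (threshold, gain) for the best binary split 'value <= threshold'.

    Assumption: candidates are midpoints between consecutive distinct
    sorted values. Returns (None, 0.0) if no split improves on the parent.
    """
    pairs = sorted(zip(values, labels), key=lambda p: p[0])
    n = len(pairs)
    base = entropy(labels)
    best_t, best_gain = None, 0.0
    for i in range(1, n):
        lo, hi = pairs[i - 1][0], pairs[i][0]
        if lo == hi:
            continue  # no threshold can separate equal values
        left = [lab for _, lab in pairs[:i]]
        right = [lab for _, lab in pairs[i:]]
        gain = (base
                - (len(left) / n) * entropy(left)
                - (len(right) / n) * entropy(right))
        if gain > best_gain:
            best_t, best_gain = (lo + hi) / 2, gain
    return best_t, best_gain


# Made-up petal-length-style values for two classes (illustrative only).
values = [1.4, 1.5, 1.3, 4.7, 4.5, 5.1]
labels = ["setosa", "setosa", "setosa",
          "versicolor", "versicolor", "versicolor"]
threshold, gain = best_threshold(values, labels)
```

On this toy input the two classes are separated perfectly by a single threshold, so the split removes all of the parent node's entropy. On the real data, the same search would be run once per numeric attribute at each node.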

Letter Recognition Dataset

The Letter Recognition dataset provides a list of features extracted from images of letters; the task is to identify each letter correctly from those features.
UCI Machine Learning Repository Page for the dataset HTML
Dataset Description letter-recognition.names.txt
Dataset letter-recognition.data.csv

Wine Quality Dataset

UCI Machine Learning Repository Page for the dataset HTML
Dataset Description winequality.names.txt
Dataset
Red Wines winequality-red-fixed.csv
White Wines winequality-white-fixed.csv

Credit Approval Dataset

UCI Machine Learning Repository Page for the dataset HTML
Dataset Description crx.names.txt
Dataset crx.data.csv

Seeds Dataset

UCI Machine Learning Repository Page for the dataset (Dataset Description) HTML
Dataset seeds_dataset.csv

Mushroom Dataset

UCI Machine Learning Repository Page for the dataset HTML
Dataset Description agaricus-lepiota.names.txt
Dataset agaricus-lepiota.data.csv

April 19, 2018 dekhtyar at calpoly.edu