CPE 466: Knowledge Discovery in Data
Lab 3 materials

Part 1

Balloons Datasaet

UCI Machine Learning Repository Page for the dataset HTML
Dataset Description balloons.names.txt
Dataset: Adult Or Stretch adult-stretch.csv
Dataset: Adult And Stretch adult+stretch.csv
Dataset: Yellow and Small yellow-small.csv
Dataset: Yellow and Small or Adult and Stretch yellow-small+adult-stretch.csv

Mushroom Datasaet

UCI Machine Learning Repository Page for the dataset HTML
Dataset Description agaricus-lepiota.names.txt
Dataset agaricus-lepiota.csv

Nursery Datasaet

UCI Machine Learning Repository Page for the dataset HTML
Dataset Description nursery.names.txt
Dataset nursery.csv

Sample Decision Tree Format

Part 2

IRIS Datasaet

Iris dataset is one of the most popular machine learning datasets. We use it to look at how C4.5 handles numeric attributes.
UCI Machine Learning Repository Page for the dataset HTML
Dataset Description iris.names
Dataset iris.data

Letter Recognition Datasaet

Letter Recognition dataset presents a list of features extracted from images of letters and asks you to properly identify the letters.
UCI Machine Learning Repository Page for the dataset HTML
Dataset Description letter-recognition.names.txt
Dataset letter-recognition.data.csv

Wine Quality Datasaet

UCI Machine Learning Repository Page for the dataset HTML
Dataset Description winequality.names.txt
Dataset
Red Wines winequality-red-fixed.csv
White Wines winequality-white-fixed.csv

Credit Approval Datasaet

UCI Machine Learning Repository Page for the dataset HTML
Dataset Description crx.names.txt
Dataset crx.data.csv

Heart Disease Dataset
Kaggle Page for the dataset HTML
Dataset heart.csv

October 14, 2021 dekhtyar at calpoly.edu