CPE 466: Knowledge Discovery in Data
Lab 3 materials

Part 1

Balloons Datasaet

UCI Machine Learning Repository Page for the dataset HTML
Dataset Description balloons.names.txt
Dataset: Adult Or Stretch adult-stretch.csv
Dataset: Adult And Stretch adult+stretch.csv
Dataset: Yellow and Small yellow-small.csv
Dataset: Yellow and Small or Adult and Stretch yellow-small+adult-stretch.csv

Nursery Datasaet

UCI Machine Learning Repository Page for the dataset HTML
Dataset Description nursery.names.txt
Dataset nursery.csv

IRIS Datasaet

Iris dataset is one of the most popular machine learning datasets. We use it to look at how C4.5 handles numeric attributes.
UCI Machine Learning Repository Page for the dataset HTML
Dataset Description iris.names
Dataset iris.data

Letter Recognition Datasaet

Letter Recognition dataset presents a list of features extracted from images of letters and asks you to properly identify the letters.
UCI Machine Learning Repository Page for the dataset HTML
Dataset Description letter-recognition.names.txt
Dataset letter-recognition.data.csv

Sample Decision Tree Format


October 11, 2023 dekhtyar at calpoly.edu