CPE 369: Distributed Computing
Lab 6 materials

Student Name Filter
  Program template:   StudentFilter.java
  Data file:          students.csv (HDFS location: /data/students.csv)
  Instructor's jar:   lab6.jar (main class: StudentListFilter)
  Instructor's run:   filter-run-alex.txt

Note: the instructor's sample run file shows errors in starting some of the map tasks (the run succeeded on the third try, after two mappers had failed). While this is not ideal behavior, a successful run in which some map or reduce tasks failed will be accepted, as long as the proper output was generated. The failures of the map() and reduce() containers are largely due to the brittleness of our Hadoop cluster.


Feb 15, 2017 dekhtyar at calpoly.edu