CSCI 479 -- Machine Learning
Spring 2026 - Assignment 3
Submit deadline: 10:00, 6 March 2026, Friday

Problem Description:

The data set used in this assignment comes from the Car Evaluation Data Set that is available at http://archive.ics.uci.edu/ml/datasets/Car+Evaluation. The data set is donated by its creator Marko Bohanec at 1997.

The data set used in this assignment is stored in the csv file CarData.csv. There are 1728 instances in this data set.

There are 6 descriptive attributes in this data set:

The target attribute, EVALUATION, provides a description of the car's value as unacceptable (unacc), acceptable (acc), good (good), or very good (vgood).

Your tasks:

Overall, your task is to build a Naive Bayes Classifier.

Specifically, you need to perform the following tasks and document the process along the way:

  1. Generate and store all the probabilities that will be used by your Naive Bayes Classifier based on the given data in CarData.csv. If any of these probabilities is 0, use Laplace smoothing algorithm wih k=2 to smooth the noises. (You can use Your Lab 4 program and result with the extension of Laplace smoothing if needed.)
  2. Design and implement the Naive Bayes Classifier, and generate the class labels for each data item that's in the training data set CarData.csv, regardless what target class label the data item actually has. Save your predication result to a csv or text file.
  3. Implement a program to compare the class labels generated by your Naive Bayes Classfier and their given class labels, and fill the following table with corresponding counts or percentages:
    Your Result
    unacc acc good vgood
    Given Result unacc
    acc
    good
    vgood

What to Submit:

Submit a document that explains the whole process of building, applying and evaluating the Naive Bayes Classifier using the given Car Data set. Your document should include at least the following sections:

How to submit:

Choose one of the following two ways to submit your work:


Last updated: February 3, 2026