1. Homepage
  2. Programming
  3. Fundamentals of Data Science Assignment 1: Predictive modeling based on decision trees

Fundamentals of Data Science Assignment 1: Predictive modeling based on decision trees

Contact Us On WeChat
USUCIFundamentals of Data SciencePredictive modeling based on decision treesPythonGraphViz

Fundamentals of Data Science Assignment 1 CourseNana.COM

In this assignment, you will implement a predictive modeling approach based on the decision tree. CourseNana.COM

Detailed Requirement CourseNana.COM

We have introduced a predictive modeling approach based on the decision tree in the class. In this assignment, you will implement and evaluate this approach on the Vertebral Column dataset from the UCI Machine Learning Repository CourseNana.COM

You should partition the dataset into two subsets: one for training and the other for evaluation. The partitioning should be performed in such a way that the proportions of data records belonging to the different classes in the training set and test set should be similar to those of the original dataset. CourseNana.COM

Please note that there are two versions of the Vertebral Column dataset. Please use the version in which the orthopedic patients are categorized into three classes (disk hernia (DH), spondylolisthesis (SL) or normal (NO)). CourseNana.COM

You can implement a decision tree model using the Python package scikit-learn, and visualize the model by installing the package, e.g., python-graphviz. CourseNana.COM

You may refer to the following references for more details about Python and its packages. CourseNana.COM

  • Data mining tutorials using Python

(https://www.cse.msu.edu/~ptan/dmbook/software) CourseNana.COM

  • Scikit-learn website (https://scikit-learn.org)

Assignment Submission CourseNana.COM

You should submit a report to summarize your work. The following tasks are to be performed: CourseNana.COM

a.     Construct multiple decision trees based on different partitions of the dataset into a training set and a test set. You should clearly specify which impurity measure you have used for tree construction, and the parameters you have selected. (25%) CourseNana.COM

b.     Compare the structures and classification performances of these different trees. (25%) CourseNana.COM

c.     For selected trees, observe the classification performance associated with the different classes, and determine which pair(s) of classes are likely to be confused with each other. (25%) CourseNana.COM

d. For selected confused class pairs in c., identify the corresponding leaf node(s) and analyze the sequence of decisions that lead to the misclassification. (25%) CourseNana.COM

Please provide a detailed description of the results of the above tasks in your report. CourseNana.COM

Get Expert Help On This Assignment

Scan above qrcode with Wechat

US代写,UCI代写,Fundamentals of Data Science代写,Predictive modeling based on decision trees代写,Python代写,GraphViz代写,US代编,UCI代编,Fundamentals of Data Science代编,Predictive modeling based on decision trees代编,Python代编,GraphViz代编,US代考,UCI代考,Fundamentals of Data Science代考,Predictive modeling based on decision trees代考,Python代考,GraphViz代考,UShelp,UCIhelp,Fundamentals of Data Sciencehelp,Predictive modeling based on decision treeshelp,Pythonhelp,GraphVizhelp,US作业代写,UCI作业代写,Fundamentals of Data Science作业代写,Predictive modeling based on decision trees作业代写,Python作业代写,GraphViz作业代写,US编程代写,UCI编程代写,Fundamentals of Data Science编程代写,Predictive modeling based on decision trees编程代写,Python编程代写,GraphViz编程代写,USprogramming help,UCIprogramming help,Fundamentals of Data Scienceprogramming help,Predictive modeling based on decision treesprogramming help,Pythonprogramming help,GraphVizprogramming help,USassignment help,UCIassignment help,Fundamentals of Data Scienceassignment help,Predictive modeling based on decision treesassignment help,Pythonassignment help,GraphVizassignment help,USsolution,UCIsolution,Fundamentals of Data Sciencesolution,Predictive modeling based on decision treessolution,Pythonsolution,GraphVizsolution,