Homepage
Programming
Final Project Prediction on Movie Rating

Final Project Prediction on Movie Rating

Engage in a Conversation

Final Project

Prediction on movie rating

In final project, we have a IMDB movie dataset, which includes thousands of movies. In the training data, all movies have been rated, ranging from 1-10. The goal of this project is to analyse data, build a machine learning model on the training data, and predict rates for movies in the testing data. CourseNana.COM

The final grade will be a combination of data processing, exploratory data analysis, machine learning model building, implementation and report writing. CourseNana.COM

Data

This data set is in attributes-in-columns format, comma-separated values. It contains two files: CourseNana.COM

train.csv: training data, consisting of 50,000 instances with 8 features and the target “rate”. CourseNana.COM

test: test data, consisting of 10,000 unlabelled instances with 8 features. CourseNana.COM

NOTE: the target “rate” in the data is continuous numbers (e.g., 0.2, 5.8, 8.5, etc.). You can choose either of the following options to make predictions: CourseNana.COM

· Build regression models to predict the continuous ratings. CourseNana.COM

· Transform continuous ratings to integer numbers (e.g., 1, 5, 9, etc.), and consider them as 11 different classes, so that you can build classification models for the predictions. CourseNana.COM

Requirements

I. Coding CourseNana.COM

1. Data pre-processing and Exploratory data analysis CourseNana.COM

a. Missing data – remove instances that contain missing values CourseNana.COM

b. Abnormal data – outliers, inconsistent data format/type, wrong values, etc. CourseNana.COM

c. Distribution analysis CourseNana.COM

d. Plotting, to help you understand your data CourseNana.COM

2. Feature engineering CourseNana.COM

a. Feature selection – if needed, drop irrelevant features CourseNana.COM

b. Transformation – if needed, transform categorical features into multiple binary features CourseNana.COM

c. Normalisation CourseNana.COM

3. Model building CourseNana.COM

a. Build at least two machine learning models on training data CourseNana.COM

b. Evaluate your models and pick up the best model for the prediction in the next step CourseNana.COM

c. (Optional) Hyperparameter tuning CourseNana.COM

4. Prediction CourseNana.COM

a. Use your best model to make predictions on test data CourseNana.COM

b. Save your prediction results into a CSV file with the following format, the first column: movie_id and the second column: rate (your predictions): CourseNana.COM

movie_id CourseNana.COM	rate CourseNana.COM
test_1 CourseNana.COM	3 CourseNana.COM
test_2 CourseNana.COM	9 CourseNana.COM
test_3 CourseNana.COM	7 CourseNana.COM

CourseNana.COM

II. Report writing CourseNana.COM

A 1000-1500 report detailing: CourseNana.COM

o The steps to pre-process your data CourseNana.COM

o How did you perform the feature engineering CourseNana.COM

o Machine learning models building and the performance evaluation CourseNana.COM

o Conclusions CourseNana.COM

Submission

You are required to submit the following files: CourseNana.COM

· Jupyter notebook (40’) CourseNana.COM

o Jupyter notebook in HTML file, named as “Student ID_notebook.html” CourseNana.COM

o Code has been well documented (adding markdown/comments to explain your code) CourseNana.COM

· Predicted result (20’) CourseNana.COM

o Prediction results in CSV file, named as “Student ID_prediction.csv” CourseNana.COM

o The file should contain two columns: movie_id and predicted rates CourseNana.COM

· Report (40’) CourseNana.COM

o Save your report in PDF file, named as “Student ID_report.pdf” CourseNana.COM

Please zip all your submissions and include your student ID into the name of the submission file, e.g., “Student ID_submission.zip". CourseNana.COM

CourseNana.COM

Get in Touch with Our Experts

WeChat (微信)

Last: [2022] ECON 178 - Economic&Business Forecasting - Final Project Guidelines: Wealth Prediction

Next: [2022] Data100 - Principles and Techniques of Data Science - Homework 6: Modeling and Analyzing COVID-19 Cases

IMDB代写,regression models代写,classification models代写,Feature Engineering代写,Movie Rating代写,IMDB代编,regression models代编,classification models代编,Feature Engineering代编,Movie Rating代编,IMDB代考,regression models代考,classification models代考,Feature Engineering代考,Movie Rating代考,IMDBhelp,regression modelshelp,classification modelshelp,Feature Engineeringhelp,Movie Ratinghelp,IMDB作业代写,regression models作业代写,classification models作业代写,Feature Engineering作业代写,Movie Rating作业代写,IMDB编程代写,regression models编程代写,classification models编程代写,Feature Engineering编程代写,Movie Rating编程代写,IMDBprogramming help,regression modelsprogramming help,classification modelsprogramming help,Feature Engineeringprogramming help,Movie Ratingprogramming help,IMDBassignment help,regression modelsassignment help,classification modelsassignment help,Feature Engineeringassignment help,Movie Ratingassignment help,IMDBsolution,regression modelssolution,classification modelssolution,Feature Engineeringsolution,Movie Ratingsolution,