1. Homepage
  2. Programming
  3. MDS5130/IBA6205 Advanced Time Series Analysis Project: Horse Racing

MDS5130/IBA6205 Advanced Time Series Analysis Project: Horse Racing

Engage in a Conversation
CUHKMDS5130IBA6205Advanced Time Series AnalysisR

MDS5130/IBA6205 Project
CourseNana.COM

Due date: May 10, 2024 CourseNana.COM

Outstanding projects will be invited to give a presentation on April 25 , 2024. Students who have given a presentation can receive maximum 10 bonus points in their final exam. CourseNana.COM

Students who want to present their work need to submit the project by April 18, 2024. Submissions after April 18 will not be considered for presentation. All students can revise their work before May 10, 2024. CourseNana.COM

The submitted codes must be clearly written in a R file. A report to describe your analysis is required. CourseNana.COM

Background CourseNana.COM

In this project, we will analysis a dataset about horse racing. Let’s have a brief introduction of horse racing. In a particular game, there are 14 horses racing. Before a particular time tfinal, people are allowed to bet which horse can win the game. Let bi(t) be the total amount betting on horse i at time t. Note that bi(t) is increasing before tfinal. After the game, we have bi(tfinal) being bet on horse i for i = 1, . . . , 14. If horse I wins the game, people who bet on horse I can get the dividend CourseNana.COM

dfI =dI(tfinal)=(1−∇)Pnj=1bj(tfinal) bI(tfinal) CourseNana.COM

for each $1 bet, here = 0.175 is the percentage track-take. Note that the dividends di(t)= (1−∇)Pnj=1bj(t) CourseNana.COM

bi (t) CourseNana.COM

for horse i, i = 1,...,14, are known by all gamers at time t < tfinal. As bi(t) is time varying, so does di(t). CourseNana.COM

Now suppose we have some insider information and we believe that we know the “true” winning probability πi of each horse i. Since we will only make a bet on horse i if the expected return is greater than 1i, so one betting strategy is betting on horse i if dfi > 1i. However, CourseNana.COM

1 CourseNana.COM

CourseNana.COM

we don’t know dfi at time we bet (tbet). Let bi = bi(tbet), di = di(tbet), fiW be the amount we bet on horse i at tbet and Ci be the amount bet on horse i by other parties after tbet. Then CourseNana.COM

we have CourseNana.COM

dfi =(1−∇)Pnj=1(bj+Cj+fjW). bi +Ci +fiW CourseNana.COM

The unknown quantities here are Ci for i = 1, . . . , 14. Obviously, the amount of Ci’s affects CourseNana.COM

the accuracy of the strategies that are based on the values at time tbet. In this project, your CourseNana.COM

task is to analyse the time series Csum = P14 Ci. i=1 CourseNana.COM

2 Data CourseNana.COM

The datasets “data20XX.RData” with XX=14,15,16,17,18 are given. They all have the same set of column names, which are CourseNana.COM

ID: It is of the form “yyyymmddrr”, which means Year yyyy Month mm Date dd Race rr. Note that there are more than one race on each day and the number of races can be different on each day. CourseNana.COM

WIN POOL.x: The total amount in the pool at time tbet.
WIN POOL.y: The total amount in the pool at time tfinal. Hence Csum is the CourseNana.COM

difference between WIN POOL.y and WIN POOL.x.
WIN TAKE.x: = 0.175. It is the same as WIN TAKE.y. CourseNana.COM

WIN ODDS i.x: di = di(tbet). If it is 0, it means that horse i actually was not in the race. CourseNana.COM

WIN ODDS i.y: dfi = di(tfinal). If it is 0, it means that horse i actually was not in the race. CourseNana.COM

WIN MODEL i.x: “True” winning probability πi. If it is 0, it means that horse i actually was not in the race. It is the same as WIN MODEL i.y. CourseNana.COM

WIN TIME.y The “yyyymmdd” part of ID. WIN NUMBER.y The “rr” part of ID. CourseNana.COM

Tasks CourseNana.COM

In this project, you are required to forecast Csum for each race in data2018.RData. Note that you MUST only use the information BEFORE tbet to forecast the Csum in a particular race. Let N be the total number of races in 2018, xr be the true Csum on Race r, xˆr be your forecast, and fp,r be your quantile forecast with probability p = 0.95. You should include the followings in your project. CourseNana.COM

2 CourseNana.COM

CourseNana.COM

  1. (10 points) Describe clearly the model you used for forecasting xr based on the infor- mation prior to the time tbet for Race r. That is, CourseNana.COM

    xr = H(Fr,tbet) + er, (1) where Fr,tbetis the information prior to the time tbet for Race r, H is some specific CourseNana.COM

    function you need to describe, and er is the error term. CourseNana.COM

  2. (20 points) Compute the mean absolute percentage error MAPE described in Section 5.8 in the textbook “Forecasting: Principles and Practice, 3rd Ed” for you forecasts. Your codes must output the mean absolute percentage error in a variable MAPE. CourseNana.COM

  3. (20 points) Compute the quantile score Q0.95,r described in Section 5.9 in the textbook “Forecasting: Principles and Practice, 3rd Ed” for each Race r in 2018. And then report the average quantile score in a variable QS. CourseNana.COM

Please note the followings. CourseNana.COM

  1. Your work will be evaluated by other dataset, namely “data2019.RData”, that have the same set of columns of the given data set. CourseNana.COM

  2. Only the given data set and the information provided in the project can be used. Don’t use any other additional information in your analysis. CourseNana.COM

CourseNana.COM

Get in Touch with Our Experts

WeChat WeChat
Whatsapp WhatsApp
CUHK代写,MDS5130代写,IBA6205代写,Advanced Time Series Analysis代写,R代写,CUHK代编,MDS5130代编,IBA6205代编,Advanced Time Series Analysis代编,R代编,CUHK代考,MDS5130代考,IBA6205代考,Advanced Time Series Analysis代考,R代考,CUHKhelp,MDS5130help,IBA6205help,Advanced Time Series Analysishelp,Rhelp,CUHK作业代写,MDS5130作业代写,IBA6205作业代写,Advanced Time Series Analysis作业代写,R作业代写,CUHK编程代写,MDS5130编程代写,IBA6205编程代写,Advanced Time Series Analysis编程代写,R编程代写,CUHKprogramming help,MDS5130programming help,IBA6205programming help,Advanced Time Series Analysisprogramming help,Rprogramming help,CUHKassignment help,MDS5130assignment help,IBA6205assignment help,Advanced Time Series Analysisassignment help,Rassignment help,CUHKsolution,MDS5130solution,IBA6205solution,Advanced Time Series Analysissolution,Rsolution,