1. Homepage
  2. Homework
  3. ST117 Introduction to Statistical Modelling 2024 Final project assignment for WR
This question has been solved

ST117 Introduction to Statistical Modelling 2024 Final project assignment for WR

Engage in a Conversation
WarwickST117Introduction to Statistical ModellingEDAexploratory data analysisR

ST117 2024 Final project assignment for WR (written report) – Phase 1 CourseNana.COM

Your written report will be a summary of a real-world data analysis project using the UK ECN repository introduced in the last lecture in Week 10 of Term 2. The assignment is released in two phases. The first phase (released along with Log3) is detailed below. The second phase will be released along with Log4. CourseNana.COM

Phase 1: Scientific background, study design, data download, data subsets, and EDA CourseNana.COM

The goal of this phase is to download and familiarise yourself with the datasets and their context. This involves reading about the scientific methods used to collect the data and use R to explore them from many angles. CourseNana.COM

  1. ECN is a UK-based multi-agency programme with funding and monitoring from a consortium of UK government departments and agencies. The network is coordinated by staff at the UK Centre for Ecology & Hydrology (UKCEH). UKCEH manage the data generated by the programme, which are stored in a central database and are made available for research and education. ECN is a highly valuable long-term data collection that started in the early 1990s. CourseNana.COM

    "In its first two decades of operation the ECN has accumulated a robust set of baseline data that describe environmental and biological variability across a range of habitats in unprecedented detail. With appropriate, informed development, these should prove invaluable in discerning the causes and consequences of environmental change for decades to come." (Sier & Monteith, 2016) CourseNana.COM

  2. The scientific basis for the ECN data repository is explained in Rennie, S. (2016). Providing information on environmental change: Data management, discovery and access in the UK Environmental Change Network Data Centre. Ecological Indicators, 68, 13-20. DOI: 10.1016/j.ecolind.2016.01.060 (see Moodle link) CourseNana.COM

  3. The ECN homepage https://ecn.ac.uk is your starting point. Get an overview of the subpages available there, specifically: CourseNana.COM

    • information about the terrestrial monitoring locations https://ecn.ac.uk/sites/site/terr CourseNana.COM

    • publications based on ECN data https://ecn.ac.uk/what-we-do/science/20yrs-si- CourseNana.COM

      keypoints CourseNana.COM

    • available datasets https://ecn.ac.uk/data/available-data CourseNana.COM

  4. The data is actually held at UK CEH. The landing page for selecting datasets to download is https://catalogue.ceh.ac.uk/documents/4971bce4-8a81-4e23-9637-7fcff37c5f21 CourseNana.COM

  5. Plan where you store the data, supplementary information and R code for this project in your computer. For example, you can create an R project to store everything relevant in the same location (see Log2). Download the raw data and the supporting documentation for the following UK ECN datasets: CourseNana.COM

Page 1 of 2 (please continue reading on next page) CourseNana.COM

CourseNana.COM

  1. Write a summary describing the aspect of these datasets that are most essential for statistical analyses (e.g. focus on when, where, and how the data was collected, as well as the study objectives). (Your summary should not exceed 1 page in 11pt font.) CourseNana.COM

  2. Using the documentation, draw diagrams that visualise the structure of these datasets taking into account the spatial and temporal structure of the data collection and, where applicable, the mode of data collection and recording schedule. CourseNana.COM

  3. Carry out exploratory data analysis (EDA) for the bat, bird, and moth data. In selecting your summaries and plots keep in mind that the goal is to familiarise yourself with the data considering: CourseNana.COM

    • guidance and resources in the EDA paragraph of the Log3 section on Data Analysis Cycle; CourseNana.COM

    • using Tidy R (see Log3) to administer the data structures using the piping technique CourseNana.COM

      (which can be more readable that iterated subsetting and brackets); CourseNana.COM

    • recall the resources about data visualisations given for Activities 1 and 2; CourseNana.COM

    • the summary and diagram from Steps 6 and 7 can guide you in selecting suitable plots; CourseNana.COM

    • EDA includes data quality analysis (DQA), as mentioned in Log3 as well; CourseNana.COM

    • if there are obvious recording error you may fixed those before further analysis, but you CourseNana.COM

      should leave a note in your report about this for transparency; CourseNana.COM

    • most of all we are interested in the counts (in the datasets these are in the column CourseNana.COM

      VALUE) of the animals obtained during the recording periods which can be visualised from a variety of perspectives (across years, across recording periods, across sites, etc). CourseNana.COM

  4. By Tuesday you will receive an email with the codes for 3 bat species, 4 bird species, 5 moth species, and 4 locations. These are specific to your report pod. CourseNana.COM

    • Carry out a more detailed descriptive analyses of these species (in the datasets this is the column FIELDNAME). CourseNana.COM

    • Compare your observations with those of the full datasets of bats, birds, and moths, respectively. CourseNana.COM

  5. The meteorology data is scattered around many (big) files so EDA for all of them would be beyond the scope of this project. However, do carry out EDA for the meteorological variables for the year in which you were born. CourseNana.COM

The outputs of steps 6 to 10 will be used as a basis for your report writing. Note you want to also explain what you intended to show with a certain type of figure and what it does indeed show for the data, including what that means in the real-world context. (Detailed guides and submissions templates for the final version will be given along with Phase 2 of the assignment.) CourseNana.COM

Page 2 of 2  CourseNana.COM

Get in Touch with Our Experts

WeChat (微信) WeChat (微信)
Whatsapp WhatsApp
Warwick代写,ST117代写,Introduction to Statistical Modelling代写,EDA代写,exploratory data analysis代写,R代写,Warwick代编,ST117代编,Introduction to Statistical Modelling代编,EDA代编,exploratory data analysis代编,R代编,Warwick代考,ST117代考,Introduction to Statistical Modelling代考,EDA代考,exploratory data analysis代考,R代考,Warwickhelp,ST117help,Introduction to Statistical Modellinghelp,EDAhelp,exploratory data analysishelp,Rhelp,Warwick作业代写,ST117作业代写,Introduction to Statistical Modelling作业代写,EDA作业代写,exploratory data analysis作业代写,R作业代写,Warwick编程代写,ST117编程代写,Introduction to Statistical Modelling编程代写,EDA编程代写,exploratory data analysis编程代写,R编程代写,Warwickprogramming help,ST117programming help,Introduction to Statistical Modellingprogramming help,EDAprogramming help,exploratory data analysisprogramming help,Rprogramming help,Warwickassignment help,ST117assignment help,Introduction to Statistical Modellingassignment help,EDAassignment help,exploratory data analysisassignment help,Rassignment help,Warwicksolution,ST117solution,Introduction to Statistical Modellingsolution,EDAsolution,exploratory data analysissolution,Rsolution,