7 country_data/United States.csv Daily observations of vaccination in the US. 8 country data/France.csv 9 country_data/I

Business, Finance, Economics, Accounting, Operations Management, Computer Science, Electrical Engineering, Mechanical Engineering, Civil Engineering, Chemical Engineering, Algebra, Precalculus, Statistics and Probabilty, Advanced Math, Physics, Chemistry, Biology, Nursing, Psychology, Certifications, Tests, Prep, and more.
Post Reply
answerhappygod
Site Admin
Posts: 899603
Joined: Mon Aug 02, 2021 8:13 am

7 country_data/United States.csv Daily observations of vaccination in the US. 8 country data/France.csv 9 country_data/I

Post by answerhappygod »

7 Country Data United States Csv Daily Observations Of Vaccination In The Us 8 Country Data France Csv 9 Country Data I 1
7 Country Data United States Csv Daily Observations Of Vaccination In The Us 8 Country Data France Csv 9 Country Data I 1 (173.72 KiB) Viewed 74 times
Please Solve B C AND D Part as per instruction
7 country_data/United States.csv Daily observations of vaccination in the US. 8 country data/France.csv 9 country_data/Israel.csv Daily observations of vaccination in France. Daily observations of vaccination in Israel. Table 1: List of data files To complete the tasks in the following sections, you are required to review and analyse the dataset that is available in the named files. Part B: Designing the Database (10%) Task B.1 Produce an ER diagram for a relational database that will be able to store the given dataset. It is important to note that the given CSV files are not necessarily representing a good design for a relational database. It is your task to design a database that will adhere to good design principles that were taught throughout the course. This means your database schema will not match the structure of the CSV files and, therefore, you will require to manipulate the structure of the dataset (and not the data itself) to import it into your database. Importing the data is required to complete Task C.2. The ER diagram must be produced by Lucidchart similar to the exercises that were completed in in the course. UML notation is expected and using other notations will not be acceptable. Including a high-quality image representing your model is important, which can be achieved using Export function of Lucidchart.. You are also required to transform the ER diagram into a database schema that will be used in the next part of the assignment. Creating a good database design typically involves some database normalisation activities. You should document your normalisation activities and support them with good reasoning. This typically involves explaining what the initial design was, what the problem was, and what changes have been made to rectify the issue. The expected outcome of completing this task is one PDF file named model.pdf containing the following sections. 1. Database ER diagram and, if needed, a reasonable set of assumptions. 2. Explanation of normalisation challenges and the resulting changes. 3. Database schema. Part C: Creating the Database and Importing Data (10%) Task C.1 Produce one SQL script file named database.sql. This script file requires all the SQL statements necessary to create all the database relations and their corresponding integrity constraints as per your proposed design in Part B. The script file must run without any errors in SQLite Studio and contain necessary commenting to separate various relations. Note that this script is not supposed to store any data into the relations. The expected outcome of completing this task is one script file with the specific name of database.sql. Task C.2 Create a database file named Vaccinations.db. Import the given dataset into your database. To complete this task, you may need to change the format of the CSV files to match the attributes of your designed database. You can use a spreadsheet editor such as Microsoft Excel. The next step is to import the spreadsheets into the database you create in SQLite Studio. To complete this task, use the menu option Tools - Import in SQLite. The expected outcome of completing this task is one database file named Vaccinations.db, which must contain all the data that is stored in the CSV files named in Table 1. Part D: Data Retrieval and Visualisation (15%) Now that you have created and populated a database, it is time to create some queries to investigate the data in various ways. In addition to writing the required queries, you are also asked to produce data visualisation for the results of your queries. The tasks in this section represent the queries that must be supported. Each query must consist of one SQL statement. It would be acceptable to use several nested queries, combine several SELECT statements with various operators etc. However, it would not be acceptable to have multiple and separated queries for each task. After you have written each query, you are expected to produce a data visualisation for each result set. You have the freedom to choose the tool for creating your visuals (e.g., Excel, Google Charts, Tableau) as well as the visualisation techniques (e.g., charts, plots, diagrams, maps). Completing this portion of the work will require that you understand the nature of the results of each query, undertake research to choose a visualisation tool you are comfortable with, decide about the best technique to visually represent each result set, and produce the visualisation. Answers to tasks in Part D that are not supported by a visualisation can achieve up to 80% of the grade associated with each task. The expected outcome of completing this task is as follows. 1. One SQL script file named Queries.sql containing all the queries developed for the tasks in this section. It is important that you add comment lines to separate the queries and indicate which task they belong to. Note that valid SQL comments must not generate errors in SQLite Studio. The marker of your work will use this file to execute and test your queries. 1. A PDF file namned QuerieResults.pdf containing the following elements for each task. a. The SQL query

b. a snapshot of the first 10 results of your query. The snapshot must also show the total number of results retrieved by the query. A sample snapshot is provided below for your reference. 52. GEDR 21 Tod: Inngange Kheekhoe Ndaba Shona Twara Norhsotho Southethe S Figure 1: Sample results snapshot with total rows C. Data visualisation List of Tasks Task D.1 For any two given dates (i.e., you can assume any two dates, e.g., 1 April and 3 April), list the dates, the total number of vaccines administered in each observation date in each of all countries, and the difference between the administered vaccines. Each row in the result set must have the following structure. (Note: OD2 is after OD1) Observation Date 1 (OD1) Country Name Administered Observation Administered Vaccine on Date 2 Vaccine on OD1 (VOD1) (OD2) OD2 (VOD2) Difference of totals (VOD1-VOD2) (CN) Figure 2: Column Headers in the Result Set for Task D.1 Task D.2 Find the countries with the cumulative numbers of COVID-19 doses administered by each bigger than the average doses administered by all countries. Produces a result set containing the name of each country and the cumulative number of doses administered in that country. Each row in the result set must have the following structure. Country Cumulative Doses Figure 3: Column Headers in the Result Set for Task D.2 Task D.3 Produce a list of 10 countries with the biggest numbers of vaccine types, with the type of vaccines (e.g., Oxford/AstraZeneca, Pfizer/BioNTech) administered in each country. For a country that has administered several types of vaccine, the result set is required to show several tuples a San 4 Airkaam 7 English reporting each type of vaccine in a separate tuple. Each row in the result set must have the following structure. Country Vaccine Type Figure 4: Column Headers in the Result Set for Task D.3. Task D.4 There are different data sources used to produce the dataset. Produce a report showing the total number of vaccines administered according to each data source (i.e., each unique URL). Order the result set by source name and URL Each row in the result set must have the following structure. Source URL Source Name Total Administered Vaccines Figure 5: Column Headers in the Result Set for Task D.4 Task D.5 How does various countries compare in the speed of their vaccine administration? Produce a report that lists all the observation dates in 2022 and, for each date, list the total number of people fully vaccinated in each one of the 4 countries used in this assignment. [Date, Australia, United States, England, China) Date Australia United States England Chinal Figure 6: Column Headers in the Result Set for Task D.5 Submission Format You are required to submit the files with the exact names as below. 1. Model.pdf 2. Database.sql 3. Vaccinations.db 4. Queries.sql 5. Queries.pdf In the previous sections of the assignment, the expected content of each of the files is explained in detail. Referencing guidelines Use RMIT Harvard referencing style for this assessment. You must acknowledge all the courses of information you have used in your assessments. Refer to the RMIT Easy Cite referencing tool to see examples and tips on how to reference in the appropriated style. You can also refer to the library referencing page for more tools such as EndNote, referencing tutorials and referencing guides for printing. Academic integrity and plagiarism Academic integrity is about honest presentation of your academic work. It means acknowledging the work of others while developing your own insights, knowledge, and ideas. You should take extreme care that you have:
Join a community of subject matter experts. Register for FREE to view solutions, replies, and use search function. Request answer by replying!
Post Reply