Programming Assignment 1 Attached Files: File crime.zip Click for more options (10.647 MB) Use attached Vancouver crime

answerhappygod
Site Admin
Posts: 899604
Joined: Mon Aug 02, 2021 8:13 am

Post by answerhappygod »

Programming Assignment 1

Attached Files: crime.zip (10.647 MB)

Use the attached Vancouver crime data as input for this assignment. Details of this data can be found at: https://www.kaggle.com/wosaku/crime-in-vancouver/data

Assignment Tasks: First create an input directory on HDFS and copy all the files present in the input zip file to the input HDFS directory. Write a map/reduce program in Java that counts the number of crimes per neighborhood. The Driver program must be named NeighborhoodCrimeCountDriver. The program should take the input HDFS directory as the first argument and the output HDFS directory as the second argument. Run your programs on the HDP with the provided data.

Submit: The jar file containing all classes as well as source files (e.g. bigdata-0.0.1-SNAPSHOT-sources.jar and bigdata-0.0.1-SNAPSHOT.jar). The output part-* file(s).

Note: Set the number of reducers to 1: job.setNumReduceTasks(1);
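
As a starting point for the neighborhood count, here is a minimal sketch of the map and reduce logic in plain Java, without the Hadoop classes, so it can be tried locally first. It assumes the CSV columns are TYPE, YEAR, MONTH, DAY, HOUR, MINUTE, HUNDRED_BLOCK, NEIGHBOURHOOD, X, Y, Latitude, Longitude and that no field contains an embedded comma; check both against the actual file. The class name, the UNKNOWN key for blank neighbourhoods, and the sample rows are made up for illustration. In the real job, mapLine would presumably sit inside a Mapper that emits (neighbourhood, 1), and the summing would sit inside a Reducer.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Local sketch of the map/reduce logic; not a Hadoop job by itself.
final class NeighborhoodCrimeCountSketch {

    // Assumed zero-based position of the NEIGHBOURHOOD column.
    static final int NEIGHBOURHOOD_COLUMN = 7;

    /** Map step: returns the neighbourhood key, or null for the header or a short row. */
    static String mapLine(String line) {
        String[] fields = line.split(",", -1); // -1 keeps trailing empty fields
        if (fields.length <= NEIGHBOURHOOD_COLUMN || "TYPE".equals(fields[0])) {
            return null;
        }
        String neighbourhood = fields[NEIGHBOURHOOD_COLUMN].trim();
        // Hypothetical choice: group rows with a blank neighbourhood under UNKNOWN.
        return neighbourhood.isEmpty() ? "UNKNOWN" : neighbourhood;
    }

    /** Reduce step, run in memory: sums a 1 for every row that maps to a key. */
    static Map<String, Long> countByNeighbourhood(List<String> lines) {
        Map<String, Long> counts = new TreeMap<>();
        for (String line : lines) {
            String key = mapLine(line);
            if (key != null) {
                counts.merge(key, 1L, Long::sum);
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Illustrative rows only, not taken from the real data set.
        List<String> sample = Arrays.asList(
            "TYPE,YEAR,MONTH,DAY,HOUR,MINUTE,HUNDRED_BLOCK,NEIGHBOURHOOD,X,Y,Latitude,Longitude",
            "Theft,2003,5,12,16,15,1XX A ST,Strathcona,0,0,0,0",
            "Mischief,2004,1,2,3,4,2XX B ST,Strathcona,0,0,0,0",
            "Theft,2005,6,7,8,9,3XX C ST,Kitsilano,0,0,0,0");
        countByNeighbourhood(sample)
            .forEach((name, count) -> System.out.println(name + "\t" + count));
    }
}
```

With a single reducer, as the note requires, all keys end up in one part-* file, which matches the tab-separated shape printed by main here.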
Programming Assignment 2

Data: Use the same data as Assignment 1.

Assignment Tasks: Write a map/reduce program in Java that outputs, for each crime type, the first date and last date on which it happened and the number of times it happened. For the date, use the year, month and day values in YYYY-mm-dd format (four-digit year, two-digit month and day). The Driver program must be named CrimeFirstLastDateDriver. The program should take the input HDFS directory as the first argument and the output HDFS directory as the second argument. Set the number of reducers to 5.

Write a map/reduce program in Java that outputs the top 5 most frequently occurring crime types. The Driver program must be named Top5NeighborhoodCrimeTypeDriver. The program should take the input HDFS directory as the first argument (the output of the first program) and the output HDFS directory as the second argument.

Run both programs on the data.

Submit: The jar file containing all classes as well as source files. The output file(s). You may submit a separate jar per program.
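
The two jobs in Assignment 2 can be sketched the same way in plain Java, again without Hadoop classes and only as one possible approach. The first part builds a zero-padded date per row and reduces each crime type to first date, last date and count; because the dates are zero-padded, plain string comparison orders them chronologically. The second part reads the first job's output and keeps the 5 lines with the largest count using a small min-heap. The column order (TYPE, YEAR, MONTH, DAY first), the tab-separated output layout, and all class and method names are assumptions for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.PriorityQueue;
import java.util.TreeMap;

// Local sketch of both jobs; not a Hadoop job by itself.
final class CrimeTypeJobsSketch {

    /** Job 1 map step: returns {crimeType, "YYYY-MM-dd"}, or null for the header or a bad row. */
    static String[] mapLine(String line) {
        String[] f = line.split(",", -1);
        if (f.length < 4) {
            return null;
        }
        try {
            int year = Integer.parseInt(f[1].trim());
            int month = Integer.parseInt(f[2].trim());
            int day = Integer.parseInt(f[3].trim());
            String date = String.format(Locale.ROOT, "%04d-%02d-%02d", year, month, day);
            return new String[] {f[0].trim(), date};
        } catch (NumberFormatException e) {
            return null; // header line or non-numeric date field
        }
    }

    /** Job 1 reduce step: "first<TAB>last<TAB>count" for the dates of one crime type. */
    static String reduceDates(Iterable<String> dates) {
        String first = null;
        String last = null;
        long count = 0;
        for (String d : dates) {
            if (first == null || d.compareTo(first) < 0) first = d;
            if (last == null || d.compareTo(last) > 0) last = d;
            count++;
        }
        return first + "\t" + last + "\t" + count;
    }

    /** Runs job 1 in memory; each output line is "type<TAB>first<TAB>last<TAB>count". */
    static List<String> firstLastDates(List<String> csvLines) {
        Map<String, List<String>> grouped = new TreeMap<>();
        for (String line : csvLines) {
            String[] kv = mapLine(line);
            if (kv != null) {
                grouped.computeIfAbsent(kv[0], k -> new ArrayList<>()).add(kv[1]);
            }
        }
        List<String> out = new ArrayList<>();
        for (Map.Entry<String, List<String>> e : grouped.entrySet()) {
            out.add(e.getKey() + "\t" + reduceDates(e.getValue()));
        }
        return out;
    }

    /** Reads the count from the last tab-separated field of a job 1 output line. */
    static long countOf(String line) {
        return Long.parseLong(line.substring(line.lastIndexOf('\t') + 1).trim());
    }

    /** Job 2: keeps the n lines with the largest count, returned in descending order. */
    static List<String> topN(List<String> jobOneLines, int n) {
        PriorityQueue<String> heap =
            new PriorityQueue<>((a, b) -> Long.compare(countOf(a), countOf(b)));
        for (String line : jobOneLines) {
            heap.add(line);
            if (heap.size() > n) {
                heap.poll(); // evict the current smallest count
            }
        }
        List<String> top = new ArrayList<>(heap);
        top.sort((a, b) -> Long.compare(countOf(b), countOf(a)));
        return top;
    }

    public static void main(String[] args) {
        // Illustrative rows only, not taken from the real data set.
        List<String> sample = Arrays.asList(
            "TYPE,YEAR,MONTH,DAY,HOUR,MINUTE,HUNDRED_BLOCK,NEIGHBOURHOOD,X,Y,Latitude,Longitude",
            "Theft,2003,1,5,0,0,1XX A ST,Strathcona,0,0,0,0",
            "Theft,2017,12,31,0,0,1XX A ST,Strathcona,0,0,0,0",
            "Mischief,2005,3,9,0,0,2XX B ST,Kitsilano,0,0,0,0");
        List<String> jobOne = firstLastDates(sample);
        jobOne.forEach(System.out::println);
        topN(jobOne, 5).forEach(System.out::println);
    }
}
```

Since the first job runs with 5 reducers, its output is spread over several part-* files. A global top 5 in the second job probably needs all candidates to meet in one place, for example by using a single reducer there; the assignment text does not say how many reducers the second job should use.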