Task-Practical Big Data Cloudera HUE exercise with two parts:Part 1-Working with large dataset BDMS processing and creating reportsPart 2 – Migrate the created set up away from Hive to the NoSQL scheme offered by Camel-MongoDB Requires Expert Skills for the 2 Tasks:Practical Experiences with Cloudera HUE, importing data in HUE, working within the Hive environment in Cloudera, scripting in Latin Pig,developing simple reports for the given task, database migration knowledge resource: Use the Cloudera HUE Demo account for this exercise.You get access to the HUE Demo account here: http://demo.gethue.com/hue/editor/?type=hive Requirements:Create a 2.000wordstep by step documentation with detailed screenshots about each steps executing the practical exercise described below. As your task is divided in two parts. Part 1 should cover about 1.000 words and Part 2 also about 1.000 words. Cite and reference all sources using the Harvard Liverpool Referencing System (at least 5 references).Task Scenario:You have been engaged as aData advisorworking with Advanced Data Science Services (ADSS). ADSS has just been awarded a contract by a government department (the Department of Environment) to help with the management, data mining and visualization of atmospheric emissions (and pollution) data gathered by various borough and county environment monitoring units. You need to assist this project, and you will be required to carry out a number of tasks as described below using the Hadoop framework and tools. Preparation for your 2 Tasks:Prepare your Demo Account:Create a new directory called ADDS_Test for your task in the HUE Hive Demo environment.Screenshot: New directory called ADDS_Test Import the data for Analysis into your directory in the demo account. The data for the import you find under https://data.london.gov.uk/dataset/london-atmospheric-emissions-inventory-2013?resource=0fa6a83a-1529-48a7-890d-414a2f52b0e6. (It relates to the year 2013). The file is called: “3 – Detailed Road Transport – LAEI2013_MajorRoads_EmissionsbyLink_2013 (332.19 MB)” After the preparation now, your two tasks in Detail:Create a step by step documentation with detailed screenshots about each step, including the detailed syntax, executing the practical exercise described below. Part 1:Analyze your data’s. E.g. use relevant Hive DML statements, scripts and summary functions (e.g. max, min, count or avg) and generate at least two reports that summarize the data in the tables in an insightful way. Examples for your reports could be:1.Who creates the most Emissions (Unit=tonne/year) – Motorcycle, Taxi or Car?2.Which road creates the most pollution (Hint: e.g. the road with the largest length)? However, feel free to create another appropriate insight report for illustration. Explain your rationale for producing those particular report summaries. Part 2:1.Migrate the created set up away from Hive to the NoSQL scheme offered by Camel-MongoDB. Explain the prime considerations and actions that need to be taken for such a migration.2.It is intended to convert the static visualisation dashboards previously created to live ones via the use of streaming schemes offered by Spark and Kafka. Outline what steps would need to be taken to accomplish this.3.Explain how the use of Sentry and/or IAM (Identity Access Manager) for AWS may be used to help secure the cluster and the Hadoop deployment in an enterprise environment.
The post Big Data Cloudera HUE appeared first on Homework Aider.


What Students Are Saying About Us

.......... Customer ID: 12*** | Rating: ⭐⭐⭐⭐⭐
"Honestly, I was afraid to send my paper to you, but you proved you are a trustworthy service. My essay was done in less than a day, and I received a brilliant piece. I didn’t even believe it was my essay at first 🙂 Great job, thank you!"

.......... Customer ID: 11***| Rating: ⭐⭐⭐⭐⭐
"This company is the best there is. They saved me so many times, I cannot even keep count. Now I recommend it to all my friends, and none of them have complained about it. The writers here are excellent."


"Order a custom Paper on Similar Assignment at essayfount.com! No Plagiarism! Enjoy 20% Discount!"