Data Engineering
TASK
In this Course Project, you will need to complete the following:
- Choose and describe in detail the data analytics goals you will seek to accomplish for your chosen data set that is appropriately and specifically tailored for the underlying context of the data set you have chosen to analyze with the data analytics techniques and tools like spark and hive.
- Propose and detail a full, step-by-step methodology you will use to perform the data analysis on the chosen data-set.
- Show the results, conclusions, and insights you have gained as a result of applying the data analytic methodology you have proposed on the chosen data set.
You will need to write a full report detailing a title page, an abstract, introduction, background providing a brief but comprehensive description of the data set you have chosen and its underlying industry domain in addition to the data analytic goals that are appropriate for the given data set, a step-by-step methodology of the data analytics tools and techniques you will apply to the chosen data set in pursuit of those data analytic goals, a results section that will detail your findings, and a conclusion section where you will discuss your findings and insights within the context of achieving your data analytic industry-domain specific goals. Feel free to incorporate tables, screenshots, and figures in moderation to enhance the points you are making and showcase your results in particular. You may use Microsoft Word to type up this technical report for your Course Project.