Posts

Showing posts from December, 2015

How to process data using Hive Batting Example :Big Data Assignment Part 3 for Praxis Business School

Image
As part of the exercise in learning about the hive query language we completed one sample exercise on hive . The sample exercise  has been taken from the following website  http://hortonworks.com/hadoop-tutorial/how-to-process-data-with-apache-hive/ Hive provides a platform to run SQL queries. This is a more familiar with the programmers of SQL background. It is a known and learned fact that sql queries are similar to comprehend and understand 1) Data Downloading: The csv file can be downloaded from the following zip file. http://hortonassets.s3.amazonaws.com/pig/lahman591-csv.zip 2) Uploading of the data: Although the link from where we are solving the program is an easy guide. We would be using the hue to upload the files for execution of the program. Steps to upload: a) Start VM Box and open SSH Terminal  b) Log on to the address http://127.0.0.1:8000/ c) Click on file browser option followed by view then click on hue d) Upload 2 files batting.csv a...