Kabaddi is one of the oldest sports of India. It has been the national game of Bangladesh and Nepal. The game is a test of strength and unity. The game continues till the other team members oust the last player. The game tests ones mental, physical and interpersonal skills. The onus of one lies on other members hence the trust is the other key factor. The statistical programming languages are on the same fighting arena. Lets look around us we have R, Python, SPSS, SAS, Stata among many other. Which one is the best? 1)We have problem statement on which analysis needs to be done any language can be used to build up on the solution as the languages are a tool whereas logic remains same behind these. 2)R is a free open courseware whereas others are costly in terms of licensing costs. Meanwhile R has been used most freely SAS/Stata is more powerful to handle more statistical languages and techniques. 3)R being free is the most documented and used for the research and t...
Hi Folks!! This is my first blog in the series where I would love to share my experimentation with text analytics using R.. The initial 3 parts we would concentrate our efforts towards scrapping or extraction of reviews of samsung s4 product from the flipkart. We would be understanding the code bit by bit to get a hang of it and make it simpler to follow for a novice who is just enticed and want to start afresh here. When I am starting with -- this means R code 1) R is an open source platform and in case you want to have added functionalities you load the library --library(RCurl) --library(XML) RCurl: Provides functions to allow one to compose general HTTP requests and provides convenient functions to fetch URIs, get & post forms, etc. and process the results returned by the Web server. ( Ref: https://cran.r-project.org/web/packages/RCurl/index.html) XML:This collection of functions allow us to add, remove and replace children from an X...
Welcome to the second part of the blog where we would learn how to run a simple pig program. This blog has been written to complete an assignment on big data with the praxis business school. The sample exercise has been taken from the following link "http://hortonworks.com/hadoop-tutorial/how-to-process-data-with-apache-pig/" . About Pig: Pig is a high level scripting language that i used with Apache hadoop. Pig excels at describing data analysis problems as data flows. Pig is complete in that you can do all the required data manipulations in Apache Hadoop with Pig. In addition through the User Defined Functios(UDF) facility in Pig you can have Pig invoke code in many languages like Jruby, Jython and Java Pig Scripts are translated into a series of Mapreduce jobs that run on the Apache Hadoop Cluster. As part of the translation the Pig interpreter does perform optimizations to speed execution on Apache Hadoop. Lets solve a simple exercise on how ...
Comments
Post a Comment