What are the major differences hadoop and spark

Page 1

What are the major differences Hadoop and Spark

Hadoop is said to be an Apache.org project, which is adept at providing the distribution of software that processes large data sets, for a number of computer clusters, simply by using programming models. Hadoop is one such software, which is able to scale from a single computing system to close to thousands of commodity systems that are known to offer local storage and computer power. In a simpler sense, you can think of Hadoop as the 800 lb big data gorilla in the big data analytics space. This is one of the reasons why the use of this particular software programme is popular among data analysts. On the other hand Spark, is known as the fast and general engine for large scale data processing, by Apache Spark developers. If we go on to compare these two programming environments, then where Hadoop happens to be the 800lb gorilla, Spark would be the 130 lb big data cheetah. Spark is cited to be way faster in terms of in-memory processing, when compared to Hadoop and MapReduce; but many believe that it may not be as fast when it comes to processing on disk space. What Spark actually excels at is effortlessly streaming of interactive queries, workloads and most importantly, machine learning. While these two may be contenders, but time and again a lot of data analysts, have wanted the two programming environments to work together, on the same side. This is why a direct comparison kind of becomes a lot more difficult, as both of these perform the same functions and yet sometimes are able to perform entirely parallel functions. Come to think of it, if there were conclusions to be drawn, then it would be Hadoop that would be a better, more independently


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.
What are the major differences hadoop and spark by Imarticus Learning - Issuu