Category: Hadoop

0

Uninstall Hortonworks HDP 2.2

Source: https://pulsatingtaurus.wordpress.com/2015/02/15/clean-uninstall-hortonworks-hdp-2-2/   I love Hadoop and Hortonworks is one of my favored Hadoop distributuion. However while experimenting with the hadoop installation, I had many instances when I needed to start afresh on the set of physical as well as...

0

Apache Hadoop 1.x – Jobtracker

Source: http://wiki.apache.org/hadoop/JobTracker The JobTracker is the service within Hadoop that farms out MapReduce tasks to specific nodes in the cluster, ideally the nodes that have the data, or at least are in the same rack. Client applications submit jobs to...

0

YARN – Overview

Source: http://pivotalhd.cfapps.io/introduction/yarn.html YARN Overview Apache Hadoop has two main components: Distributed Storage Distributed computation The distributed storage is provided by the HDFS, and the MapReduce provides the distributed computation. About YARN YARN (Yet-Another-Resource-Negotiator) is the next-generation Hadoop data-processing framework. YARN...

0

Remotely debug hadoop

Source: http://www.gluster.org/2013/07/deep-dive-into-hadoop-with-bigtop-and-eclipse-remote-debuggers/ Deep dive into Hadoop with Bigtop and Eclipse Remote Debuggers Thanks to a little hack session with bradley childs over at Red Hat this week, I learned a new trick: Remote debugging of JVM (Hadoop + MR2) apps in...

0

mapred Vs. mapreduce

mapred Vs. mapreduce Resources http://stackoverflow.com/questions/7598422/is-it-better-to-use-the-mapred-or-the-mapreduce-package-to-create-a-hadoop-job http://stackoverflow.com/questions/10986633/hadoop-configuration-mapred-vs-mapreduce http://www.slideshare.net/sh1mmer/upgrading-to-the-new-map-reduce-api Related posts: Remotely debug hadoop Creating Hive tables on compressed files Hadoop Hive UDTF Tutorial – Extending Apache Hive with Table Functions ​DistCp Between HA Clusters

0

Debug Hadoop source code using an Intellij Idea

Source: http://www.techbite.in/2013/05/debug-hadoop-source-code-using.html If you are someone who wants to dive into Hadoop source code and get a feel of the implementation details of all the abstracted out nitty-gritties of Hadoop’s architectural overview, and want to get your hands dirty by...

0

hadoop randomwriter not reading configuration file

So with randomwriter sample, you want to write data less than 10 GB. You have searched all around the net but have not found the solution and almost gave up? :p Ok, so I am sharing what worked for me....