Running a MapReduce Program on Amazon EC2 Hadoop Cluster with YARN
As in the previous guide we configured Hadoop cluster with YARN on Amazon EC2 instance. Now we will run a simple MapReduce Program on...
Learn How to Develop Effective Data Models in Hbase
To develop a data model in Hbase that is scalable you need a good understanding of the strengths and weaknesses of the database. The...
Learn how to set up a multi node Hadoop cluster on AWS
This tutorial will be divided into two parts. In the first part we will demonstrate how to set up instances on Amazon Web Services...
Learn How To Secure A Hadoop Cluster Using Kerberos Part 2
In part 1 of this tutorial key terminologies used in kerberos authentication were discussed. We demonstrated how to set up and configure a KDC...
Learn How To Use Partitioning In Hive To Improve Query Performance
In previous Hive tutorials we have have looked at Hive as the Hadoop project that offers data warehousing features. Installing and configuring Hive was...
Learn How To Process Data Using Spark On Amazon Elastic Mapreduce
Apache Spark is a data processing framework that has been developed to process very large amounts of data very fast. The speed gains are...
Introduction to Map-Reduce Programming model
(Assuming you have basic working knowledge of Java)
MapReduce programming paradigm is based on the concept of key-value pairs. It also provides powerful paradigms for...
Implementation of data visualization techniques in Hadoop
What is data visualization?
It’s better to visualize the data rather texting it. The brain anatomy also says that, our brain process images up to...
Identification of Cybercrimes using Data Analytics in Hadoop
Introduction to cyber crime
Many organizations don’t even care about pros and cons of dealing with cybercrimes, some of them are product based companies while...
What is Big Data & How Can it Change the World
The word ‘Big Data’ has become a game changer in today’s technology driven world. And you may wonder exactly how “big is Big Data”?...