Learn How to Set up Hadoop on 4 Amazon Instances

The first part of this tutorial demonstrated provisioning a cluster with four instances on Amazon EC2. Connecting to the instances using SSH...

Learn how to set up a multi-node Hadoop cluster on AWS

This tutorial is divided into two parts. In the first part, we will demonstrate how to set up instances on Amazon Web Services...

Learn How to Query, Summarize and Analyze Data using Apache Hive

Apache Hive is a project within the Hadoop ecosystem that provides data warehouse capabilities. It was not designed for processing OLTP workloads. It has features...

Learn how to write MapReduce Programs using Pig Latin

In the Hadoop ecosystem, Pig offers features for performing extraction, transformation, and loading of data (ETL). In ETL, the main objective is to acquire...

Learn how to stream data into Hadoop using Apache Flume

Apache Flume is a tool in the Hadoop ecosystem that provides capabilities for efficiently collecting, aggregating, and moving large amounts of data into...

Learn How To Import Data From MySQL Into Hadoop Using Sqoop

Sqoop is a tool in the Apache ecosystem that was designed to solve the problem of importing data from relational databases and exporting data...

Learn to Install and Configure a Single Node Hadoop Cluster on Ubuntu

Hadoop as a production platform is supported on Linux, but Windows and flavors of UNIX such as Mac OS X can be used...

Why Python is important for big data and analytics applications

Python is a general-purpose programming language that is open source, flexible, powerful, and easy to use. One of the most important features...

Introduction to the MapReduce Programming Model

(Assuming you have a basic working knowledge of Java.) The MapReduce programming paradigm is based on the concept of key-value pairs. It also provides powerful paradigms for...

Why R is important for data science professionals

R is a programming language and environment made specifically for statistical computing and graphics. It is licensed under the GNU General Public License, just...