Learn-How-To-Process-Stream-Data-In-Real-Time-Using-Apache-Storm-Part-1-740X296

Learn How To Process Stream Data In Real Time Using Apache Storm Part-1

Apache Storm is a top level hadoop project that has been developed to enable processing of very large stream data that arrives very fast...
Learn-how-to-develop-Spark-applications-using-the-Scala-programming-language-740X296

Learn How To Analyze Data Interactively In Spark Using Scala

Scala is a programming language that incorporates object oriented and functional programming styles. It is one of the programming languages along Java and Python...
Learn-how-to-develop-effective-data-models-in-Hbase-740X296

Learn How to Develop Effective Data Models in Hbase

To develop a data model in Hbase that is scalable you need a good understanding of the strengths and weaknesses of the database. The...
Learn-how-to-create-effective-data-models-in-Hive-740X296

Learn How to Develop Effective Data Models in Hive

Within the Hadoop ecosystem Hive is considered as a data warehouse. This could be true or false depending on how you look at it....
Learn-how-to-process-data-using-Spark-740X296

Learn How To Process Data Using Spark On Amazon Elastic Mapreduce

Apache Spark is a data processing framework that has been developed to process very large amounts of data very fast. The speed gains are...
Learn-how-to-manage-data-in-the-hadoop-file-system-740X296

Learn How to Manage Files Within Hadoop File System

Data in hdfs is store in blocks that have a default size of 64mb. Files that you store in hdfs are broken up and...
Learn-how-to-set-up-hadoop-on-4-Amazon-Instances-740X296

Learn How to Set up Hadoop on 4 Amazon Instances

In the first part of this tutorial provisioning a cluster with four instances on Amazon ec2 was demonstrated. Connecting to the instances using SSH...
Learn-how-to-set-up-a-multi-node-Hadoop-cluster-on-AWS-740X296

Learn how to set up a multi node Hadoop cluster on AWS

This tutorial will be divided into two parts. In the first part we will demonstrate how to set up instances on Amazon Web Services...
Learn-How-to-Query,-Summarize-and-Analyze-Data-using-Apache-Hive

Learn How to Query, Summarize and Analyze Data using Apache Hive

Apache Hive is project within the Hadoop ecosystem that provides data warehouse capabilities. It was not designed for processing OLTP workloads. It has features...
Learn-how-to-write-MapReduce-Programs-using-Pig-Latin-740X296

Learn how to write Mapreduce Programs using Pig Latin

In the Hadoop ecosystem Pig offers features for performing extraction, transformation and loading of data (ETL). In ETL the main objective is to acquire...
- Advertisement -

Marketing

Java-Myths-Featured-Image

Most Common Myths Surrounding Java Programming

In this article, we will study about myths of java programming which are just a misconception and incorrect views about java. Different myths of Java...
Kotlin & Spring

Beginners Guide To Couchbase With Kotlin & Spring

Couchbase server is an open source, multi-model NoSQL document-oriented database. Perhaps, it won’t be wrong to say that it is a merger of two popular NoSQL technologies: ...
Java Stream Collectors

Functionality and Uses Of Java Stream Collectors

With Java 8, we were introduced to a new abstraction called Stream, using which we can process data in a declarative manner. This, when...