Learn How To Use Apache Oozie To Schedule Hadoop Jobs

Within the Hadoop ecosystem, Oozie provides services that enable jobs to be scheduled. With job scheduling, you are able to organize multiple jobs into...

Learn to Integrate data management and visualization for better results in Hadoop

Data management is an asset of Hadoop. Hadoop is often considered the future of data management, as this is the beauty of the Hadoop distributed file...

Introduction to Map-Reduce Programming model

(Assuming you have a basic working knowledge of Java.) The MapReduce programming paradigm is based on the concept of key-value pairs. It also provides powerful paradigms for...

Learn to process your data by using Apache Pig and Hadoop platform

Apache Pig is a high-level scripting language and a part of the Apache Hadoop ecosystem. Pig scripting is mainly used for data analysis...

Increasing the replications of data dynamically with HDFS

Introduction to HDFS: The Hadoop Distributed File System is a distributed storage system used for storing large-scale data sets and real-time streaming data, setting...

MapReduce Program In Detail

In our previous guides, we saw how to run the wordcount MapReduce program on a single-node Hadoop cluster. Now we will understand the MapReduce...

Learn How To Secure A Hadoop Cluster Using Kerberos Part 2

In part 1 of this tutorial, key terminologies used in Kerberos authentication were discussed. We demonstrated how to set up and configure a KDC...

Learn about Hadoop Distributed File System Management

What is HDFS (Hadoop Distributed File System): HDFS is a distributed file system that is fault tolerant. HDFS is the primary distributed storage for Hadoop applications....

Learn How to Query, Summarize and Analyze Data using Apache Hive

Apache Hive is a project within the Hadoop ecosystem that provides data warehouse capabilities. It was not designed for processing OLTP workloads. It has features...

The Association Rules In Data Science

Association rule finding is a rule-based machine learning technique widely used in recommender systems. If you have seen advertisements that are tailored according to...


Most Common Myths Surrounding Java Programming

In this article, we will study myths of Java programming, which are just misconceptions and incorrect views about Java. Different myths of Java...

Beginners Guide To Couchbase With Kotlin & Spring

Couchbase Server is an open-source, multi-model NoSQL document-oriented database. Perhaps it won’t be wrong to say that it is a merger of two popular NoSQL technologies: ...

Functionality and Uses Of Java Stream Collectors

With Java 8, we were introduced to a new abstraction called Stream, using which we can process data in a declarative manner. This, when...