Flume Installation and Streaming Twitter Data Using Flume
Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and...
Dark Secrets of Data Science Which You Should Know
Data Science is now hailed as the sexiest job of the 21st century with hundreds of people having the desire to become a data...
7 Predictive Analysis Tips for Hadoop
Introduction to predictive analysis
It’s hard to find a good analysis tool, in today’s technical era that fits and suits our business requirements. Predictive analysis...
Improving business performance using Hadoop for unstructured data
Hadoop Hadoop everywhere!
Companies like Microsoft, IBM and Oracle are building business solutions for unstructured data analysis similar to Apache Hadoop (used for complex data sets)...
Learn about Hadoop Distributed File System Management
What is HDFS (Hadoop Distributed File System):
HDFS is a distributed file system that is fault tolerant. HDFS is the primary distributed storage for Hadoop applications....
Learn to create input splits on an incoming data with MapReduce Programming
Introduction
Map reduce is the core technology of Hadoop and is the backbone of big data and Hadoop framework. This technology works in conjunction with...
Increasing the replications of data dynamically with HDFS
Introduction to HDFS
Hadoop Distributed file system is a distributed storage system used for storing large scale data sets and real time streaming data, setting...
Visual data mining with predictive analysis in Hadoop
Visual data mining striking in collaboration with big data
The new era enables something new within it, it’s the data era and it’s all about...