Why Should Small Companies Embrace Big Data
Big Data isn’t a new concept anymore, especially for large companies and corporations who are using big data to their advantage. However, small companies...
Learn How To Create Topologies In Storm To Process Data
In part 1 of this tutorial key concepts that are used in Storm were discussed. In that tutorial it was explained Storm topologies are...
Create a Twitter Stream Processor in 15 Lines of Code.
Articles Series Synopsis
This is a series of articles where we look at problems you can encounter when building Applications with the Hadoop Ecosystem. We...
Handling risk management of data in hadoop
Defining risk management
Risk management is mainly defined as identifying and assessing the prioritization of problem that can occur in a proceeding. Risk is the...
Learn how to write Mapreduce Programs using Pig Latin
In the Hadoop ecosystem Pig offers features for performing extraction, transformation and loading of data (ETL). In ETL the main objective is to acquire...
Learn How To Simplify Management Of A Hadoop Cluster Using Ambari
Within the Hadoop ecosystem Apache Ambari was developed to provide a simple way of managing Hadoop clusters using a web based interface. Cluster management...
A real time testing of Big Data with Hadoop
Introduction
Hadoop is well known for its batch processing capability and most of the time Hadoop is used for historical analysis, especially in airlines and...
Improving data modeling and workflow in hadoop
Data work flow modeling in hadoop framework
Hadoop is well known for its schema on read approach that simply denotes to the raw and unprocessed...
Learn How To Coordinate Hadoop Clusters Using Zookeeper
Hadoop was designed to be a distributed system that scales up to thousands of nodes. Even with a few hundred node cluster managing all...
Top Data Science Blogs to Follow in 2019
Data Science is one of the most fascinating technologies in the present world. It is a constantly evolving beast helping industries from all the...