Hadoop Project on NCDC ( National Climate Data Center – NOAA ) Dataset
NOAA's National Climatic Data Center (NCDC) is responsible for preserving, monitoring, assessing, and providing public access to weather data.
NCDC provides access to daily data from...
10 Popular Open Source Big Data Tools
Data has become a powerful tool in today’s society, where it translates into direct knowledge and tons of money. Companies are paying through the...
Learn How To Secure A Hadoop Cluster Using Kerberos Part – 1
Kerberos is a way of authenticating users that was developed at MIT and has grown to become the most widely used authentication approach. Hadoop...
Why Python is important for big data and analytics applications?
Python Programming is a general purpose programming language that is open source, flexible, powerful and easy to use. One of the most important features...
Learn to Integrate data management and visualization for better results in Hadoop
Data management is an asset of hadoop
Hadoop is often considered as future of data management as this is the beauty of hadoop distributed file...
How and When should you use HBase NoSQL DB
Apache HBase is one of the most popular non-relational databases built on top of Hadoop and HDFS (Hadoop Distributed File system). It is also...
Passing Multiple Files for Same Input in Hadoop
Introduction
Hadoop is well known for its data processing capability for searching and sorting and can also be used for batch processing analysis. In order to...
Learn How To Use Hbase Shell To Create Tables, Query Data, Update And Delete...
In previous Hbase tutorials we looked at how to install Hbase and develop suitable data models. In this tutorial we will build on those...
Guide to Import, Export, Run A MapReduce Program
A MapReduce program is written in Java. And mostly Eclipse IDE is used for programming by the developers. In our last guide we saw...
Why R is important for data science professionals
R is actually a programming environment and language made specifically for graphical applications and statistical computations. It is licensed under the GNU license, just...