Data Science Trends To Look Out For In 2017
Data grows at a rate of 2.5 billion gigabytes (GB) per day, and this number is constantly growing. Data has become an integral part...
10 Popular Open Source Big Data Tools
Data has become a powerful tool in today’s society, where it translates into direct knowledge and tons of money. Companies are paying through the...
Analytics of Things and it’s significance to IoT
The idea of the Internet of Things (IoT) is revolutionary and in the future it is expected to find its way into everyone’s lives....
Learn How To Develop And Test Pig Scripts For Data Processing
In the first part of Pig tutorial we explained how Pig fits in the Hadoop ecosystem as tool for performing data extraction, transformation and...
Learn How To Write Advanced Queries To Manipulate Data Using Hive
In previous Hive tutorials we have looked at installing and configuring Hive, data modeling and use of partitions to improve query response time. For...
Learn How To Use Partitioning In Hive To Improve Query Performance
In previous Hive tutorials we have have looked at Hive as the Hadoop project that offers data warehousing features. Installing and configuring Hive was...
Learn How To Use Hbase Shell To Create Tables, Query Data, Update And Delete...
In previous Hbase tutorials we looked at how to install Hbase and develop suitable data models. In this tutorial we will build on those...
Learn How To Simplify Management Of A Hadoop Cluster Using Ambari
Within the Hadoop ecosystem Apache Ambari was developed to provide a simple way of managing Hadoop clusters using a web based interface. Cluster management...
Learn How To Process Data Interactively And In Batch Using Apache Tez Framework
Within Hadoop, MapReduce has been the widely used approach to process data. In this approach data processing happens in batch mode that can take...
Learn How To Use Apache Oozie To Schedule Hadoop Jobs
Within the Hadoop ecosystem Oozie provides services that enable jobs to be scheduled. With job scheduling you are able to organize multiple jobs into...