Bigdata and Hadoop Archives

Learn How To Coordinate Hadoop Clusters Using Zookeeper

Bigdata and Hadoop Krishnakumar - July 29, 2016 0

Hadoop was designed to be a distributed system that scales up to thousands of nodes. Even with a few hundred node cluster managing all...

Learn How To Create Topologies In Storm To Process Data

Bigdata and Hadoop Krishnakumar - July 26, 2016 0

In part 1 of this tutorial key concepts that are used in Storm were discussed. In that tutorial it was explained Storm topologies are...

Learn How To Secure A Hadoop Cluster Using Kerberos Part 2

Bigdata and Hadoop Krishnakumar - July 24, 2016 1

In part 1 of this tutorial key terminologies used in kerberos authentication were discussed. We demonstrated how to set up and configure a KDC...

Learn How To Secure A Hadoop Cluster Using Kerberos Part – 1

Bigdata and Hadoop Krishnakumar - July 20, 2016 5

Kerberos is a way of authenticating users that was developed at MIT and has grown to become the most widely used authentication approach. Hadoop...

Learn How To Process Stream Data In Real Time Using Apache Storm Part-1

Bigdata and Hadoop Krishnakumar - July 17, 2016 0

Apache Storm is a top level hadoop project that has been developed to enable processing of very large stream data that arrives very fast...

Learn-how-to-develop-Spark-applications-using-the-Scala-programming-language-740X296

Learn How To Analyze Data Interactively In Spark Using Scala

Bigdata and Hadoop Krishnakumar - July 14, 2016 0

Scala is a programming language that incorporates object oriented and functional programming styles. It is one of the programming languages along Java and Python...

Learn How to Develop Effective Data Models in Hbase

Bigdata and Hadoop Krishnakumar - July 11, 2016 0

To develop a data model in Hbase that is scalable you need a good understanding of the strengths and weaknesses of the database. The...

Learn-how-to-create-effective-data-models-in-Hive-740X296

Learn How to Develop Effective Data Models in Hive

Bigdata and Hadoop Krishnakumar - July 7, 2016 0

Within the Hadoop ecosystem Hive is considered as a data warehouse. This could be true or false depending on how you look at it....

Learn-how-to-process-data-using-Spark-740X296

Learn How To Process Data Using Spark On Amazon Elastic Mapreduce

Bigdata and Hadoop Krishnakumar - July 3, 2016 0

Apache Spark is a data processing framework that has been developed to process very large amounts of data very fast. The speed gains are...

Learn-how-to-manage-data-in-the-hadoop-file-system-740X296

Learn How to Manage Files Within Hadoop File System

Bigdata and Hadoop Krishnakumar - June 29, 2016 0

Data in hdfs is store in blocks that have a default size of 64mb. Files that you store in hdfs are broken up and...

Bigdata and Hadoop

What Is Cloud Data Management And How Big Is The Industry?

Improve DataOps with Dynamic Indexing in Data Lake