Bigdata and Hadoop Archives

Learn How To Import Data From Mysql Into Hadoop Using Sqoop

Bigdata and Hadoop Krishnakumar - May 22, 2016 0

Sqoop is a tool in the Apache ecosystem that was designed to solve the problem of importing data from relational databases and exporting data...

Learn to Install and Configure a Single Node Hadoop Cluster on Ubuntu

Bigdata and Hadoop Krishnakumar - May 19, 2016 1

Hadoop as a production platform is supported on Linux, but Windows and flavors of UNIX such as Mac OS X can be used...

Running Hadoop on Apache Mesos: A Distributed kernel system

Bigdata and Hadoop Krishnakumar - October 11, 2015 0

Apache Mesos – An overview: Apache Mesos is an open source, kernel-based cluster management system. It is built on the same principles as the Linux kernel...

Learn How to Query, Summarize and Analyze Data using Apache Hive

Bigdata and Hadoop Krishnakumar - June 5, 2016 0

Apache Hive is a project within the Hadoop ecosystem that provides data warehouse capabilities. It was not designed for processing OLTP workloads. It has features...

Learn How to Develop Effective Data Models in Hive

Bigdata and Hadoop Krishnakumar - July 7, 2016 0

Within the Hadoop ecosystem, Hive is considered a data warehouse. This could be true or false depending on how you look at it....

Learn how to stream data into Hadoop using Apache Flume

Bigdata and Hadoop Krishnakumar - May 26, 2016 3

Apache Flume is a tool in the Hadoop ecosystem that provides capabilities for efficiently collecting, aggregating and bringing large amounts of data into...

MapReduce Program In Detail

Bigdata and Hadoop Krishnakumar - July 23, 2015 0

In our previous guides, we saw how to run the wordcount MapReduce program on a single node Hadoop cluster. Now we will understand the MapReduce...

Learn to create input splits on incoming data with MapReduce Programming

Bigdata and Hadoop Krishnakumar - October 18, 2015 0

Introduction: MapReduce is the core technology of Hadoop and is the backbone of big data and the Hadoop framework. This technology works in conjunction with...

Importance Of Exploratory Data Analysis Before ML Modelling

Bigdata and Hadoop Sonal - September 25, 2021 0

Exploratory Data Analysis (EDA) is the crucial process of using summary statistics and graphical representations to perform preliminary investigations on data in order to...

Learn How To Write Advanced Queries To Manipulate Data Using Hive

Bigdata and Hadoop Krishnakumar - September 5, 2016 0

In previous Hive tutorials we have looked at installing and configuring Hive, data modeling and the use of partitions to improve query response time. For...

Bigdata and Hadoop

Flume Installation and Streaming Twitter Data Using Flume

A step by step guide to install Hadoop cluster on Amazon EC2