Learn How To Use Partitioning In Hive To Improve Query Performance

In previous Hive tutorials we looked at Hive as the Hadoop project that offers data warehousing features. Installing and configuring Hive was...

Know More About Data Science And Big Data Analytics

‘Data has become important in our society today’ is an understatement. The importance of data can be seen across multiple sectors with many companies...

Visual data mining with predictive analysis in Hadoop

Visual data mining in collaboration with big data: the new era is the data era, and it’s all about...

Learn How to Query, Summarize and Analyze Data using Apache Hive

Apache Hive is a project within the Hadoop ecosystem that provides data warehouse capabilities. It was not designed for processing OLTP workloads. It has features...

What Makes Data Cleaning so Essential?

Data science is a field that every geek, businessman, entrepreneur, programmer, and visionary is talking about. When you go to Google and search...

Passing Multiple Files for Same Input in Hadoop

Introduction: Hadoop is well known for its data processing capability for searching and sorting, and it can also be used for batch processing analysis. In order to...

Why Is Python Important for Big Data and Analytics Applications?

Python is a general-purpose programming language that is open source, flexible, powerful, and easy to use. One of the most important features...

Learn how to write MapReduce Programs using Pig Latin

In the Hadoop ecosystem, Pig offers features for performing extraction, transformation, and loading of data (ETL). In ETL, the main objective is to acquire...

Getting Familiar with Big Data and Hadoop Technology

Introduction to big data: Have you ever wondered how Facebook actually works, or how it is possible that your click streams are based on your...

Improving business performance using Hadoop for unstructured data

Hadoop, Hadoop everywhere! Companies like Microsoft, IBM, and Oracle are building business solutions for unstructured data analysis similar to Apache Hadoop (used for complex data sets)...

Most Common Myths Surrounding Java Programming

In this article, we will look at myths about Java programming, which are simply misconceptions and incorrect views about Java. Different myths of Java...

Beginners Guide To Couchbase With Kotlin & Spring

Couchbase Server is an open source, multi-model NoSQL document-oriented database. It would not be wrong to say that it is a merger of two popular NoSQL technologies: ...

Functionality and Uses Of Java Stream Collectors

With Java 8, we were introduced to a new abstraction called Stream, with which we can process data in a declarative manner. This, when...