Skip to content

Category Archives: Hadoop

Finding top 10 or 20 records from a large dataset is the heart of many recommendation systems and it is also an important attribute for… Read More
Pig is a high-level platform or tool which is used to process the large datasets. It provides a high-level of abstraction for processing over the… Read More
With growing data velocity the data size easily outgrows the storage limit of a machine. A solution would be to store the data across a… Read More
One of the three components of Hadoop is Map Reduce. The first component of Hadoop that is, Hadoop Distributed File System (HDFS) is responsible for… Read More
The definition of a powerful person has changed in this world. A powerful is one who has access to the data. This is because data… Read More
Overview: Apache Hadoop is an open source framework intended to make interaction with big data easier, However, for those who are not acquainted with this… Read More
YARN stands for “Yet Another Resource Negotiator“. It was introduced in Hadoop 2.0 to remove the bottleneck on Job Tracker which was present in Hadoop… Read More
Prerequisites – Introduction to Hadoop, Apache HBase HBase architecture has 3 main components: HMaster, Region Server, Zookeeper.    Figure – Architecture of HBase  All the 3 components… Read More
Prerequisite – Introduction to Hadoop, Apache Hive The major components of Hive and its interaction with the Hadoop is demonstrated in the figure below and all… Read More
Prerequisites – Introduction to Hadoop, Computing Platforms and Technologies Apache Hive is a data warehouse and an ETL tool which provides an SQL-like interface between the… Read More
Hadoop is an open source framework overseen by Apache Software Foundation which is written in Java for storing and processing of huge datasets with the… Read More
What is Hadoop? Hadoop is an open source software programming framework for storing a large amount of data and performing the computation. Its framework is… Read More
Data science is an interdisciplinary field of scientific methods, processes, algorithms and systems to extract knowledge or insights from data in various forms, either structured… Read More
“No power on earth can stop an idea whose time has come.” – Victor Hugo Big data is one such remarkable idea. In today’s socially… Read More

Start Your Coding Journey Now!