Skip to content

Category Archives: Hadoop

MapReduce is a programming model used to perform distributed processing in parallel in a Hadoop cluster, which Makes Hadoop working so fast. When you are… Read More
Before head over to learn about the HDFS(Hadoop Distributed File System), we should know what actually the file system is. The file system is a… Read More
Before we start learning about the Hadoop cluster first thing we need to know is what actually cluster means. Cluster is a collection of something,… Read More
Map-Reduce is a processing framework used to process data over a large number of machines. Hadoop uses Map-Reduce to process the data distributed in a… Read More
All of us are familiar with the disaster that happened on April 14, 1912. The big giant ship of 46000-ton in weight got sink-down to… Read More
Map-Reduce is a programming model that is mainly divided into two phases i.e. Map Phase and Reduce Phase. It is designed for processing the data… Read More
Map-Reduce is a programming model that is mainly divided into two phases Map Phase and Reduce Phase. It is designed for processing the data in… Read More
Here, we will write a Map-Reduce program for analyzing weather datasets to understand its data processing programming model. Weather sensors are collecting weather information across… Read More
Hadoop is an open-source framework written in Java that uses lots of other analytical tools to improve its data analytics operations. The article demonstrates the… Read More
Hive is a group of keys, subkeys in the registry that has a set of supporting files containing backups of the data. Basically, the hive… Read More
In general, a Hadoop cluster performs security on many layers. The level of protection depends upon the organization’s requirements. In this article, we are going… Read More
Apache Hive was initially developed by Facebook in 2010. It is a Data Warehousing package, used for Data Analysis. Hive is used for Managing and… Read More
Wasting the useful information hidden behind the data can be a dangerous roadblock for industries, ignoring this information eventually pulls your industry growth back. Data?… Read More
1. Hive : Hive is an open-source data warehouse software. It is built on the top of Hadoop. It also provides HiveQL which is similar… Read More
In Hadoop, we can receive multiple jobs from different clients to perform. The Map-Reduce framework is used to perform multiple tasks in parallel in a… Read More

Start Your Coding Journey Now!