InfoScipedia
A Free Service of IGI Global Publishing House
Below please find a list of definitions for the term that
you selected from multiple scholarly research resources.

What is HDFS (Hadoop Distributed File System)

Handbook of Research on Big Data Storage and Visualization Techniques
The distributed, scalable storage system of the Hadoop framework, which stores large quantities of data across clusters of commodity hardware.
Published in Chapter:
Hadoop Framework for Handling Big Data Needs
Rupali Ahuja (University of Delhi, India)
DOI: 10.4018/978-1-5225-3142-5.ch004
Abstract
The data generated today has outgrown the storage and computing capabilities of traditional software frameworks. Large volumes of data, if aggregated and analyzed properly, may provide useful insights to predict human behavior, increase revenues, gain or retain customers, improve operations, combat crime, cure diseases, and so on. The results of effective Big Data analysis can thus provide actionable intelligence for humans as well as for machine consumption. New tools, techniques, technologies, and methods are being developed to store, retrieve, manage, aggregate, correlate, and analyze Big Data. Hadoop is a popular software framework for handling Big Data needs; it provides a distributed framework for processing and storing large datasets. This chapter discusses in detail the Hadoop framework, its features, applications, and popular distributions, and its storage and visualization tools.
More Results
Introduction to Smart City and Agricultural Revolution: Big Data and Internet of Things (IoT)
A Java-based file system that provides scalable, reliable data storage and is designed to span large clusters of commodity servers. HDFS has demonstrated production scalability of up to 200 PB of storage and a single cluster of 4,500 servers, supporting close to a billion files and blocks.
Insight Into Big Data Analytics: Challenges, Recent Trends, and Future Prospects
HDFS is the distributed file system responsible for the storage, management, and high-throughput access of application data. HDFS splits the input dataset into manageable chunks (blocks) and stores them on different machines in the Hadoop cluster.
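
The splitting and distribution described above can be illustrated with a minimal Python sketch. It is a toy model rather than HDFS code: the function names, the node names, the 4-byte block size, and the round-robin placement with two copies per block are all assumptions made for illustration. A real HDFS cluster uses much larger blocks and leaves placement decisions to its own policy.

```python
from typing import Dict, List


def split_into_blocks(data: bytes, block_size: int) -> List[bytes]:
    """Cut data into consecutive blocks of at most block_size bytes."""
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_blocks(num_blocks: int, nodes: List[str], copies: int) -> Dict[int, List[str]]:
    """Map each block index to `copies` distinct nodes.

    Round-robin placement is a hypothetical stand-in, chosen only
    because it is easy to follow; it is not HDFS's placement policy.
    """
    if not 1 <= copies <= len(nodes):
        raise ValueError("copies must be between 1 and the number of nodes")
    return {
        block: [nodes[(block + offset) % len(nodes)] for offset in range(copies)]
        for block in range(num_blocks)
    }


# Hypothetical 10-byte "dataset" and a three-machine cluster.
data = bytes(range(10))
blocks = split_into_blocks(data, block_size=4)
placement = place_blocks(len(blocks), ["node-a", "node-b", "node-c"], copies=2)

for index, holders in placement.items():
    print(f"block {index} ({len(blocks[index])} bytes) -> {holders}")
```

Concatenating the blocks in index order reproduces the original data, which is why a reader only needs the block list and the block-to-machine mapping to reassemble a file.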