ALERT

[FREE DEMO] Deploy Your Enterprise Cloud in Minutes

Hadoop YARN

Definition - What does Hadoop YARN mean?

Hadoop YARN is a specific component of the open source Hadoop platform for big data analytics, licensed by the non-profit Apache software foundation.

Major components of Hadoop include a central library system, a Hadoop HDFS file handling system, and Hadoop MapReduce, which is a batch data handling resource. In addition to these, there’s Hadoop YARN, which is described as a clustering platform that helps to manage resources and schedule tasks. The Apache software foundation, the license holder for Hadoop, describes Hadoop YARN as 'next-generation MapReduce’ or 'MapReduce 2.0.’

Techopedia explains Hadoop YARN

Experts explain that the key concept of YARN involves setting up both global and application-specific resource management components. This helps to allocate resources to particular applications and manage other kinds of resource monitoring tasks. In YARN, an application submission client submits an application to the YARN resource manager. YARN 'schedules’ applications in order to prioritize tasks and maintain big data analytics systems. This is just one part of a greater architecture for aggregating and sorting data, conducting specific queries to retrieve data, and otherwise using Hadoop and related tools to manipulate big data for business intelligence and much more. Businesses use these kinds of platforms to look at supply chains, document product and service operations, keep track of customer information, and for many other kinds of powerful data-driven and automated business processes.

Techopedia Deals

Connect with us

Techopedia on Linkedin
Techopedia on Linkedin
Tweat cdn.techopedia.com
"Techopedia" on Twitter


'@Techopedia'
Sign up for Techopedia's Free Newsletter!

Email Newsletter

Join thousands of others with our weekly newsletter

Resources
The 4th Era of IT Infrastructure: Superconverged Systems
The 4th Era of IT Infrastructure: Superconverged Systems:
Learn the benefits and limitations of the 3 generations of IT infrastructure – siloed, converged and hyperconverged – and discover how the 4th...
Approaches and Benefits of Network Virtualization
Approaches and Benefits of Network Virtualization:
Businesses today aspire to achieve a software-defined datacenter (SDDC) to enhance business agility and reduce operational complexity. However, the...
Free E-Book: Public Cloud Guide
Free E-Book: Public Cloud Guide:
This white paper is for leaders of Operations, Engineering, or Infrastructure teams who are creating or executing an IT roadmap.
Free Tool: Virtual Health Monitor
Free Tool: Virtual Health Monitor:
Virtual Health Monitor is a free virtualization monitoring and reporting tool for VMware, Hyper-V, RHEV, and XenServer environments.
Free 30 Day Trial – Turbonomic
Free 30 Day Trial – Turbonomic:
Turbonomic delivers an autonomic platform where virtual and cloud environments self-manage in real-time to assure application performance.