Apache Mahout

Definition - What does Apache Mahout mean?

Apache Mahout is an Apache Software Foundation project that is implemented on top of Apache Hadoop and uses the MapReduce paradigm.

Mahout provides scalable, distributed implementations of machine learning algorithms in the areas of clustering, collaborative filtering and classification. It also contains Java libraries for common math operations, chiefly statistics and linear algebra, as well as primitive Java collections.
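To make the MapReduce paradigm mentioned above more concrete, here is a minimal single-machine sketch in plain Java. It is an illustration only: it does not use the Hadoop or Mahout APIs, and the class and method names (MapReduceSketch, map, shuffle, reduce, run) are hypothetical. It counts how often each token appears by emitting key/value pairs, grouping them by key, and summing each group.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical single-process illustration of the map, shuffle and reduce steps.
// A real Hadoop job would distribute these steps across many machines.
class MapReduceSketch {

    // Map step: turn one input line into (token, 1) pairs.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String token : line.trim().split("\\s+")) {
            if (!token.isEmpty()) {
                pairs.add(new AbstractMap.SimpleEntry<>(token, 1));
            }
        }
        return pairs;
    }

    // Shuffle step: group all emitted values by their key.
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>()).add(pair.getValue());
        }
        return grouped;
    }

    // Reduce step: collapse the values for one key into a single sum.
    static int reduce(List<Integer> values) {
        int sum = 0;
        for (int value : values) {
            sum += value;
        }
        return sum;
    }

    // Run the three steps in sequence over all input lines.
    static Map<String, Integer> run(List<String> lines) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines) {
            pairs.addAll(map(line));
        }
        Map<String, Integer> counts = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> group : shuffle(pairs).entrySet()) {
            counts.put(group.getKey(), reduce(group.getValue()));
        }
        return counts;
    }

    public static void main(String[] args) {
        System.out.println(run(Arrays.asList("a b a", "b c")));
    }
}
```

Because each map call looks at one record and each reduce call looks at one key, both steps can in principle be spread across many machines, which is what makes the paradigm suitable for large data sets.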

Techopedia explains Apache Mahout

Apache Mahout is all about machine learning. The project aims to provide a powerful tool that makes building intelligent applications faster and easier.

Machine learning used to be the exclusive domain of academics and corporations with large research budgets, but in today’s data-driven world, the need for intelligent applications that can learn from data and user input is increasing.

Apache Mahout is used for creating applications with machine-learning techniques such as clustering, categorization, and collaborative filtering, for example to find commonalities in large data sets or to tag large volumes of web content.
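As a rough illustration of collaborative filtering, the sketch below predicts a user's rating for an unseen item from the ratings of similar users. It is written in plain Java and does not use Mahout's own API; the class name, method names and sample ratings are all hypothetical, and cosine similarity over co-rated items is just one of several possible similarity measures.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical plain-Java sketch of user-based collaborative filtering.
// It is not Mahout code; it only illustrates the idea on a tiny in-memory data set.
class CollaborativeFilteringSketch {

    // Cosine similarity computed over the items that both users have rated.
    static double similarity(Map<String, Double> a, Map<String, Double> b) {
        double dot = 0.0, normA = 0.0, normB = 0.0;
        for (Map.Entry<String, Double> entry : a.entrySet()) {
            Double other = b.get(entry.getKey());
            if (other != null) {
                dot += entry.getValue() * other;
                normA += entry.getValue() * entry.getValue();
                normB += other * other;
            }
        }
        if (normA == 0.0 || normB == 0.0) {
            return 0.0; // no overlap, so no evidence of similarity
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    // Predicted rating: similarity-weighted average of other users' ratings for the item.
    static double predict(String user, String item, Map<String, Map<String, Double>> ratings) {
        Map<String, Double> target = ratings.get(user);
        double weightedSum = 0.0, totalWeight = 0.0;
        for (Map.Entry<String, Map<String, Double>> entry : ratings.entrySet()) {
            Double rating = entry.getValue().get(item);
            if (entry.getKey().equals(user) || rating == null) {
                continue; // skip the target user and users who have not rated the item
            }
            double weight = similarity(target, entry.getValue());
            weightedSum += weight * rating;
            totalWeight += weight;
        }
        return totalWeight == 0.0 ? Double.NaN : weightedSum / totalWeight;
    }

    // Made-up sample data: three users rating items A, B and C on a 1 to 5 scale.
    static Map<String, Map<String, Double>> sampleRatings() {
        Map<String, Map<String, Double>> ratings = new HashMap<>();
        Map<String, Double> alice = new HashMap<>();
        alice.put("A", 5.0); alice.put("B", 3.0);
        Map<String, Double> bob = new HashMap<>();
        bob.put("A", 5.0); bob.put("B", 3.0); bob.put("C", 4.0);
        Map<String, Double> carol = new HashMap<>();
        carol.put("A", 1.0); carol.put("B", 5.0); carol.put("C", 2.0);
        ratings.put("alice", alice);
        ratings.put("bob", bob);
        ratings.put("carol", carol);
        return ratings;
    }

    public static void main(String[] args) {
        System.out.println(predict("alice", "C", sampleRatings()));
    }
}
```

In this sample, alice's ratings match bob's more closely than carol's, so the prediction for item C is pulled toward bob's rating of 4 rather than carol's rating of 2.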

Mahout scalability:
  • Scalable to large data sets - the core algorithms are implemented on top of scalable, distributed systems.
  • Scalable to support different business cases - Mahout is distributed under the commercially friendly Apache Software License.
  • Scalable community - there is a vibrant, diverse and responsive community to facilitate discussions on the project and its potential use cases.