SQL on Hadoop

Definition - What does SQL on Hadoop mean?

SQL on Hadoop is a class of analytical application tools that implement SQL on the Hadoop platform, combining standard SQL-style querying of structured data with the Hadoop data framework. Hadoop, like big data itself, is relatively new, and few professionals are experts in it. SQL on Hadoop simplifies access to the Hadoop framework and makes it easier to integrate with existing enterprise systems.

Techopedia explains SQL on Hadoop

SQL on Hadoop refers to the various implementations of SQL for the Hadoop platform. MapReduce, Hadoop's framework for distributing jobs across a cluster and combining their results, can execute SQL-style queries as one of its major use cases, alongside other processing methods. It therefore makes sense to build powerful tools that bring SQL, one of the most widely used languages for database query and manipulation, to Hadoop. As Hadoop gains popularity in enterprise data architecture, SQL support is key to its adoption for both the loosely structured and the structured data stored in Hadoop.
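
As a rough illustration of how a SQL-style aggregation can be expressed as MapReduce-style steps, the sketch below runs the equivalent of SELECT region, SUM(amount) FROM sales GROUP BY region in plain Python. It is a simplified, single-process model and not how any particular engine is implemented; the sample rows, the sales table, the column names and the function names are all hypothetical.

```python
from collections import defaultdict

# Hypothetical sample rows standing in for records stored in Hadoop.
ROWS = [
    {"region": "east", "amount": 10},
    {"region": "west", "amount": 5},
    {"region": "east", "amount": 7},
]


def map_phase(rows):
    """Emit one (key, value) pair per row: the GROUP BY column and the summed column."""
    for row in rows:
        yield row["region"], row["amount"]


def shuffle_phase(pairs):
    """Collect all values that share a key, as the framework would between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Apply the aggregate (SUM) to each group of values."""
    return {key: sum(values) for key, values in groups.items()}


def run_query(rows):
    # Models: SELECT region, SUM(amount) FROM sales GROUP BY region
    return reduce_phase(shuffle_phase(map_phase(rows)))


if __name__ == "__main__":
    print(run_query(ROWS))
```

In a real cluster the map and reduce steps would run in parallel on many nodes; the point here is only that the GROUP BY column becomes the key and the aggregate becomes the reduce function.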

SQL on Hadoop key drivers include:

  • Leveraging existing SQL skills present in most organizations
  • Reusing existing extract, transform, load (ETL), business intelligence (BI) and analytics infrastructure investments with Hadoop

Some SQL on Hadoop implementations include:

  • Apache Spark SQL
  • Apache Hive
  • Apache Tajo
  • Apache Drill
  • HP Vertica on MapR
  • ODBC Drivers
  • Presto
  • Shark