Hadoop Native SQL Database powered by Apache HAWQ

Deliver more value from your Hadoop platform

  • Benefits
  • Features

Pivotal HDB, powered by Apache HAWQ (incubating), is the Apache Hadoop® native SQL database for data science and machine learning workloads. With Pivotal HDB, you can ask more questions of your data in Hadoop more often to gain deeper, actionable insights. You no longer need to sample data or move it to another platform to perform advanced analytics.

Quickly unlock business insights with exceptional performance

Run predictive analytics with higher accuracy by using your entire dataset, not just a portion of it, directly in Hadoop. Execute near-real-time, ad-hoc queries at scale. Complete analytics tasks faster – in seconds or minutes, not hours or days.

Integrate SQL BI tools with confidence

Tap into a large ecosystem of data analysis and data visualization tools such as SAS, Tableau and more. Leverage existing SQL skills to ramp up and execute quickly in Hadoop.

Iterate advanced analytics and machine learning in database

Utilize advanced statistical functions and algorithms to spot relationships, patterns, and trends - with speed and at scale. Operate directly in Hadoop data.


The future of SQL on Hadoop is Hadoop Native SQL. Pivotal HDB is an elastic SQL query engine that combines exceptional MPP-based analytics performance, robust ANSI SQL compliance, and integrated Apache MADlib (incubating) machine learning – enabling you to run fast ad hoc queries and fast predictive analytics. Evolved from over a decade’s worth of intellectual property from Pivotal Greenplum™ and open source PostgreSQL, Pivotal HDB operates natively in Hadoop, which simplifies overall system management of cluster resources. Other Pivotal HDB features include:

High-Performance Architecture

Pivotal HDB’s parallel processing architecture delivers high performance throughput and low latency (potentially, near-real-time) query responses that can scale to petabyte-sized datasets. Pivotal HDB also features a cutting-edge, cost-based SQL query optimizer and dynamic pipelining technology for efficient performance operation.

Robust ANSI SQL Compliance

Pivotal HDB complies with ANSI SQL-92, -99, and -2003 standards, plus OLAP extensions. Leverage existing SQL expertise and existing SQL-based applications and BI/data visualization tools. Execute complex queries and joins, including roll-ups and nested queries.

Deep Analytics and Machine Learning

10+ years of massively parallel processing development from Greenplum, Postgres, and MADlib at your fingertips. Pivotal HDB integrates statistical and machine learning capabilities that can be natively invoked from SQL and applied natively to large data sets across a Hadoop cluster. Pivotal HDB supports PL/Python, PL/Java and PL/R programming languages.

Flexible Data Format Support

HDB supports multiple data file formats including Apache Parquet and HDB binary data files, plus HBase and Avro via HDB’s Pivotal Extension Framework (PXF) services. HDB interfaces with HCatalog, which enables you to query an even broader range of data formats.

Tight Integration with Hadoop Ecosystem

Pivotal HDB plugs into the Apache Ambari installation, management and configuration framework. This provides a Hadoop-native mechanism for installation and deployment of Pivotal HDB and for monitoring cluster resources across Pivotal HDB and the rest of the Hadoop ecosystem.

Works with ODPi Compliant Platforms

Pivotal HDB is interoperable with ODPi-runtime compliant distributions of Hadoop, such as Hortonworks Data Platform.


GE Shores Up Savings For Aviation With a Data Lake

Contact Us

Thank you for your interest!
We will get back to you shortly.