Dataiku can be deployed on-premises or in the cloud (e.g. AWS or Azure) and connects via JDBC to Pivotal® Greenplum deployments. Dataiku users can then connect to, load, transform, and query tables stored in Pivotal Greenplum.
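As a rough illustration of the JDBC connection mentioned above, the sketch below assembles a connection URL. It assumes the PostgreSQL-compatible JDBC URL form (`jdbc:postgresql://host:port/database`), which Greenplum generally accepts because it is built on PostgreSQL; the host name, port, and database name are hypothetical placeholders, and the helper `build_jdbc_url` is not part of Dataiku or Greenplum.

```python
def build_jdbc_url(host: str, port: int, database: str) -> str:
    """Assemble a PostgreSQL-style JDBC URL for a Greenplum master host.

    Assumption: the deployment is reached through a PostgreSQL-compatible
    JDBC driver. All argument values used below are made-up examples.
    """
    return f"jdbc:postgresql://{host}:{port}/{database}"


if __name__ == "__main__":
    # Hypothetical master host and database; 5432 is the PostgreSQL default port.
    print(build_jdbc_url("gp-master.example.com", 5432, "analytics"))
```

The resulting string is the kind of value that would typically be supplied when defining a database connection; the exact connection settings depend on the Dataiku and Greenplum versions in use.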
To facilitate visual development, data engineers can create custom SQL Recipes in Dataiku that invoke Pivotal Greenplum's in-database analytics functions, such as Apache MADlib for data preparation and machine learning, PostGIS for geospatial analysis, and GPText for text analytics. This lets data science teams use Greenplum's massively parallel processing (MPP) architecture to process terabyte- and petabyte-scale data sets in parallel for faster results.
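To make the idea of invoking an in-database function from a SQL Recipe more concrete, here is a minimal sketch that renders the SQL text for training a linear regression with MADlib's `madlib.linregr_train`. The table and column names (`houses`, `price`, `tax`, `bath`, `size`) are illustrative assumptions, and the Python helper `madlib_linregr_sql` is hypothetical; it only builds the statement and does not execute it.

```python
def madlib_linregr_sql(source_table, out_table, dependent, independents):
    """Render SQL that trains a MADlib linear regression inside the database.

    Note: names are interpolated without escaping, so this sketch is only
    suitable for trusted, hard-coded identifiers.
    """
    # MADlib takes the independent variables as a SQL array expression;
    # the leading constant 1 adds an intercept term.
    features = "ARRAY[1, " + ", ".join(independents) + "]"
    return (
        "SELECT madlib.linregr_train(\n"
        f"    '{source_table}',\n"
        f"    '{out_table}',\n"
        f"    '{dependent}',\n"
        f"    '{features}'\n"
        ");"
    )


if __name__ == "__main__":
    # Hypothetical table and columns, used purely for illustration.
    print(madlib_linregr_sql("houses", "houses_linregr", "price",
                             ["tax", "bath", "size"]))
```

The printed statement could be pasted into the body of a SQL Recipe, so the model fitting runs in parallel across the Greenplum segments rather than pulling the data out of the database.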