Pentaho and Analytic Databases
Native Support, Visual Interfaces, Immediate Access, Interactive Discovery
Pentaho Business Analytics empowers users to easily prepare, model, visualize, explore and make predictions from data sets stored in high performance analytic databases. Pentaho simplifies the end-to-end data analytics process by providing a complete platform from data to analytics.
A fully integrated solution, Pentaho equips users with powerful data preparation and business analytics that allow immediate exploration and discovery of big data trends and insights.
Visual development interfaces for data preparation and modeling in Analytic Databases
Pentaho’s visual development tools drastically reduce the time to design, develop and deploy analytics compared to traditional custom coding approaches. Pentaho’s visual interfaces allow developers and data scientists to work with analytic databases using the familiar extract-transform-load (ETL) and meta data modeling approaches, instead of hand coding scripts.
In addition, Pentaho simplifies the process of supporting evolving business requirements. Users can dynamically alter schemas, including adding new columns and tables, and adapt load jobs to easily integrate data from multiple sources without the need for manual scripting and programming.
Having familiar and easy-to-use interfaces eliminates the need to hire analytic database specialists to deploy and maintain your system with manual coding.
Graphical job steps for orchestration across multiple jobs and sources
Pentaho provides detailed graphical job steps for orchestrating jobs in analytic databases and in other large data stores.
These include conditional checking steps, event waiting steps, execution steps and notification steps. Together these steps enable easy visual assembly of powerful job flow logic across multiple jobs and data sources.
Rich visualization and interactive data discovery
A tightly coupled data integration and business analytics platform enables IT and business users to easily access, integrate, visualize and explore large data volumes stored in high performance analytic databases through:
- Rich visualization – Interactive web-based interfaces for ad hoc reporting, charting and dashboards
- Flexible exploration – Views of data across dimensions such as time, product, and geography and across measures such as revenue and quantity
- Predictive analysis – Powerful data mining and predictive analytics capabilities using advanced statistical and data mining algorithms such as classification, regression, clustering and association rules
High performance modern engine
Pentaho Data Integration is a modern data flow engine designed for high performance computing environments. The engine scales out to distribute processing across server clusters to fully leverage 64-bit multi-core CPUs.
Point-and-click configuration for bulk loaders
Pentaho provides native support for many of the leading analytic database bulk loader utilities, which are command-line utilities for parallel loading of large data sets.
Pentaho also provides a visual point-and-click configuration interface for incorporating bulk loaders.
Push-down, in-database processing
Pentaho gives the option to push down transformation operations into the analytic database engine itself, instead of performing transformation operations within the ETL engine, often referred to as extract-load-transform (ELT). This enables organizations to fully leverage their investment in analytic databases, or alternatively leverage investments in high performance ETL servers.
Fast query execution with native SQL dialects
Pentaho goes beyond ANSI-standard SQL to enable use of database-specific query features that often results in faster query execution leading to significant performance benefits for end users.
Databases with native SQL dialect include Informix, Ingres, Interbase, LucidDB, SQL Server, MySQL, Oracle, Postgres, Sybase and Teradata.
Broad support for analytic database platforms
Pentaho's unparalleled native support for analytic databases includes:
- EMC Greenplum
- HP NonStop SQL/MX
- HP Vertica
- IBM Netezza
- Infobright
- Actian Vectorwise
- LucidDB
- MonetDB
- Teradata