Pivotal Greenplum-Informatica Connector 1.0.0 (Experimental) Release Notes

The Pivotal Greenplum-Informatica Connector supports high speed data transfer from an Informatica PowerCenter cluster to a Pivotal Greenplum Database cluster for batch and streaming ETL operations.

This an experimental release of Pivotal Greenplum-Informatica Connector that includes a new architecture, new features, and other improvements as compared to the previous Informatica connector.

Warning: Experimental features are not recommended or supported for production deployments. These features may change in or be removed from future versions of the product based on further testing and feedback. Moreover, any features that may be visible in the open source code but that are not described in the product documentation should be considered experimental and unsupported for production use.

Supported Platforms

The following table identifies the supported component versions for Pivotal Greenplum-Informatica Connector:

Greenplum-Informatica Connector Version Pivotal Greenplum Version PowerCenter Version
1.0.0 (Experimental) 5.6.x 10.x

Refer to the Pivotal Greenplum Database documentation for detailed information on Pivotal Greenplum Database.

See the Informatica PowerCenter documentation for information about the PowerCenter platform and tools.

New Features

This release of the Pivotal Greenplum-Informatica Connector features an entirely new architecture compared to previous versions of the connector. The main feature of the new connector architecture is the Greenplum Streaming Server (GPSS), which manages all communication between the Informatica PowerCenter node and the Pivotal Greenplum Database cluster. GPSS enables several new features, such as:

  • Improved Message and Data Transfer

    The Pivotal Greenplum-Informatica Connector uses an improved binary format, gRPC, to transfer ETL requests and data from the PowerCenter node to Greenplum Database. This removes the need to configure and maintain named pipes on the PowerCenter node, as with previous connector implementations.

  • Better Support for Multiple ETL Requests

    Because GPSS runs as a service in Greenplum Database, you can more easily initiate multiple, simultaneous ETL requests. The GPSS service provides equivalent functionality to gpfdist and gpload functionality as necessary in the Greenplum Database cluster to support multiple ETL jobs. Previous connector implementations required the actual gpload and gpfdist utilities on the local PowerCenter node, and running multiple ETL processes required starting multiple instances of those utilities for data transfer.

Known Issues and Limitations

Known issues and limitations related to the this experimental release of the Pivotal Greenplum-Informatica Connector include:

  • When performing an Update operation, GPSS displays error messages such as Please use following query to access the detailed error and /api.Gpss/Close SuccessCount:4 ErrorCount:6 <nil>. You can ignore these messages, as they only indicate that the update number is not equal to number of lines loaded.
  • The PreSQL, PostSQL, TruncateTable, and Timeout configuration properties are not yet supported. See Configuring Greenplum Target Session Properties.
  • The Timeout value is fixed at “0”, which means that the connector will wait indefinitely for a response from GPSS.
  • The Informatica Developer Tool is the only client tool that Pivotal has tested with Pivotal Greenplum-Informatica Connector. For example, compatibility with the PowerCenter Client Tools has not been tested.
  • The Informatica Developer Tools limits the precision value of Greenplum Database loader target columns to 65,536.
  • Server components for the connector have only been tested with the Linux version of Informatica Data Integration Service.
  • Informatica does not support removing the registration for a plugin, so there is currently no mechanism for removing a Pivotal Greenplum-Informatica Connector installation from Informatica.