DataStage connectors

A DataStage connector is a palette node that provides data connectivity and metadata integration for external data sources, such as relational databases, public cloud storage services, or messaging software.

Connectors for remote data sources

For connectors for remote data sources, you need to create a project connection asset for the associated DataStage connector before you can load data to or from it in DataStage. A connection contains the information necessary to connect to the data source. For instructions, see Adding connections to projects.

The "(optimized)" version of a connection, Db2 (optimized), Netezza (optimized), Oracle (optimized), and Salesforce.com (optimized), gives you increased performance and more features such as before and after SQL statements, and sparse lookup and rejects links. However, you cannot use the connection with other tools in Cloud Pak for Data as a Service. You can use the connections that are available to other tools (for example, Salesforce.com), if you already created the connection in Cloud Pak for Data as a Service, and you want to reuse it in DataStage.

After you create the connection asset, open DataStage, and add the associated connector to the canvas. Double-click the connector and select the connection from the Stage tab. Go to Properties > Connection.

The connectors are listed on the DataStage palette so that you can build your flow and add the connection assets later.

DataStage supports these project connections:

IBM services

Third-party services

* Denotes a project connection that is for DataStage only.

Other types of connector nodes

These entries in the palette do not require that you create a connection asset in the project.