DataStage connectors
A DataStage connector is a palette node that provides data connectivity and metadata integration for external data sources, such as relational databases, public cloud storage services, or messaging software.
Connectors for remote data sources
For connectors for remote data sources, you need to create a project connection asset for the associated DataStage connector before you can load data to or from it in DataStage. A connection contains the information necessary to connect to the data source. For instructions, see Adding connections to projects.
The "(optimized)" version of a connection, Db2 (optimized), Netezza (optimized), Oracle (optimized), and Salesforce.com (optimized), gives you increased performance and more features such as before and after SQL statements, and sparse lookup and rejects links. However, you cannot use the connection with other tools in Cloud Pak for Data as a Service. You can use the connections that are available to other tools (for example, Salesforce.com), if you already created the connection in Cloud Pak for Data as a Service, and you want to reuse it in DataStage.
After you create the connection asset, open DataStage, and add the associated connector to the canvas. Double-click the connector and select the connection from the Stage tab. Go to .
The connectors are listed on the DataStage palette so that you can build your flow and add the connection assets later.
DataStage supports these project connections:
IBM services
- Cloud Object Storage
- Data Virtualization Manager for z/OS
- Databases for PostgreSQL
- Db2
- Db2 (optimized)*
- Db2 Big SQL
- Db2 for i
- Db2 for z/OS
- Db2 Hosted
- Db2 on Cloud
- Db2 Warehouse
- Informix
- Netezza (optimized)*
- Netezza (PureData System for Analytics)
Third-party services
- Amazon RDS for PostgreSQL
- Amazon RedShift
- Amazon
S3
Note: In the Details card, select Use DataStage properties to access the DataStage-specific properties. The DataStage-specific properties provide more features and granular control of the flow execution, similar to DataStage "optimized" connectors. - Apache
Hive
Note: In the Details card, select Use DataStage properties to access the DataStage-specific properties. The DataStage-specific properties provide more features and granular control of the flow execution, similar to DataStage "optimized" connectors. - Apache Kafka*
- FTP (remote file system transfer)
- Google BigQuery
- Google Cloud Storage
- Greenplum
- HTTP
- Microsoft Azure Blob Storage
- Microsoft Azure Data Lake Store
- Microsoft Azure File Storage
- Microsoft SQL Server
- Oracle
- Oracle (optimized)*
- PostgreSQL
- Salesforce.com
- Salesforce.com (optimized)*
- SAP OData
- Snowflake
Note: In the Details card, select Use DataStage properties to access the DataStage-specific properties. The DataStage-specific properties provide more features and granular control of the flow execution, similar to DataStage "optimized" connectors. For example for Snowflake, DataStage properties have explicit options for Create and Append operations. - Teradata
Other types of connector nodes
These entries in the palette do not require that you create a connection asset in the project.