DataStage Parallel Extender (DataStage PX)

Definition - What does DataStage Parallel Extender (DataStage PX) mean?

DataStage Parallel Extender (DataStage PX) is an IBM data integration tool. It is one of many widely used extract, transform and load (ETL) tools in the data warehousing industry. It can collect data from heterogeneous sources, transform it according to business requirements and load it into the appropriate data warehouses.

DataStage PX may also be called DataStage Enterprise Edition.

Techopedia explains DataStage Parallel Extender (DataStage PX)

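DataStage Parallel Extender uses a parallel architecture to process data. The two main types of parallelism implemented in DataStage PX are pipeline parallelism and partition parallelism. In pipeline parallelism, a downstream stage starts working on rows as soon as the upstream stage produces them, rather than waiting for the whole data set. In partition parallelism, the data is split into subsets that the same stage processes at the same time. Together, these can substantially shorten processing time for large data volumes.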
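
The two kinds of parallelism named above can be illustrated with a short, tool-independent sketch. This is plain Python, not DataStage code or any DataStage API: the function names, the `customer_id` and `amount` fields and the modulo partitioning rule are all assumptions made for illustration. The chained generators only mimic the row-at-a-time flow of pipelining (they do not run stages concurrently), and the thread pool shows the structure of partitioned execution rather than a real speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def partition_by_key(rows, key, num_partitions):
    """Partition parallelism, step 1: split rows into subsets by an integer key."""
    partitions = [[] for _ in range(num_partitions)]
    for row in rows:
        partitions[row[key] % num_partitions].append(row)
    return partitions


def read_stage(rows):
    """Hypothetical source stage: hands rows downstream one at a time."""
    for row in rows:
        yield row


def transform_stage(rows):
    """Hypothetical transform stage: derives a new column for each row."""
    for row in rows:
        yield {**row, "amount_cents": row["amount"] * 100}


def run_pipeline(partition):
    """Pipeline-style flow: each row moves through both stages before the
    next row is read, so no stage waits for the full data set."""
    return list(transform_stage(read_stage(partition)))


def run_parallel(rows, key="customer_id", num_partitions=2):
    """Partition parallelism, step 2: run the same pipeline on every subset."""
    partitions = partition_by_key(rows, key, num_partitions)
    with ThreadPoolExecutor(max_workers=num_partitions) as pool:
        results = list(pool.map(run_pipeline, partitions))
    # Collect the partitions back into a single output, in partition order.
    return [row for part in results for row in part]
```

Note that the output order follows the partitions, not the input order, which is also why partitioned jobs generally need an explicit sort or collection step when row order matters.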

DataStage Parallel Extender incorporates a variety of stages through which source data is processed and loaded into target databases, which may hold terabytes of data. Besides stages, DataStage PX uses containers to reuse job components, and sequences to run and schedule multiple jobs together.

The commonly used stages in DataStage Parallel Extender include:

  • Transformer
  • Aggregator
  • Data Set
  • Copy
  • Change Apply
  • Modify
  • Filter
  • Join
  • Merge
  • Lookup
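
To give a feel for what a few of the stages listed above do, here is a rough conceptual sketch in plain Python. It is not DataStage code: in the product, stages are configured in a graphical job designer rather than written as functions. The function names, the field names and the choice to give unmatched lookup rows a `None` value are assumptions made for this example only.

```python
from collections import defaultdict


def filter_stage(rows, predicate):
    """Filter: pass through only the rows that satisfy a condition."""
    return [row for row in rows if predicate(row)]


def lookup_stage(rows, reference, key, field):
    """Lookup: enrich each row with a value from a reference table.

    Assumption for this sketch: rows without a match get None.
    """
    return [{**row, field: reference.get(row[key])} for row in rows]


def aggregator_stage(rows, group_key, value_key):
    """Aggregator: group rows by a column and total another column."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[group_key]] += row[value_key]
    return dict(totals)


if __name__ == "__main__":
    # Hypothetical sample data: orders plus a customer-to-region reference.
    orders = [
        {"customer_id": 1, "amount": 50},
        {"customer_id": 2, "amount": 5},
        {"customer_id": 1, "amount": 30},
    ]
    regions = {1: "EU", 2: "US"}

    kept = filter_stage(orders, lambda row: row["amount"] >= 10)
    enriched = lookup_stage(kept, regions, "customer_id", "region")
    print(aggregator_stage(enriched, "region", "amount"))
```

Chaining the functions this way mirrors how a job links stages: the output of one stage becomes the input of the next.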
