Post-Processing Deduplication (PPD)
Definition - What does Post-Processing Deduplication (PPD) mean?
Post-processing deduplication (PPD) is an approach in which software removes redundant data from a data set after the data has been written to its storage destination. It is also called asynchronous deduplication, and it is often used where administrators consider it inefficient or infeasible to remove redundant data before or during transfer.
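To make the idea concrete, here is a minimal Python sketch of a deduplication pass that runs only after the data has landed. It is an illustration, not how any particular product works: the function names `post_process_dedup` and `restore`, the use of whole blocks as the unit of comparison, and the choice of SHA-256 fingerprints are all assumptions made for the example.

```python
import hashlib


def post_process_dedup(stored_blocks):
    """Deduplicate blocks that were already written to storage as-is.

    Returns (unique_blocks, refs):
      unique_blocks maps a SHA-256 digest to one copy of the block,
      refs lists, for each original block, the digest it now points to.
    """
    unique_blocks = {}
    refs = []
    for block in stored_blocks:
        digest = hashlib.sha256(block).hexdigest()
        # Keep only the first copy of each distinct block.
        unique_blocks.setdefault(digest, block)
        refs.append(digest)
    return unique_blocks, refs


def restore(unique_blocks, refs):
    """Rebuild the original block sequence from the deduplicated form."""
    return [unique_blocks[digest] for digest in refs]


# Hypothetical data that was transferred unfiltered, duplicates included.
landed = [b"alpha", b"beta", b"alpha", b"alpha", b"gamma"]
unique, refs = post_process_dedup(landed)
```

In this sketch the five landed blocks shrink to three stored copies plus five small references, and `restore(unique, refs)` returns the original sequence. Note that the full, unfiltered list has to exist in storage before the pass can run.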
Techopedia explains Post-Processing Deduplication (PPD)
While administrators may find a post-processing deduplication method easier to use, this type of data optimization has drawbacks. The main one is that the storage destination needs enough space to hold the larger, unfiltered data set until the deduplication pass has run. Provided that ample storage is available and that parsing data in storage does not pose technical difficulties, post-processing deduplication can be a desirable way to clean up a data set for future use after it has already been tucked away in "cold storage."
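The space drawback can be shown with simple arithmetic. The following Python sketch is a rough simplification under stated assumptions: the whole data set lands before the deduplication pass starts, metadata overhead is ignored, and the deduplication ratio is a hypothetical figure chosen for illustration. The function name `capacity_needed` is likewise made up for this example.

```python
def capacity_needed(raw_size, dedup_ratio):
    """Estimate storage needs for a post-processing deduplication target.

    raw_size:    size of the unfiltered data set (any unit)
    dedup_ratio: assumed ratio of raw size to deduplicated size (>= 1)

    Returns (peak, steady): the space needed before the pass runs and
    the space occupied once redundant data has been removed.
    """
    if dedup_ratio < 1:
        raise ValueError("dedup_ratio must be at least 1")
    peak = raw_size  # the destination must first hold everything
    steady = raw_size / dedup_ratio
    return peak, steady


# Hypothetical example: 10,000 GB of raw data with an assumed 4:1 ratio.
peak, steady = capacity_needed(10_000, 4)
print(peak, steady)  # 10000 2500.0
```

Under these assumptions the destination must be sized for the full 10,000 GB even though only 2,500 GB remain after the pass, which is the trade-off against removing duplicates before or during transfer.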