ALERT

[LAST CHANCE] Data Layer: Modern Business, Defined

Post-Processing Deduplication (PPD)

Definition - What does Post-Processing Deduplication (PPD) mean?

Post-process deduplication (PPD) refers to a system where software processes filter redundant data from a data set after it has been transferred to a data storage location. This can also be called asynchronous deduplication, and is often used where managers consider it inefficient or unfeasible to remove redundant data before or during transfer.

Techopedia explains Post-Processing Deduplication (PPD)

Post-process deduplication can be contrasted to a practice called in-line deduplication where the redundant data is taken out as the data is transferred for storage. One of the reasons that administrators may choose a post-process deduplication approach is when inline deduplication can slow down the transfer process and make it more difficult to easily and efficiently archive data.

While managers or administrators may find it easier to use a post-process deduplication method, there are drawbacks to this type of data optimization. One is the fact that the data storage destination will need to have enough space to fit the larger unfiltered data set. Assuming that data managers have ample storage and that parsing data in storage doesn’t pose technical difficulties, the post-process deduplication method can often be a desirable way to clean up a data set for future use after it has already been carefully tucked away in "cold storage."

Techopedia Deals

Connect with us

Techopedia on Linkedin
Techopedia on Linkedin
Tweat cdn.techopedia.com
"Techopedia" on Twitter


'@Techopedia'
Sign up for Techopedia's Free Newsletter!

Email Newsletter

Join thousands of others with our weekly newsletter

Resources
The 4th Era of IT Infrastructure: Superconverged Systems
The 4th Era of IT Infrastructure: Superconverged Systems:
Learn the benefits and limitations of the 3 generations of IT infrastructure – siloed, converged and hyperconverged – and discover how the 4th...
Approaches and Benefits of Network Virtualization
Approaches and Benefits of Network Virtualization:
Businesses today aspire to achieve a software-defined datacenter (SDDC) to enhance business agility and reduce operational complexity. However, the...
Free E-Book: Public Cloud Guide
Free E-Book: Public Cloud Guide:
This white paper is for leaders of Operations, Engineering, or Infrastructure teams who are creating or executing an IT roadmap.
Free Tool: Virtual Health Monitor
Free Tool: Virtual Health Monitor:
Virtual Health Monitor is a free virtualization monitoring and reporting tool for VMware, Hyper-V, RHEV, and XenServer environments.
Free 30 Day Trial – Turbonomic
Free 30 Day Trial – Turbonomic:
Turbonomic delivers an autonomic platform where virtual and cloud environments self-manage in real-time to assure application performance.