Data Verification

Definition - What does Data Verification mean?

Data verification is the process of checking data for accuracy after a data migration. There are different types of verification:

  • Full verification, where all of the data is checked
  • Sampling verification, where a small sample of the data is checked

Data verification can be both expensive and time-consuming to carry out.

Techopedia explains Data Verification

When data is migrated from a data warehouse for use in a big data processing system, the data needs to be checked to ensure that it is accurate. Everything from spelling errors to inaccurate numbers to data loss could jeopardize a big data project.

One method of verifying the data is to compare each record in the source system against its migrated counterpart in the target system, one-to-one. This can be time-consuming, and keeping two systems running for the comparison can be expensive.
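A one-to-one comparison can be sketched in a few lines of Python. This is a minimal illustration, not a description of any particular migration tool: it assumes both systems can be read into dictionaries keyed by a unique record ID, and the function name `full_verify` and the sample records are hypothetical.

```python
def full_verify(source, target):
    """Compare every source record against the target, keyed by record ID.

    Assumes `source` and `target` are dicts mapping a unique ID to a record.
    """
    # Records that were lost during the migration
    missing = [key for key in source if key not in target]
    # Records that appear in the target but never existed in the source
    extra = [key for key in target if key not in source]
    # Records present on both sides whose contents differ
    mismatched = [
        key for key in source
        if key in target and source[key] != target[key]
    ]
    return {"missing": missing, "extra": extra, "mismatched": mismatched}


# Hypothetical data: record 2 has a spelling error, record 3 was lost
source = {1: ("Alice", 30), 2: ("Bob", 25), 3: ("Carol", 41)}
target = {1: ("Alice", 30), 2: ("Bbo", 25)}

report = full_verify(source, target)
print(report)
```

Because every record is read from both systems, the cost grows with the size of the data set, which is where the time and expense come from.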

It is also possible to check just a subset of the data, but a sample cannot guarantee that the unchecked records are accurate. Administrators must weigh the time and expense of data verification against the level of accuracy they need. Automating the process is one solution.
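Checking only a subset might look like the following sketch. It makes the same assumption that both systems can be read as dictionaries keyed by record ID; the function name `sample_verify`, the sample size, and the test data are all hypothetical choices for illustration.

```python
import random


def sample_verify(source, target, sample_size, seed=None):
    """Check a random sample of source records against the target.

    Returns the IDs that were checked and the IDs that failed the check.
    """
    rng = random.Random(seed)
    keys = sorted(source)
    # Never ask for more records than exist
    checked = rng.sample(keys, min(sample_size, len(keys)))
    # A record fails if it is missing from the target or differs from it
    failed = [
        key for key in checked
        if key not in target or target[key] != source[key]
    ]
    return checked, failed


# Hypothetical data: 100 records, one of which was corrupted in migration
source = {i: f"record-{i}" for i in range(100)}
target = dict(source)
target[42] = "record-4Z"

checked, failed = sample_verify(source, target, sample_size=10, seed=1)
print(f"checked {len(checked)} records, {len(failed)} failed")
```

With 10 of 100 records checked, the single corrupted record will usually go unnoticed, which illustrates the tradeoff: a smaller sample is cheaper but leaves more of the data unverified. Running such a check from a script rather than by hand is one simple form of automation.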
