Data verification is the process of checking data for accuracy after a data migration. There are different types of verification.
Data verification can be both expensive and time-consuming to carry out.
When data is migrated from a data warehouse for use in a big data processing system, the data needs to be checked to ensure that it is accurate. Everything from spelling errors to inaccurate numbers to data loss could jeopardize a big data project.
One method of verifying the data is a one-to-one comparison of the data in the source system with the migrated data in the target system. This can be time-consuming, and keeping two systems running for the comparison can be expensive.
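A one-to-one comparison of this kind might look roughly like the following minimal sketch. It assumes that both systems can export their rows as Python dictionaries and that every row carries a unique key column; the column name `id` and the function names `row_fingerprint` and `compare_one_to_one` are hypothetical and chosen only for illustration.

```python
import hashlib


def row_fingerprint(row):
    """Return a stable hash of a row's values, taken in sorted column order."""
    canonical = "|".join(f"{column}={row[column]!r}" for column in sorted(row))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def compare_one_to_one(source_rows, target_rows, key="id"):
    """Compare every source row with its migrated counterpart.

    Assumes each row is a dict and that `key` (a hypothetical column
    name) uniquely identifies a row in both systems.
    """
    source = {row[key]: row_fingerprint(row) for row in source_rows}
    target = {row[key]: row_fingerprint(row) for row in target_rows}

    # Rows that were lost during the migration.
    missing = sorted(k for k in source if k not in target)
    # Rows that exist only in the target system.
    unexpected = sorted(k for k in target if k not in source)
    # Rows present in both systems whose contents differ.
    changed = sorted(k for k in source if k in target and source[k] != target[k])

    return {"missing": missing, "unexpected": unexpected, "changed": changed}
```

Comparing fingerprints rather than whole rows keeps the amount of data held in memory small, but every row in both systems still has to be read, which is where the time and cost come from.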
It is also possible to check just a subset of the data, but a clean sample cannot guarantee that the rest of the data is accurate. Administrators must weigh the tradeoff between keeping the time and expense of data verification down and ensuring accuracy. Automating the process is one solution.
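As an illustration of how a subset check could be automated, the sketch below draws a random sample of source rows and compares each one with its migrated counterpart. It assumes rows are available as lists of Python dictionaries with a unique key column; the column name `id` and the function name `verify_sample` are hypothetical.

```python
import random


def verify_sample(source_rows, target_rows, sample_size, key="id", seed=None):
    """Check a random sample of source rows against the migrated data.

    Assumes `source_rows` is a list of dicts and that `key` (a
    hypothetical column name) uniquely identifies each row.
    """
    target_by_key = {row[key]: row for row in target_rows}

    # A seed makes the sample reproducible, which helps when re-running a check.
    rng = random.Random(seed)
    sample = rng.sample(source_rows, min(sample_size, len(source_rows)))

    # A sampled row fails if it is missing from the target or differs from it.
    mismatched = sorted(
        row[key] for row in sample if target_by_key.get(row[key]) != row
    )
    return {"checked": len(sample), "mismatched": mismatched}
```

An empty `mismatched` list only says that the sampled rows were migrated correctly; errors in rows outside the sample go undetected, so the sample size is where the tradeoff between cost and accuracy is set.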
Margaret is an award-winning technical writer and teacher known for her ability to explain complex technical subjects to a non-technical business audience. Over the past twenty years, her IT definitions have been published by Que in an encyclopedia of technology terms and cited in articles by the New York Times, Time Magazine, USA Today, ZDNet, PC Magazine, and Discovery Magazine. She joined Techopedia in 2011. Margaret's idea of a fun day is helping IT and business professionals learn to speak each other’s highly specialized languages.