Data Profiling

What Does Data Profiling Mean?

Data profiling is a technique used to examine data for different purposes like determining accuracy and completeness. This process examines a data source such as a database to uncover the erroneous areas in data organization. Deployment of this technique improves data quality.


Data profiling is also referred to as data discovery.

Techopedia Explains Data Profiling

Data profiling is the method of examining the data available in a data source and collecting statistics and information about that data. Such statistics help to identify the use and data quality of metadata. This method is widely used in enterprise data warehousing.

Data profiling clarifies the structure, relationship, content and derivation rules of data, which aid in the understanding of anomalies within metadata. Data profiling uses different kinds of descriptive statistics including mean, minimum, maximum, percentile, frequency and other aggregates such as count and sum. The additional metadata information obtained during profiling is data type, length, discrete values, uniqueness and abstract type recognition.


Related Terms

Margaret Rouse
Technology Expert

Margaret is an award-winning technical writer and teacher known for her ability to explain complex technical subjects to a non-technical business audience. Over the past twenty years, her IT definitions have been published by Que in an encyclopedia of technology terms and cited in articles by the New York Times, Time Magazine, USA Today, ZDNet, PC Magazine, and Discovery Magazine. She joined Techopedia in 2011. Margaret's idea of a fun day is helping IT and business professionals learn to speak each other’s highly specialized languages.