Definition - What does Extraction mean?

Extraction is the process of retrieving relevant information from data sources according to a specific pattern for use in a data warehousing environment. It gives the data meaning and is the first step of the data transformation process. From a large collection of data coming from various sources, extraction selects only the data that fit a given condition or category.

Techopedia explains Extraction

In a data warehousing environment, a large collection of data coming from various structured and unstructured sources must be processed, transformed and stored before meaningful conclusions and predictions can be derived from it. The data from the primary sources must be imported into the data warehousing system in a systematic manner that makes it easy to perform operations on the data. This process is called extraction. Extraction adds structure to otherwise unstructured data by following certain rules. The following are some of the techniques used in data extraction:

  • Pattern matching
  • Table-based approach
  • Text analytics
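
As an illustration of the first technique, the following is a minimal sketch of pattern matching in Python. The sample lines, the field names and the `extract_orders` helper are assumptions made up for this example and do not come from any particular warehousing product. A rule, written as a regular expression, picks out only the lines that fit it and turns each one into a structured record.

```python
import re

# Hypothetical unstructured input: free-text notes from a source system.
RAW_LINES = [
    "2021-03-04 order #1001 shipped to Berlin, total 59.90 EUR",
    "note: customer called about delivery",
    "2021-03-05 order #1002 shipped to Madrid, total 120.00 EUR",
]

# The extraction rule: a date, an order number, a city and an amount.
ORDER_PATTERN = re.compile(
    r"(?P<date>\d{4}-\d{2}-\d{2}) order #(?P<order_id>\d+) "
    r"shipped to (?P<city>[A-Za-z ]+), total (?P<total>\d+\.\d{2}) EUR"
)


def extract_orders(lines):
    """Return one structured record per line that matches the pattern."""
    records = []
    for line in lines:
        match = ORDER_PATTERN.search(line)
        if match is None:
            continue  # the line does not fit the rule, so it is skipped
        record = match.groupdict()
        # Convert the captured text into typed values.
        record["order_id"] = int(record["order_id"])
        record["total"] = float(record["total"])
        records.append(record)
    return records


orders = extract_orders(RAW_LINES)
for order in orders:
    print(order)
```

Only the two order lines produce records; the free-text note is left out because it does not match the rule. In practice the resulting records would then be passed on to the transformation and loading steps.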
