Extraction

Definition - What does Extraction mean?

Extraction is the process of deriving relevant information from data sources in a specific pattern for use in a data warehousing environment. Extraction adds meaning to the data and is the first step of the data transformation process. It picks out only the data that fit a given condition or category from a large collection of data coming from various sources.
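As a rough illustration of picking out only the data that fit a condition, the Python sketch below filters a small in-memory collection of records. The record fields (`order_id`, `region`, `amount`) and the `extract` helper are hypothetical names chosen for this example, not part of any particular data warehousing product.

```python
from typing import Callable, Dict, Iterable, List

Record = Dict[str, object]


def extract(records: Iterable[Record], condition: Callable[[Record], bool]) -> List[Record]:
    """Return only the records that satisfy the given condition."""
    return [record for record in records if condition(record)]


# Hypothetical rows gathered from several sources.
source_rows = [
    {"order_id": 1, "region": "EU", "amount": 120.0},
    {"order_id": 2, "region": "US", "amount": 75.5},
    {"order_id": 3, "region": "EU", "amount": 42.0},
]

# Keep only the rows that belong to the "EU" category.
eu_orders = extract(source_rows, lambda row: row["region"] == "EU")
```

In a real warehouse the condition would more likely be pushed down to the source system as a query, but the idea of selecting by condition or category is the same.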

Techopedia explains Extraction

In a data warehousing environment, a large volume of data from various structured and unstructured sources must be processed, transformed and stored before meaningful conclusions and predictions can be derived from it. Data from the primary sources must be imported into the data warehousing system in a systematic manner that makes later operations on the data easy to perform. This process is called extraction. Extraction adds structure to otherwise unstructured data by following certain rules. The following are some of the techniques used in data extraction:

  • Pattern matching
  • Table-based approach
  • Text analytics
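Of the techniques listed above, pattern matching is the simplest to sketch. The Python example below assumes a hypothetical free-text line format (`<date> order <id> total <amount>`) and uses a regular expression to turn matching lines into structured records, skipping lines that do not follow the rule. The line format, the `LINE_PATTERN` name and the `extract_orders` helper are illustrative assumptions only.

```python
import re
from typing import Dict, Iterable, List

# Assumed line format: "<YYYY-MM-DD> order <id> total <amount>"
LINE_PATTERN = re.compile(
    r"(?P<date>\d{4}-\d{2}-\d{2})\s+order\s+(?P<order_id>\d+)\s+total\s+(?P<amount>\d+(?:\.\d+)?)"
)


def extract_orders(lines: Iterable[str]) -> List[Dict[str, object]]:
    """Turn unstructured text lines into structured order records."""
    records = []
    for line in lines:
        match = LINE_PATTERN.search(line)
        if match is None:
            # The line does not follow the extraction rule, so skip it.
            continue
        records.append(
            {
                "date": match.group("date"),
                "order_id": int(match.group("order_id")),
                "amount": float(match.group("amount")),
            }
        )
    return records


raw_lines = [
    "2024-01-05 order 1001 total 59.90",
    "system restart, no order data",
    "2024-01-06 order 1002 total 120",
]
orders = extract_orders(raw_lines)
```

Here the second line is dropped because it does not match the pattern, while the other two become records with typed fields that could then be loaded into a warehouse table.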
