Techopedia Explains Data Lake
The data lake architecture is a store-everything approach to big data. Data are not classified when they are stored in the repository, because their value is not clear at the outset. As a result, upfront data preparation is eliminated: the data are kept in their raw form. A data lake is thus less structured than a conventional data warehouse. Only when the data are accessed are they classified, organized or analyzed.
Hadoop, an open-source framework for processing and analyzing big data, can be used to sift through the data in the repository.
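The idea of storing data unclassified and imposing structure only at access time can be illustrated with a minimal Python sketch. It is not a Hadoop example: a temporary directory stands in for the repository, and the function names (`store_raw`, `read_as_events`, `demo`) and the sample records are hypothetical, chosen only for illustration.

```python
import json
import tempfile
from pathlib import Path


def store_raw(lake_dir: Path, name: str, payload: str) -> Path:
    """Write the payload untouched; no schema or classification is applied."""
    path = lake_dir / name
    path.write_text(payload, encoding="utf-8")
    return path


def read_as_events(lake_dir: Path) -> list[dict]:
    """Apply structure only at access time, based on what the reader needs."""
    events = []
    for path in sorted(lake_dir.iterdir()):
        text = path.read_text(encoding="utf-8")
        if path.suffix == ".json":
            events.append(json.loads(text))
        elif path.suffix == ".csv":
            header, *rows = text.strip().splitlines()
            keys = header.split(",")
            events.extend(dict(zip(keys, row.split(","))) for row in rows)
        # Any other file stays in the lake, uninterpreted, until it is needed.
    return events


def demo() -> list[dict]:
    # A temporary directory plays the role of the repository (an assumption).
    with tempfile.TemporaryDirectory() as tmp:
        lake = Path(tmp)
        store_raw(lake, "a.json", '{"user": "ana", "action": "login"}')
        store_raw(lake, "b.csv", "user,action\nbo,logout\n")
        store_raw(lake, "c.log", "unparsed free text")
        return read_as_events(lake)


if __name__ == "__main__":
    print(demo())
```

Nothing is rejected or reshaped when it is written; the `.log` file is stored alongside the others even though this particular reader does not interpret it. A different reader could later apply a different structure to the same raw files.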