Gauthier Vasseur and I reconvened to discuss data and data lakes. These conversations are really starting to build on each other, and the more we talk about data, Big Data and its impact on all aspects of our lives, the more I learn. I am also getting a much better understanding of how dynamic the sector is and of its continued potential for massive growth. We are in the early stages of the influence of Big Data, for sure.
This episode was super informative, as usual, and we covered the following topics:
What is a data lake and how does it differ from a data warehouse?
“A data lake is a storage repository that holds a vast amount of raw data in its native format, including structured, semi-structured, and unstructured data. The data structure and requirements are not defined until the data is needed.” (Borrowed that from here.)
I wanted to know more about the differences between structured, semi-structured and unstructured data…Gauthier obliged.
I had heard about Hadoop and wanted to know why it was relevant to data lakes and why and how it was originally developed.
Finally, we discussed how data lakes are used to answer real-world business questions.
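To make the "structure is not defined until the data is needed" idea a bit more concrete, here is a minimal Python sketch. It is my own illustration, not something from the episode: the sample records, the `raw_lake` list and the `read_purchases` function are all hypothetical, and a real data lake would sit on distributed storage rather than in a Python list. The point is only that the raw items are kept exactly as they arrived, and a schema is applied at the moment a question is asked.

```python
import json

# Hypothetical "lake": every item is stored as-is, in its native format.
raw_lake = [
    '{"customer": "A17", "amount": "42.50", "note": "gift"}',  # semi-structured (JSON)
    'B02,19.99',                                                # structured (CSV-style row)
    'Customer C33 called to complain about a late delivery.',   # unstructured (free text)
]


def read_purchases(lake):
    """Apply a (customer, amount) schema only when the data is read."""
    purchases = []
    for item in lake:
        # First try to interpret the item as a JSON purchase record.
        try:
            record = json.loads(item)
            purchases.append((record["customer"], float(record["amount"])))
            continue
        except (KeyError, TypeError, ValueError):
            pass  # not a JSON purchase record; try the next interpretation
        # Then try to interpret it as a two-column "customer,amount" row.
        parts = item.split(",")
        if len(parts) == 2:
            try:
                purchases.append((parts[0], float(parts[1])))
            except ValueError:
                pass  # not a purchase row; the raw item stays in the lake untouched
    return purchases


# The business question ("how much was spent?") is asked at read time.
purchases = read_purchases(raw_lake)
total = sum(amount for _, amount in purchases)
```

The free-text complaint is simply skipped by this particular query, but it is still sitting in the lake for a different question later. A data warehouse would typically go the other way and require every record to fit a fixed schema before it is loaded.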
You should not miss Gauthier’s insights.