Introduction. It is the year 2020, no more time for large and expensive clusters. These days, a modern Data Lake, built in a Cloud environment, should use as much as possible Cloud Native, Serverless services, to get the full agility, elasticity, and efficiency provided by the Public Cloud Paradigm. In this…