Abstract: The efficiency of a Spark-based ETL system relies significantly on the optimal partitioning of numerous Delta Tables, taking into account the query patterns of end-users and dataset ...