Apache Iceberg – A Table Format for Huge Analytic Datasets

Apache Iceberg is a new format for tracking very large-scale tables, designed for object stores like S3. This talk covers why Netflix needed to build Iceberg, the project’s high-level design, and the details that unblock better query performance.

Ryan Blue works on open source data projects at Netflix. He is one of the original creators of Apache Iceberg, and is a committer in the Apache Spark, Parquet, and Avro communities.
