展开查看详情
1.Introduction to Modern Data Science Sam Kreter
2.Code and Slides github.com/samkreter/KubeconAsia2018
3. The Process Business Need / Problem Discovery
4. The Process Business Need / Problem Discovery Development
5. The Process Business Need / Problem Discovery Development Production / Actual User Impact
6. The Process Business Need / Problem Discovery Development Production / Actual User Impact
7.
8. The Process Business Need / Problem Discovery Development Production / Actual User Impact
9.The Data Science Pipeline
10.
11.
12.
13.
14.Pipelines
15.The Data Science Pipeline
16.Principles
17.1. Autonomy
18.I HAVE A VERY PARTICULAR SET OF SKILLS
19.1. Autonomy
20.
21.
22.Autonomy
23.Containerization 1. Single Operations per Container
24.Containerization 1. Single Operations per Container 2. Use Parameterize Data Flow • Data Inputs • Data Outputs
25.Distributing Workloads
26.2. Reproducibility
27.Data Versioning
28.Reproducibility For Developers
29.Reproducibility For Developers For the Team