Guillaume Eynard-Bontemps, Emmanuelle Sarrazin for ISAE-Supaero
Introduction to Big Data and its Ecosystem
Big Data Platforms, Hadoop and beyond
Spark Introduction and exercise
Play with MapReduce through Spark
Introduction to Cloud Computing
Includes first interaction with Google Cloud.
Includes Docker exercises.
Containers Orchestration, Kubernetes
Includes Kubernetes exercices.
Object Storage and Cloud Optimized datasets
Deploy Data processing platform on the Cloud
The rise of the Python ecosystem for Data Processing
Includes Pandas library tutorial, Xarray library tutorial
Includes Parallel tutorials
Includes Large dataset tutorials
Includes Dask tutorial.
If necessary, finish data processing platform deployment