Alan Chin - Explore and Extend AI Pipeline Runtimes with Elyra and JupyterLab | JupyterCon 2020

Published: 02 November 2020
on channel: JupyterCon
196
1

Brief Summary
Using Jupyter to build and train models is only part of the process in creating a data workflow. Managing environments, artifact handling and system resources are a few of the concepts in workflow creation. Elyra’s pipeline extension abstracts patterns in workflow development to provide a friendly interface while integrating with workflow orchestrators like Kubeflow Pipelines and Apache Airflow.

Outline
This presentation will detail how Elyra creates notebook based data pipelines with Jupyterlab, Papermill and Kubeflow Pipelines, all without having to leave your web browser. Pipeline construction typically involves an infrastructure team tasked with deploying and keeping the pipeline operational. These tasks can vary in granularity and include environmental setup (dependencies, learning frameworks, container images), artifact handling (datasets, file ingestion, intermediate files and archiving) and the assembly of the these parts into a pipeline. As the number of variations in the pipeline increase, so does the amount of work and time needed to set it up. The goal of using Elyra is to help alleviate this problem by surfacing concepts and patterns common in pipeline construction into a familiar interface and `self-serve’ model for Data Scientists and Engineers. We will demonstrate how Elyra can rapidly prototype data workflows without the need to know or write any pipeline code while still being able to take advantage of popular pipeline runtimes. We will look at how Elyra integrates with Kubeflow and Airflow, our experiences (good and bad) while developing this extension and our roadmap for the future. Attendees should have basic working knowledge of Jupyterlab and basic knowledge of Kubernetes Elyra - https://github.com/elyra-ai/elyra nteract Papermill - https://github.com/nteract/papermill Kubeflow Pipelines - https://github.com/kubeflow/pipelines

-----

JupyterCon brings together data scientists, business analysts, researchers, educators, developers, core Project contributors, and tool creators for in-depth training, insightful keynotes, networking, and practical talks exploring the Project Jupyter ecosystem.

https://jupytercon.com/

JupyterCon is possible thanks to the generous support of our sponsors, and the labor of many volunteer organizers.

https://jupytercon.com/sponsors/

https://jupytercon.com/about/#Organiz...


Watch video Alan Chin - Explore and Extend AI Pipeline Runtimes with Elyra and JupyterLab | JupyterCon 2020 online without registration, duration hours minute second in high quality. This video was added by user JupyterCon 02 November 2020, don't forget to share it with your friends and acquaintances, it has been viewed on our site 196 once and liked it 1 people.