In this Lecture Lab - GSP807 will teach how to use the Pipeline Studio in Cloud Data Fusion to build an ETL pipeline. Pipeline Studio exposes the building blocks and built-in plugins for you to build your batch pipeline, one node at a time. You will also use the Wrangler plugin to build and apply transformations to your data that goes through the pipeline.
The most common source of data for ETL applications is typically data stored in comma separated value (CSV) format text files, as many database systems export and import data in this fashion. For the purposes of this lab you’ll be using a CSV file, but the same techniques can be applied to DB sources as well as any other data source that you have available.
The output will be written to a BigQuery table and you’ll use standard SQL to perform data analytics against this target data set.
The tasks performed will be
1: Create a batch pipeline with Pipeline Studio in Cloud Data Fusion
2: Use Wrangler to interactively transform data
3: Write output into BigQuery
Please subscribe to our channel - and help us create more free content.
Смотрите видео ETL Pipeline for Beginners - No Code Lab | Google Data Fusion онлайн без регистрации, длительностью часов минут секунд в хорошем качестве. Это видео добавил пользователь SkillCurb 23 Июль 2022, не забудьте поделиться им ссылкой с друзьями и знакомыми, на нашем сайте его посмотрели 3,807 раз и оно понравилось 58 людям.