🤗 Hugging Cast S2E3 - Deploying LLMs on Google Cloud

Published: 26 April 2024
on channel: HuggingFace
1,879
59

Hugging Cast is a live show about building AI with open source.

In this episode, Philipp, Alvaro and Jeff demo 3 new ways to deploy open models on Google Cloud:
1️⃣ with Hugging Face Inference Endpoints
2️⃣ within Google Cloud Model Garden on Vertex AI or GKE
3️⃣ using TGI for TPU in our new library optimum-tpu


Watch video 🤗 Hugging Cast S2E3 - Deploying LLMs on Google Cloud online without registration, duration hours minute second in high quality. This video was added by user HuggingFace 26 April 2024, don't forget to share it with your friends and acquaintances, it has been viewed on our site 1,879 once and liked it 59 people.