ResNet50 ViT - Vision Transformer with ResNet50 Implementation in TensorFlow

Published: 01 January 1970
on channel: Idiot Developer
7,693
161

In this video, we are going to build a Hybrid Vision Transformer, where we combine the ResNet50 and the Vision Transformer to build the ResNet50 Vision Transformer (ResNet50 ViT). The ResNet50 ViT is implemented using the TensorFlow framework in the Keras API.

Timeline:
00:00 - Introduction
00:28 - What is Vision Transformer?
01:24 - ResNet50 + Vision Transformer explained.
03:08 - Import required libraries.
03:55 - ViT Base model configuration.
06:24 - Input layer.
08:25 - Loading pre-trained ResNet50.
10:12 - Patch embeddings.
12:41 - Position embeddings.
15:25 - Adding patch embeddings and position embeddings.
16:27 - Adding Class Token.
22:49 - Implementing Transformer Encoder.
25:44 - Implementing MLP (Multilayer Perceptron)
26:53 - Adding Transformer Encoder and MLP to the Vision Transformer
28:03 - Adding classification head.
30:10 - Executing the ResNet50 Vision Transformer (ResNet50 ViT)
31:15 - Ending -- SUBSCRIBE

Code: https://github.com/nikhilroxtomar/Vis...

Support:
   / @idiotdeveloper  
https://www.buymeacoffee.com/nikhilro...

Follow Me:
BLOG: https://idiotdeveloper.com https://sciencetonight.com
TELEGRAM: https://t.me/idiotdeveloper
FACEBOOK:   / idiotdeveloper  
TWITTER:   / nikhilroxtomar  
INSTAGRAM: https://instagram/nikhilroxtomar
PATREON:   / idiotdeveloper  


Watch video ResNet50 ViT - Vision Transformer with ResNet50 Implementation in TensorFlow online without registration, duration hours minute second in high quality. This video was added by user Idiot Developer 01 January 1970, don't forget to share it with your friends and acquaintances, it has been viewed on our site 7,693 once and liked it 161 people.