In this video we talk about how we can build LLMs whose weights can be represented by 1.58 bits and what are the advantages of doing so, by analyzing the paper "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits".
References
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
“The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits” paper: https://arxiv.org/abs/2402.17764
Related Videos
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
Why Language Models Hallucinate: • Why LLMs Hallucinate
Transformer Self-Attention Mechanism Explained: • Transformer Self-Attention Mechanism ...
Jailbroken: How Does LLM Safety Training Fail? - Paper Explained: • Jailbroken: How Does LLM Safety Train...
How to Fine-tune Large Language Models Like ChatGPT with Low-Rank Adaptation (LoRA): • Low-Rank Adaptation (LoRA) Explained
Multi-Head Attention (MHA), Multi-Query Attention (MQA), Grouped Query Attention (GQA) Explained: • Multi-Head Attention (MHA), Multi-Que...
LLM Prompt Engineering with Random Sampling: Temperature, Top-k, Top-p: • LLM Prompt Engineering with Random Sa...
Contents
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
00:00 - Abstract & Intro
02:43 - Figure 1 - Pareto solution to reduce inference cost
04:54 - BitNet 1.58 Explained
09:31 - Results
13:12 - Conclusion & Future Work
Follow Me
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
🐦 Twitter: @datamlistic / datamlistic
📸 Instagram: @datamlistic / datamlistic
📱 TikTok: @datamlistic / datamlistic
Channel Support
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
The best way to support the channel is to share the content. ;)
If you'd like to also support the channel financially, donating the price of a coffee is always warmly welcomed! (completely optional and voluntary)
► Patreon: / datamlistic
► Bitcoin (BTC): 3C6Pkzyb5CjAUYrJxmpCaaNPVRgRVxxyTq
► Ethereum (ETH): 0x9Ac4eB94386C3e02b96599C05B7a8C71773c9281
► Cardano (ADA): addr1v95rfxlslfzkvd8sr3exkh7st4qmgj4ywf5zcaxgqgdyunsj5juw5
► Tether (USDT): 0xeC261d9b2EE4B6997a6a424067af165BAA4afE1a
#llm #quantization #1bitllm
Смотрите видео The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits - Paper Explained онлайн без регистрации, длительностью часов минут секунд в хорошем качестве. Это видео добавил пользователь DataMListic 13 Март 2024, не забудьте поделиться им ссылкой с друзьями и знакомыми, на нашем сайте его посмотрели 2,509 раз и оно понравилось 72 людям.