Llama 3 degrades much more than Llama 2 when quantized 🤔 | New LLM Paper Finds out

Опубликовано: 12 Май 2024
на канале: Rohan-Paul-AI
512
20

Paper : https://arxiv.org/abs/2404.14047

🐦 Connect with me in TWITTER:   / rohanpaul_ai  

Llama 3 degrades much more than Llama 2 when quantized. 🤔

📌 Most possible reason because Llama 3, trained on a record 15T tokens, captures extremely nuanced data relationships, utilizing even the minutest decimals in BF16 precision fully. Making it more sensitive to quantization degradation.

📌 So sensitive that even the smallest decimal points of each parameter offered by BF16 precision were filled and had a purpose. Other LLMs were trained for far less (2T), and thus did not have time to saturate smaller precision ranges of the parameters like Llama-3 did, and thus are not affected by quantization as much.

----

Checkout the MASSIVELY UPGRADED 2nd Edition of my Book (with 1300+ pages of Dense Python Knowledge) 🐍🔥

Covering 350+ Python 🐍 Core concepts ( 1300+ pages ) 🚀

🟠 Book Link - https://rohanpaul.gumroad.com/l/pytho...

-----------------

Hi, I am a Machine Learning Engineer | Kaggle Master. Connect with me on 🐦 TWITTER:   / rohanpaul_ai   - for daily in-depth coverage of Large Language Model bits

----------------

You can find me here:

**********************************************

🐦 TWITTER:   / rohanpaul_ai  
👨🏻‍💼 LINKEDIN:   / rohan-paul-ai  
👨‍🔧 Kaggle: https://www.kaggle.com/paulrohan2020
👨‍💻 GITHUB: https://github.com/rohan-paul
🧑‍🦰 Facebook :   / rohan.paul.562  
📸 Instagram:   / rohan_paul_2020  


**********************************************


Other Playlist you might like 👇

🟠 MachineLearning & DeepLearning Concepts & interview Question Playlist - https://bit.ly/380eYDj

🟠 ComputerVision / DeepLearning Algorithms Implementation Playlist - https://bit.ly/36jEvpI

🟠 DataScience | MachineLearning Projects Implementation Playlist - https://bit.ly/39MEigt

🟠 Natural Language Processing Playlist : https://bit.ly/3P6r2CL

----------------------

#LLM #Largelanguagemodels #Llama3 #LLMfinetuning #opensource #NLP #ArtificialIntelligence #datascience #textprocessing #deeplearning #deeplearningai #100daysofmlcode #neuralnetworks #datascience #generativeai #generativemodels #OpenAI #GPT #GPT3 #GPT4 #chatgpt #genai


Смотрите видео Llama 3 degrades much more than Llama 2 when quantized 🤔 | New LLM Paper Finds out онлайн без регистрации, длительностью часов минут секунд в хорошем качестве. Это видео добавил пользователь Rohan-Paul-AI 12 Май 2024, не забудьте поделиться им ссылкой с друзьями и знакомыми, на нашем сайте его посмотрели 512 раз и оно понравилось 20 людям.