Fugaku-LLM: Japan's Supercomputer Trains Revolutionary Language Model for AI in Science and Social S

Published: 14 May 2024
on channel: AI Insight News

SUBSCRIBE CHANNEL: https://bit.ly/AIInsightNews
-----------------
Fugaku-LLM is a large language model with 13 billion parameters, developed in Japan and trained on the Fugaku supercomputer. It outperforms other models on tasks in the humanities and social sciences. The training data combined Japanese, English, mathematics, and code. The project's results are available on GitHub and Hugging Face for further development, with potential applications in AI for science and social simulation. The project was a collaboration between multiple institutions. Comments on the post discuss the use of CPUs rather than GPUs for training, comparisons with models such as GPT-4, the efficiency of the training process, and the challenges of training models on a decentralized architecture.

🔗 https://www.fujitsu.com/global/about/...

#AI #LanguageModel #GPT #GPT4 #LLM #OpenAI

