OpenAI o1 VS Sonnet 3.5 in Coding Physics Games - AI Showdown

Published: 13 September 2024
on channel: Eduards Ruzga
Views: 21,036
Likes: 562

In this video, we put two AI models to the test: OpenAI's new o1 and Sonnet 3.5 from Anthropic, accessed through Claude and WebSim. We challenge them to create a car parking simulator with realistic physics. Watch as we explore o1's advanced reasoning and see how it stacks up against Sonnet 3.5.

Links:
Failed Claude artifact:
https://claude.site/artifacts/3949a0f...

Failed WebSim:
https://websim.ai/c/euVVHmpCGkFyNxczp

ChatGPT o1 result:
https://codepen.io/wonderwhy-er/pen/q...
And here is the chat:
https://chatgpt.com/share/66e34a3d-13...


WebSim reusing and improving o1 result:
https://websim.ai/@wonderwhy_er/gta-2...

ChatGPT o1 failed 3D variant:
https://codepen.io/wonderwhy-er/pen/z...
Here is the chat for the 3D game:
https://chatgpt.com/share/66e42ded-c5...

Chapters:
0:00 - Introduction: How to judge o1 quality? By comparison with Sonnet 3.5!
0:32 - What is OpenAI o1?
01:54 - What will the challenge be? Parking simulator Sonnet 3.5 failed to build
02:57 - First Claude with Sonnet 3.5 and Artifacts, gray screen
04:11 - WebSim turn, crazy physics, and why it's hard
06:16 - What did ChatGPT with o1 do? Worked from the first try!
09:44 - But what if WebSim and o1 collaborate? Magic happens!
12:00 - Is the sky the limit? Far from it, let's push o1 to the breaking point
19:18 - Conclusions and what's next

#openai #openo1 #claudeai #websim #chatgpt #aicomparison #aiprogramming #artificialintelligence

