Large language models are rubbish at elementary level math

nodetechno7 juillet 18, 2024

“9.11 and 9.9, which one is bigger?” Questions as simple as this confuse large language models including OpenAI’s GPT-4o, Moonshot-created Kimi, and ByteDance’s Doubao, according to a post by local media Yicai. Chatbots from China’s Baidu and Tencent generate the correct answer despite using different methods, the former comparing fractional parts after concluding the integer parts are the same and the latter, Tencent’s Hunyuan, concluding that 9.9 is the bigger number by computing that 9.11 minus 9.9 is negative. ChatGPT and Kimi, which both gave a wrong answer to the first prompt, were correct after users clarified: “in terms of numerical value.” AI-powered chatbots are fed by internet data and trained to chat with humans in a natural way so that they can perform text-based knowledge-based tasks. [Yicai, in Chinese]

TechNode

Techno Node

Ticker

Large language models are rubbish at elementary level math

Enregistrer un commentaire

0 Commentaires

Subscribe Us

Popular Posts

Xiaomi updates progress on humanoid robots in auto factory, achieves 98% success rate in some tasks

SHEIN receives CSRC filing notice for Hong Kong listing

ByteDance denies plans to enter smart driving business

miHoYo launches AI companion app BSide: Olivia Lin on Steam Early Access

ByteDance introduces a new music app with feature that pays users to listen

ByteDance explores autonomous driving for unmanned logistics

Xiaomi opens reservations for SkyNomad SUV series

Seres Group expects H1 2026 net loss of $220 million–$270 million

Google reportedly beats Apple to TSMC’s 2nm chip debut with Tensor G6

Apple CEO Tim Cook and successor John Ternus meet POP MART founder Wang Ning at Apple Park

Random Posts

Recent in Sports

Popular Posts

Xiaomi updates progress on humanoid robots in auto factory, achieves 98% success rate in some tasks

SHEIN receives CSRC filing notice for Hong Kong listing

ByteDance denies plans to enter smart driving business

Footer Menu Widget

Ticker

Ad Code

Large language models are rubbish at elementary level math

Ces posts pourraient vous intéresser

Enregistrer un commentaire

0 Commentaires

Social Plugin

Subscribe Us

Popular Posts

Random Posts

Recent in Sports

Popular Posts

Footer Menu Widget