Arab Press

بالشعب و للشعب
Saturday, May 31, 2025

Students Beat ChatGPT At This Exam, Score 76%, Compared To Chatbot's 47%

Students Beat ChatGPT At This Exam, Score 76%, Compared To Chatbot's 47%

Despite this, they said that ChatGPT's performance was "impressive" and that it was a "game changer that will change the way everyone teaches and learns - for the better."
Researchers found students to have fared better at accounting exams than ChatGPT, OpenAI's chatbot product.

Despite this, they said that ChatGPT's performance was "impressive" and that it was a "game changer that will change the way everyone teaches and learns - for the better."

The researchers from Brigham Young University (BYU), US, and 186 other universities wanted to know how OpenAI's technology would fare on accounting exams. They have published their findings in the journal Issues in Accounting Education.

In the researchers' accounting exam, students scored an overall average of 76.7 per cent, compared to ChatGPT's score of 47.4 per cent.

While in 11.3 per cent of the questions, ChatGPT was found to score higher than the student average, doing particularly well on accounting information systems (AIS) and auditing, the AI bot was found to perform worse on tax, financial, and managerial assessments. Researchers think this could possibly be because ChatGPT struggled with the mathematical processes required for the latter type.

The AI bot, which uses machine learning to generate natural language text, was further found to do better on true/false questions (68.7 per cent correct) and multiple-choice questions (59.5 per cent), but struggled with short-answer questions (between 28.7 and 39.1 per cent).

In general, the researchers said that higher-order questions were harder for ChatGPT to answer. In fact, sometimes ChatGPT was found to provide authoritative written descriptions for incorrect answers, or answer the same question different ways.

They also found that ChatGPT often provided explanations for its answers, even if they were incorrect. Other times, it went on to select the wrong multiple-choice answer, despite providing accurate descriptions.

Researchers importantly noted that ChatGPT sometimes made up facts. For example, when providing a reference, it generated a real-looking reference that was completely fabricated. The work and sometimes the authors did not even exist.

The bot was seen to also make nonsensical mathematical errors such as adding two numbers in a subtraction problem, or dividing numbers incorrectly.

Wanting to add to the intense ongoing debate about how how models like ChatGPT should factor into education, lead study author David Wood, a BYU professor of accounting, decided to recruit as many professors as possible to see how the AI fared against actual university accounting students.

His co-author recruiting pitch on social media exploded: 327 co-authors from 186 educational institutions in 14 countries participated in the research, contributing 25,181 classroom accounting exam questions.

They also recruited undergraduate BYU students to feed another 2,268 textbook test bank questions to ChatGPT. The questions covered AIS, auditing, financial accounting, managerial accounting and tax, and varied in difficulty and type (true/false, multiple choice, short answer).
Newsletter

Related Articles

Arab Press
0:00
0:00
Close
Meta and Anduril Collaborate on AI-Driven Military Augmented Reality Systems
EU Central Bank Pushes to Replace US Dollar with Euro as World’s Main Currency
European and Arab Ministers Convene in Madrid to Address Gaza Conflict
Head of Gaza Aid Group Resigns Amid Humanitarian Concerns
U.S. Health Secretary Ends Select COVID-19 Vaccine Recommendations
Trump Warns Putin Is 'Playing with Fire' Amid Escalating Ukraine Conflict
India and Pakistan Engage Trump-Linked Lobbyists to Influence U.S. Policy
U.S. Halts New Student Visa Interviews Amid Enhanced Security Measures
Trump Administration Cancels $100 Million in Federal Contracts with Harvard
SpaceX Starship Test Flight Ends in Failure, Mars Mission Timeline Uncertain
King Charles Affirms Canadian Sovereignty Amid U.S. Statehood Pressure
Iranian Revolutionary Guard Founder Warns Against Trusting Regime in Nuclear Talks
Netanyahu Accuses Starmer of Siding with Hamas
Calls Grow to Resume Syrian Asylum Claims in UK
UAE Offers Free ChatGPT Plus Subscriptions to Citizens
Denmark Increases Retirement Age to 70, Setting a European Precedent
Iranian Director Jafar Panahi Wins Palme d'Or at Cannes
Israeli Airstrike Kills Nine Children of Gaza Doctor
Lebanon Initiates Plan to Disarm Palestinian Factions
Iran and U.S. Make Limited Progress in Nuclear Talks
Trump Administration's Tariff Policies and Dollar Strategy Spark Global Economic Debate
OpenAI Acquires Jony Ive’s Startup for $6.5 Billion to Build a Revolutionary “Third Core Device”
Turkey Weighs Citizens in Public as Erdoğan Launches National Slimming Campaign
UK Suspends Trade Talks with Israel Amid Gaza Offensive
Iran and U.S. Set for Fifth Round of Nuclear Talks Amid Rising Tensions
Russia Expands Military Presence Near Finland Amid Rising Tensions
Indian Scholar Arrested in Crackdown Over Pakistan Conflict Commentary
Israel Eases Gaza Blockade Amid Internal Dispute Over Military Strategy
President Biden’s announcement of advanced prostate cancer sparked public sympathy—but behind closed doors, Democrats are in panic
Mount Lewotobi Laki-Laki Erupts Again, Spewing Ash Cloud over Flores Island
Indian jet shootdown: the all-robot legion behind China’s PL-15E missiles
The Chinese Dragon: The True Winner in the India-Pakistan Clash
Australia's Venomous Creatures Contribute to Life-Saving Antivenom Programme
The Spanish Were Right: Long Working Hours Harm Brain Function
Did Former FBI Director Call for Violence Against Trump? Instagram Post Sparks Uproar
US and UAE Partner to Develop Massive AI Data Center Complex
Apple's $95 Million Siri Settlement: Eligible Users Have Until July 2 to File Claims
US and UAE Reach Preliminary Agreement on Nvidia AI Chip Imports
President Trump and Elon Musk Welcomed by Emir of Qatar Sheikh Tamim with Cybertruck Convoy
Strong Warning Issued: Do Not Use General Chatbots for Medical, Legal, or Educational Guidance
NVIDIA and Saudi Arabia Launch Strategic Partnership to Establish AI Centers
Trump Meets Syrian President Ahmad al-Shara in Historic Encounter
US and Saudi Arabia Sign Landmark Agreements Across Multiple Sectors
Why Saudi Arabia Rolled Out a Purple Carpet for Donald Trump Instead of Red
Elon Musk Joins Trump Meeting in Saudi Arabia
Trump says it would be 'stupid' not to accept gift of Qatari plane
Quantum Computing Threatens Bitcoin Security
Michael Jordan to Serve as Analyst for NBA Games
Senate Democrats Move to Censure Trump Over Qatar Jet Gift
Hamas Releases Last Living US Hostage from Gaza Amid Ongoing Conflict
×