PanamaTimes

Wednesday, Jul 09, 2025

Students Beat ChatGPT At This Exam, Score 76%, Compared To Chatbot's 47%

Students Beat ChatGPT At This Exam, Score 76%, Compared To Chatbot's 47%

Despite this, they said that ChatGPT's performance was "impressive" and that it was a "game changer that will change the way everyone teaches and learns - for the better."
Researchers found students to have fared better at accounting exams than ChatGPT, OpenAI's chatbot product.

Despite this, they said that ChatGPT's performance was "impressive" and that it was a "game changer that will change the way everyone teaches and learns - for the better."

The researchers from Brigham Young University (BYU), US, and 186 other universities wanted to know how OpenAI's technology would fare on accounting exams. They have published their findings in the journal Issues in Accounting Education.

In the researchers' accounting exam, students scored an overall average of 76.7 per cent, compared to ChatGPT's score of 47.4 per cent.

While in 11.3 per cent of the questions, ChatGPT was found to score higher than the student average, doing particularly well on accounting information systems (AIS) and auditing, the AI bot was found to perform worse on tax, financial, and managerial assessments. Researchers think this could possibly be because ChatGPT struggled with the mathematical processes required for the latter type.

The AI bot, which uses machine learning to generate natural language text, was further found to do better on true/false questions (68.7 per cent correct) and multiple-choice questions (59.5 per cent), but struggled with short-answer questions (between 28.7 and 39.1 per cent).

In general, the researchers said that higher-order questions were harder for ChatGPT to answer. In fact, sometimes ChatGPT was found to provide authoritative written descriptions for incorrect answers, or answer the same question different ways.

They also found that ChatGPT often provided explanations for its answers, even if they were incorrect. Other times, it went on to select the wrong multiple-choice answer, despite providing accurate descriptions.

Researchers importantly noted that ChatGPT sometimes made up facts. For example, when providing a reference, it generated a real-looking reference that was completely fabricated. The work and sometimes the authors did not even exist.

The bot was seen to also make nonsensical mathematical errors such as adding two numbers in a subtraction problem, or dividing numbers incorrectly.

Wanting to add to the intense ongoing debate about how how models like ChatGPT should factor into education, lead study author David Wood, a BYU professor of accounting, decided to recruit as many professors as possible to see how the AI fared against actual university accounting students.

His co-author recruiting pitch on social media exploded: 327 co-authors from 186 educational institutions in 14 countries participated in the research, contributing 25,181 classroom accounting exam questions.

They also recruited undergraduate BYU students to feed another 2,268 textbook test bank questions to ChatGPT. The questions covered AIS, auditing, financial accounting, managerial accounting and tax, and varied in difficulty and type (true/false, multiple choice, short answer).
Newsletter

Related Articles

PanamaTimes
0:00
0:00
Close
U.S. Enacts Sweeping Tax and Spending Legislation Amid Trade Policy Shifts
AI Raises Alarms Over Long-Term Job Security
House Oversight Committee Subpoenas Former Jill Biden Aide Amid Investigation into Alleged Concealment of President Biden's Cognitive Health
OpenAI Secures Multimillion-Dollar AI Contracts with Pentagon, India, and Grab
Brazilian Congress Rejects Lula's Proposed Tax Increase on Financial Transactions
Landslide in Bello, Colombia, Results in Multiple Casualties
Papa Johns pizza surge near the Pentagon tipped off social media before Trump's decisive Iran strike
Juncker Criticizes EU Inaction on Trump Tariffs
Minnesota Lawmaker Melissa Hortman and Husband Killed in Targeted Attack; Senator John Hoffman and Wife Injured
Wreck of $17 Billion San José Galleon Identified Off Colombia After 300 Years
Sole Survivor of Air India Crash Recounts Escape
Coinbase CEO Warns Bitcoin Could Supplant US Dollar Amid Mounting National Debt
UK and EU Reach Agreement on Gibraltar's Schengen Integration
Israeli Finance Minister Imposes Banking Penalties on Palestinians
U.S. Inflation Rises to 2.4% in May Amid Trade Tensions
Trump's Policies Prompt Decline in Chinese Student Enrollment in U.S.
Global Oceans Near Record Temperatures as CO₂ Levels Climb
Trump Announces U.S.-China Trade Deal Covering Rare Earths
Smuggled U.S. Fuel Funds Mexican Cartels Amid Crackdown
Protests Erupt in Los Angeles with Symbolic Flag Burning
Trump Administration Issues New Travel Ban Targeting 12 Countries
Man Group Mandates Full-Time Office Return for Quantitative Analysts
JPMorgan Warns Analysts Against Accepting Future-Dated Job Offers
Builder.ai Faces Legal Scrutiny Amid Financial Misreporting Allegations
Japan Grapples with Rice Shortage Amid Soaring Prices
Goldman Sachs Reduces Risk Exposure Amid Market Volatility
HSBC Chairman Mark Tucker to Return to AIA as Non-Executive Chair
Israel Confirms Arming Gaza Clan to Counter Hamas Influence
Judge Blocks Trump's Ban on International Students at Harvard
Trump Proposes Travel Ban on 'Uncontrolled' Countries
Panama Port Owner Balances US-China Pressures
Trump Administration Accused of Obstructing Deportation Cases
Trump’s China Strategy Remains a Geopolitical Puzzle
Eurozone Inflation Falls Below ECB Target to 1.9%
Call for a New Chapter in Globalisation Emerges
Blackstone and Rivals Diverge on Private Equity Strategy
Mayor’s Security Officer Implicated | Shocking New Details Emerge in NYC Kidnapping Case
Bangkok Ranked World's Top City for Remote Work in 2025
Denmark Increases Retirement Age to 70, Setting a European Precedent
Netanyahu Accuses Western Leaders of 'Emboldening Hamas'
Escalating Trade Tensions and Market Reactions
OnlyFans Reportedly in Talks for $8 Billion Sale
JBS Gains Shareholder Approval for U.S. Stock Listing
Booz Allen Hamilton to Cut 2,500 Jobs Amid Federal Spending Reductions
Trump Signs Executive Orders to Accelerate Nuclear Energy Development
Harvard Temporarily Blocks Trump Administration's International Student Ban
Nippon Steel Forms Partnership with U.S. Steel, Headquarters to Remain in Pittsburgh
Trump Expands Tariff Threats to Apple and Samsung Devices
Oracle and OpenAI Plan $40 Billion Nvidia Chip Purchase for AI Data Center
Trump Threatens 50% Tariff on EU Goods, Markets React
×