PanamaTimes

Friday, Jul 26, 2024

Students Beat ChatGPT At This Exam, Score 76%, Compared To Chatbot's 47%

Researchers found that students fared better on accounting exams than ChatGPT, OpenAI's chatbot product.

Despite this, they said that ChatGPT's performance was "impressive" and that it was a "game changer that will change the way everyone teaches and learns - for the better."

The researchers, from Brigham Young University (BYU) in the US and other institutions, 186 in all, wanted to know how OpenAI's technology would fare on accounting exams. They have published their findings in the journal Issues in Accounting Education.

In the researchers' accounting exam, students scored an overall average of 76.7 per cent, compared to ChatGPT's score of 47.4 per cent.

ChatGPT scored higher than the student average on 11.3 per cent of the questions, doing particularly well on accounting information systems (AIS) and auditing, but it performed worse on tax, financial, and managerial assessments. The researchers think this may be because ChatGPT struggled with the mathematical processes that those assessments require.

The AI bot, which uses machine learning to generate natural language text, did better on true/false questions (68.7 per cent correct) and multiple-choice questions (59.5 per cent), but struggled with short-answer questions (between 28.7 and 39.1 per cent).

In general, the researchers said that higher-order questions were harder for ChatGPT to answer. In fact, ChatGPT sometimes provided authoritative written descriptions for incorrect answers, or answered the same question in different ways.

They also found that ChatGPT often provided explanations for its answers, even if they were incorrect. Other times, it went on to select the wrong multiple-choice answer, despite providing accurate descriptions.

Researchers importantly noted that ChatGPT sometimes made up facts. For example, when providing a reference, it generated a real-looking reference that was completely fabricated. The work and sometimes the authors did not even exist.

The bot also made nonsensical mathematical errors, such as adding two numbers in a subtraction problem or dividing numbers incorrectly.

Wanting to add to the intense ongoing debate about how models like ChatGPT should factor into education, lead study author David Wood, a BYU professor of accounting, decided to recruit as many professors as possible to see how the AI fared against actual university accounting students.

His co-author recruiting pitch on social media exploded: 327 co-authors from 186 educational institutions in 14 countries participated in the research, contributing 25,181 classroom accounting exam questions.

They also recruited undergraduate BYU students to feed another 2,268 textbook test bank questions to ChatGPT. The questions covered AIS, auditing, financial accounting, managerial accounting and tax, and varied in difficulty and type (true/false, multiple choice, short answer).