uniladtech homepage
  • News
    • Tech News
    • AI
  • Gadgets
    • Apple
    • iPhone
  • Gaming
    • Playstation
    • Xbox
  • Science
    • News
    • Space
  • Streaming
    • Netflix
  • Vehicles
    • Car News
  • Social Media
    • WhatsApp
    • YouTube
  • Advertise
  • Terms
  • Privacy & Cookies
  • LADbible Group
  • LADbible
  • UNILAD
  • SPORTbible
  • GAMINGbible
  • Tyla
  • FOODbible
  • License Our Content
  • About Us & Contact
  • Jobs
  • Latest
  • Archive
  • Topics A-Z
  • Authors
Facebook
Instagram
X
TikTok
Snapchat
WhatsApp
Submit Your Content
AI researchers create 'humanity's last exam' to probe true limits of machine intelligence

Home> News> AI

Published 10:48 10 Mar 2026 GMT

AI researchers create 'humanity's last exam' to probe true limits of machine intelligence

The test is able to benchmark the progress of AI bots

Rikki Loftus

Rikki Loftus

google discoverFollow us on Google Discover
Featured Image Credit: Vithun Khamsong/Getty Images
AI
News
Tech News
Robots
Science
Social Media
Reddit

Advert

Advert

Advert

AI researchers are working on creating what they call ‘humanity’s last exam’ in order to probe the true limits of machine intelligence.

The tech industry has exploded with AI advancements in recent years and it doesn’t appear to be slowing down anytime soon.

In fact, a team of researchers are now working on a test that is able to benchmark the progress of AI bots.

The research team published a paper in the Association for Computing Machinery, where they explained: “Participants in our experiment were no better than chance at identifying GPT-4 after a five minute conversation, suggesting that current AI systems are capable of deceiving people into believing that they are human.

Advert

“The results here likely set a lower bound on the potential for deception in more naturalistic contexts where, unlike the experimental setting, people may not be alert to the possibility of deception or exclusively focused on detecting it.”

The research team published a paper in the Association for Computing Machinery (alexsl/Getty Images)
The research team published a paper in the Association for Computing Machinery (alexsl/Getty Images)

The paper continued: “Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities.

“However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities.”

A study into the test for AI systems was published in Nature, which detailed how it is essential for there to be precise measurements for AI capabilities as these systems ‘approach human expert performance in many domains’.

This will assist with informing research, governance and the broader public on the improvements AI is making.

The study continued: “To establish a common reference point for assessing these capabilities, we publicly release a large number of 2,500 questions from HLE to enable this precise measurement, while maintaining a private test set to assess potential model overfitting.”

The test is able to benchmark the progress of AI bots (Vithun Khamsong/Getty Images)
The test is able to benchmark the progress of AI bots (Vithun Khamsong/Getty Images)

Many people have taken to social media to share their own reactions to the research, with one user writing on Reddit: “Scientists created an exam so broad, challenging and deeply rooted in expert human knowledge that current AI systems consistently fail it. “Humanity’s Last Exam” introduces 2,500 questions spanning mathematics, humanities, natural sciences, ancient languages and highly specialized subfields.”

This prompted many to reply, with another saying: “I fail that exam too. Most people do too since you can only be an expert in a few fields.”

A third commented: “I would contend that if these questions are all ever answered correctly, you know it is an AI, because no single human could have that broad of a knowledge base.”

And a fourth user added: “Well you can already see that the advanced AI versions made huge gains. Matter of time before they ace the test.”

Choose your content:

18 mins ago
an hour ago
3 hours ago
  • Andrew Harnik/Getty Images
    18 mins ago

    X users speculate meaning behind golden bell gifted to Trump from King Charles that only Brits will understand

    King Charles spent this past week visiting with Trump at the White House

    News
  • Warner Bros. TV / Contributor via Getty
    an hour ago

    TikTok star Noah Beck's mom suspended from job over explicit video with son resurfacing

    It comes shortly after his sister was terminated from her job for supposedly 'grooming' a student

    News
  • Bloomberg / Contributor via Getty
    an hour ago

    Bryan Johnson makes 'unhinged' post revealing his partner's vaginal data with intimate tweet about sex life

    The biohacker has previously shared a detailed 11-step sex routine

    Science
  • Andrew Harnik/Getty Images
    3 hours ago

    Trump makes savage comment about NASA chief's appearance in bizarre clip

    NASA chief Jared Isaacman was ridiculed by President Trump

    News
  • Elon Musk reveals what life could look like after AI takes over jobs from humans
  • Monzo founder reveals two jobs that will seem like a 'joke' in a matter of years thanks to AI
  • Elon Musk bans resumes and cover letters as he searches for employees to build an AI 'brain' in space
  • 'Godmother of AI' predicts the next milestone of artificial intelligence that will send shockwaves through the world