...

Logo Pasino du Havre - Casino-Hôtel - Spa
in partnership with
Logo Nextory

New AI models are more likely to give a wrong answer than admit they don't know

Business • Oct 1, 2024, 5:30 AM
3 min de lecture
1

Newer large language models (LLMs) are less likely to admit they don’t know an answer to a user’s question making them less reliable, according to a new study. 

Artificial intelligence (AI) researchers from the Universitat Politècnica de València in Spain tested the latest versions of BigScience’s BLOOM, Meta’s Llama, and OpenAI's GPT for accuracy by asking each model thousands of questions on maths, science, and geography. 

Researchers compared the quality of the answers of each model and classified them into correct, incorrect, or avoidant answers.

The study, which was published in the journal Nature, found that accuracy on more challenging problems improved with each new model. Still, they tended to be less transparent about whether they could answer a question correctly. 

The earlier LLM models would say they could not find the answers or needed more information to come to an answer, but new models were more likely to guess and produce incorrect responses even to easy questions.  

'No apparent improvement' in solving basic problems

LLMs are deep learning algorithms that use AI to understand, predict, and generate new content based on data sets. 

While the new models could solve more complex problems with more accuracy, the LLMs in the study still made some mistakes when answering basic questions.

"Full reliability is not even achieved at very low difficulty levels," according to the research paper.

"Although the models can solve highly challenging instances, they also still fail at very simple ones".

This is the case with OpenAI’s GPT-4, where the number of "avoidant" answers significantly dropped off from its previous model, GPT-3.5. 

“This does not match the expectation that more recent LLMs would more successfully avoid answering outside their operating range,” the study authors said. 

Researchers concluded then that there's "no apparent improvement" for the models even though the technology has been scaled up. 


Yesterday

Eurozone jobless figure remains stable, according to latest data
Business • 4:45 PM
2 min
The eurozone unemployment rate remained stable at 6.4% throughout June, July and August, and down 0.2% on August last year. Greece, Spain and Sweden have the highest rates of unemployment.
Read the article
Bright light therapy works for about 40% of depression patients, analysis shows
Business • 4:08 PM
3 min
Bright light therapy is a promising early treatment for patients with non-seasonal depression, a new analysis found.
Read the article
Spain should have been punished for lax budget, EU advisors say
Business • 3:10 PM
3 min
The European Commission failed to correctly follow EU law when it forgave Madrid for its high deficit earlier this year, a legal panel of fiscal advisors has said.
Read the article
AI is making cyberattacks more sophisticated and cybersecurity teams are struggling to keep up
Business • 3:03 PM
5 min
A new report found that more than half of the cybersecurity teams said that they were underfunded.
Read the article
Architects build eco-friendly houses with straw and clay amid scorching heatwaves
Business • 3:00 PM
4 min
In Bulgaria, some architects and companies are turning to eco-friendly building materials.
Read the article
'Do we want fewer emissions or more Netflix?': Inside the fight against Europe's data centres
Business • 11:00 AM
10 min
Pockets of data centre activists are fighting back against the expansion of mega computer centres in Europe amid an artificial intelligence (AI) boom.
Read the article
Coca-Cola loses its fizz with plan to axe hundreds of jobs in Germany
Business • 10:59 AM
3 min
Coca-Cola is to shut down five production and logistics sites in Germany, in an attempt to cut costs and adapt to changing logistics trends.
Read the article
LVMH sells Off-White: Is this the end for Virgil Abloh's brand?
Business • 10:53 AM
7 min
Off-White, the Virgil Abloh brand that was bought out by LVMH in 2021 at the height of its popularity now faces cultural irrelevancy. Why?
Read the article
CERN at 70: The cradle of the Higgs boson and World Wide Web looks to the future
Business • 10:06 AM
6 min
Started in 1954, the 7,000 scientists at the European Centre for Nuclear Research (CERN) are focused on the innovations and discoveries of the future.
Read the article
ASML shares plunge amid regulatory headwinds and valuation concerns
Business • 8:53 AM
4 min
Shares of Dutch chip equipment maker ASML are among the third quarter's worst performers in European markets, due to regulatory hurdles and valuation concerns.
Read the article
The Iron Dome: How does Israel’s missile defence system work?
Business • 8:38 AM
8 min
In operation since 2011, the Iron Dome is Israel's first line of defence against rockets. We spoke to an expert to understand how the system works.
Read the article
COVID was paradigm shift for health policymaking, says Commissioner Stella Kyriakides
Business • 6:05 AM
3 min
There’s no turning back from the approach to EU health policymaking developed during the COVID pandemic – and it's why it should remain high on the agenda, the health commissioner said in an interview for the Euronews Health Summit.
Read the article
What were tech commissioner-designate Virkkunen’s policy concerns as an MEP?
Business • 6:00 AM
5 min
Data shows that incoming EU tech commissioner Henna Virkkunen showed growing interest in tech dossiers such as audiovisual laws during her ten years as an MEP.
Read the article
German supermarkets take the fight to British competitors
Business • 5:05 AM
5 min
High investment and low prices appear to be disrupting the established order in the UK supermarket sector.
Read the article
Nike holds off guidance and investor day to allow new CEO time to find his feet
Business • 12:16 AM
5 min
Nike is holding off giving guidance and has postponed investor day as it tries to give incoming CEO Elliott Hill a chance to review current strategies and plan future ones.
Read the article
Hedi Slimane quits as Celine creative director
Business • 12:10 AM
4 min
After seven years as creative director of French luxury fashion house Celine, Hedi Slimane is moving on.
Read the article
TikTok, YouTube and Snapchat’s video recommendations probed by European Commission
Business • 12:09 AM
2 min
The EU executive has started an investigation into social media network practices, given fears that vulnerable people are being fed fake news and content promoting self-harm.
Read the article