Four major artificial intelligence (AI) chatbots are inaccurately summarising news stories, according to research carried out by the BBC.
The BBC gave OpenAI’s ChatGPT, Microsoft’s Copilot, Google’s Gemini and Perplexity AI content from the BBC website, then asked them questions about the news.
It said the resulting answers contained “significant inaccuracies” and distortions.
In a blog, Deborah Turness, the CEO of BBC News and Current Affairs, said AI brought “endless opportunities” but the companies developing the tools were “playing with fire”.
“We live in troubled times, and how long will it be before an AI-distorted headline causes significant real world harm?”, she asked.
An OpenAI spokesperson said: “We support publishers and creators by helping 300 million weekly ChatGPT users discover quality content through summaries, quotes, clear links, and attribution.”
The other tech companies which own the chatbots have been approached for comment.
‘Pull back’
In the study, the BBC asked ChatGPT, Copilot, Gemini and Perplexity to summarise 100 news stories and rated each answer.
It got journalists who were relevant experts in the subject of the article to rate the quality of answers from the AI assistants.
It found 51% of all AI answers to questions about the news were judged to have significant issues of some form.
Additionally, 19% of AI answers which cited BBC content introduced factual errors, such as incorrect factual statements, numbers and dates.
In her blog, Ms Turness said the BBC was seeking to “open up a new conversation with AI tech providers” so we can “work together in partnership to find solutions”.
She called on the tech companies to “pull back” their AI news summaries, as Apple did after complaints from the BBC that Apple Intelligence was misrepresenting news stories.
Some examples of inaccuracies found by the BBC included:
- Gemini incorrectly said the NHS did not recommend vaping as an aid to quit smoking
- ChatGPT and Copilot said Rishi Sunak and Nicola Sturgeon were still in office even after they had left
- Perplexity misquoted BBC News in a story about the Middle East, saying Iran initially showed “restraint” and described Israel’s actions as “aggressive”
In general, Microsoft’s Copilot and Google’s Gemini had more significant issues than OpenAI’s ChatGPT and Perplexity, which counts Jeff Bezos as one of its investors.
Normally, the BBC blocks its content from AI chatbots, but it opened its website up for the duration of the tests in December 2024.
The report said that as well as containing factual inaccuracies, the chatbots “struggled to differentiate between opinion and fact, editorialised, and often failed to include essential context”.
The BBC’s Programme Director for Generative AI, Pete Archer, said publishers “should have control over whether and how their content is used and AI companies should show how assistants process news including the scale and scope of errors and inaccuracies they produce”.
An OpenAI spokesperson told BBC News: “We’ve collaborated with partners to improve in-line citation accuracy and respect publisher preferences, including enabling how they appear in search by managing OAI-SearchBot in their robots.txt. We’ll keep enhancing search results.”
Robots.txt is a plain-text file placed at the root of a website which tells automated crawlers, such as search bots, which parts of the site they should not access.
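As a rough illustration of the mechanism the spokesperson describes, a publisher wanting to keep OpenAI's search crawler (which identifies itself as OAI-SearchBot) away from its site could add rules like the following to its robots.txt. This is a generic sketch, not the BBC's actual configuration, and the paths shown are hypothetical:

```text
# Block OpenAI's search crawler from the whole site
User-agent: OAI-SearchBot
Disallow: /

# Allow all other crawlers everywhere except a hypothetical private area
User-agent: *
Disallow: /private/
```

Compliance is voluntary: robots.txt is a convention that well-behaved crawlers honour, not a technical barrier, which is one reason publishers such as the BBC also press AI companies directly over how their content is used.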