AI chatbots unable to accurately summarise news, BBC finds

TechnologyFebruary 11, 20253 min read

AI chatbots unable to accurately summarise news, BBC finds

AI chatbots unable to accurately summarise news, BBC finds

AI chatbots unable to accurately summarise news, BBC finds

Reading Level

A recent investigation by the BBC has revealed that four leading artificial intelligence chatbots are struggling to accurately summarize news articles. The chatbots in question are OpenAI's ChatGPT, Microsoft's Copilot, Google's Gemini, and Perplexity. The BBC provided these AI systems with content from its own website and subsequently posed questions regarding the news. The findings indicated that the responses generated by these chatbots contained numerous inaccuracies and misleading information. Deborah Turness, the CEO of BBC News and Current Affairs, emphasized that while AI technology presents 'endless opportunities', the companies developing these tools are 'playing with fire'. She raised concerns about the potential consequences of AI-generated headlines that could lead to significant harm in the real world. The BBC has reached out to the companies behind these chatbots for their comments on the study's results. In the course of the investigation, the BBC tasked the chatbots with summarizing 100 different news stories and evaluated the quality of their responses. Journalists with expertise in the relevant subjects were enlisted to assess the answers provided by the AI assistants. The results were alarming, revealing that 51% of all AI-generated responses contained significant issues. Furthermore, 19% of the answers that referenced BBC content included factual inaccuracies, such as incorrect statements, erroneous numbers, and wrong dates. In her blog, Ms. Turness expressed the BBC's desire to initiate a dialogue with AI technology providers to collaborate on finding solutions to these pressing issues. She urged tech companies to 'pull back' their AI news summaries, particularly in light of instances where Apple Intelligence misrepresented news stories. Some specific examples of inaccuracies identified by the BBC included: Gemini incorrectly asserting that the NHS does not recommend vaping as a method to quit smoking. ChatGPT and Copilot mistakenly claimed that Rishi Sunak and Nicola Sturgeon were still in office, despite the fact that they had already stepped down. Perplexity misquoted BBC News in a report about the Middle East, stating that Iran initially exhibited 'restraint' and labeled Israel's actions as 'aggressive'. Overall, Microsoft's Copilot and Google's Gemini were found to have more significant issues compared to OpenAI's ChatGPT and Perplexity, which counts Jeff Bezos among its investors. Typically, the BBC restricts its content from being accessed by AI chatbots, but it made an exception for this study in December 2024. The report highlighted that, in addition to factual inaccuracies, the chatbots struggled to distinguish between opinion and fact, often editorializing and failing to provide essential context. Pete Archer, the BBC's Programme Director for Generative AI, stated that publishers should maintain control over how their content is utilized and that AI companies must demonstrate how their assistants process news, along with the extent and nature of the errors and inaccuracies they produce.

About VocabSphere

AI-Powered English Learning Platform

Innovative Platform

VocabSphere is an innovative English learning platform that provides adaptive articles tailored to different proficiency levels. Our AI-powered system helps learners improve their vocabulary, reading comprehension, and language skills through engaging, real-world content.

Learning Benefits

By reading articles like this one, learners can expand their vocabulary, improve reading speed, and gain confidence in understanding complex English texts. Each article is carefully curated and adapted to provide the optimal learning experience for students at every level.

AI-PoweredPersonalized LearningReal-time NewsMulti-level Difficulty

Difficult Words

summarizingmisleadingexpressedfactualmisrepresentedrestraintransparentcontext

Good Sentences

"The results showed that 51% of the answers from the AI had major problems."

Why

This is a sample explanation that demonstrates why this sentence is considered good for English learning...

Login to view

"Deborah Turness, who is the CEO of BBC News, mentioned that while AI has many exciting possibilities, the companies creating these tools need to be careful."

Why

This is a sample explanation that demonstrates why this sentence is considered good for English learning...

Login to view

Download Mobile App

Only our iOS and Android apps give you full access to VocabSphere features like Forgetting Curve Vocab Book, Exercise Generation, and Personal Learning Progress Monitoring.

Download now for the complete learning experience!

Discover VocabSphere's Powerful Features

Enhance your English learning experience

Personalized Reading

Customized articles and news to match students' English proficiency levels. Get instant word translations, synonyms. Expand vocabulary effortlessly.

Vocabulary Usage

VocabSphere uses the forgetting curve principle to help you memorize words efficiently. Master every word comprehensively. Your personalized vocabulary library, available anytime, anywhere.

Exercise Generation

Create custom grammar exercises from your vocabulary library. Practice different parts of speech and sentence patterns. Teachers can also generate reading comprehension quizzes and exercises.

Back to News