Why generative AI can be very convincing when it is wrong

We often worry about AI being wrong. In practice, the bigger risk is when it is wrong but sounds right. In this article, Marta Dobrowolska, Head of Data Science and Knowledge Management at Incotec, discusses this phenomenon.

What are AI hallucinations?

One of the biggest challenges with generative AI (GenAI) is not that it sometimes makes mistakes. It is that those mistakes are often presented in a way that sounds polished, plausible, and confident. These systems are already being used widely for drafting, summarizing, translating, brainstorming, and supporting day-to-day work, which makes this issue increasingly relevant in practice.
That is what people usually mean by hallucinations: moments when an AI system produces an answer that reads well but is false, unsupported, or simply invented. The European Commission’s Joint Research Centre (JRC) has already highlighted quality issues in generative AI outputs, including hallucinations, bias, and over-reliance, and has stressed the need for safeguards and human oversight.

Why do generative AI systems hallucinate?

Generative AI is designed to create new text, images, code, audio, or video that resembles the data on which it was trained. In technical terms, these models learn patterns and probability distributions in data and then generate new samples from them. That is what makes them so useful and versatile. It is also why they can produce fluent output without any built-in guarantee that a specific claim is correct.
So, when a model gives a wrong answer, it is usually not “inventing” in the human sense. It is doing what it was built to do: generating the output that best fits the patterns it has learned from previous data. If the question is ambiguous, the context is thin, or the model does not have access to the right sources, it may fill the gap with something that sounds right rather than something that is right. Seen that way, a hallucination is less like deliberate deception and more like statistical prediction outrunning verification.
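
To make that idea concrete, here is a deliberately tiny sketch of a "model" that picks a continuation purely from learned probabilities. Everything in it is an assumption made for illustration: the table NEXT_WORD_PROBS, its numbers, and the function names are invented, and real systems are vastly larger and more sophisticated. The point is only that nothing in the generation step checks whether the output is true.

```python
import random

# Hypothetical toy "model": for one context, a probability distribution over
# possible next words. The numbers are invented for illustration; they stand
# in for patterns a real model would learn from its training data.
NEXT_WORD_PROBS = {
    "The capital of Australia is": {
        "Sydney": 0.55,    # mentioned most often alongside "Australia"
        "Canberra": 0.40,  # the correct answer
        "Melbourne": 0.05,
    },
}


def most_likely(context):
    """Return the highest-probability continuation (greedy choice)."""
    probs = NEXT_WORD_PROBS[context]
    return max(probs, key=probs.get)


def sample(context, rng):
    """Draw one continuation at random, weighted by its probability."""
    probs = NEXT_WORD_PROBS[context]
    words = list(probs)
    weights = [probs[word] for word in words]
    return rng.choices(words, weights=weights, k=1)[0]


context = "The capital of Australia is"
# Fluent and confident, but wrong: the capital is Canberra.
print(context, most_likely(context))
```

In this toy setup the most probable answer is the wrong one, and the code returns it without hesitation. Neither function contains a verification step, which is the gap that grounding and human review are meant to fill.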

Why are hallucinations a risk?

Because fluency creates false confidence. A weak answer written in broken language is easy to challenge. A wrong answer written in a calm, authoritative tone is much harder to spot. That is why hallucinations are not a small technical flaw; they are a real reliability and governance issue. The EU AI Act reflects exactly that concern by placing emphasis on accuracy, robustness, transparency, and human oversight for higher-risk uses of AI.

Real-world examples of AI errors

There are already examples of this in serious professional settings. In 2025, Deloitte Australia admitted that a government-contracted assurance review of the country’s Targeted Compliance Framework contained fabricated references and quotes generated with Azure OpenAI GPT-4o, and it agreed to provide a partial refund. The problem was not that the report looked unprofessional. It was that some of the evidence behind it was invented. That is exactly what makes hallucinations difficult: the error can sit inside otherwise credible-looking work. In this case, the issues were spotted by a human reviewer, Australian welfare academic Chris Rudge.
In healthcare and pharma-related work, the stakes are even higher. A study published in BMJ Quality & Safety*, a peer-reviewed journal focused on healthcare quality and patient safety, examined AI-powered chatbot answers to patient questions about commonly prescribed drugs. While many answers were broadly accurate, experts judged 66% of a subset of inaccurate answers to be potentially harmful, and 22% potentially severe or even life-threatening if followed. In regulated, evidence-heavy environments, “mostly right” is simply not a strong enough standard.

That is why this matters for sectors such as pharma, medical affairs, and R&D. If an AI tool generates a fabricated reference, misstates evidence, or gives a confident but incorrect interpretation, the issue is not just poor wording. It becomes a scientific credibility problem, a compliance risk, and potentially a patient safety issue. More broadly, the JRC has already warned that generative AI raises cross-cutting risks around misinformation, trust, and the quality of decision-making when people rely on it too quickly.

Should we stop using generative AI?

The good news is that this is not a reason to stop using generative AI. It is a reason to use it more deliberately. Where facts matter, outputs should be grounded in trusted source material rather than generated from model memory alone. Just as importantly, organisations need to keep people accountable for the final output. The most useful question is often the simplest one: What is the source for this? If that question cannot be answered clearly, the output should not be treated as evidence. That is particularly true in customer communication, legal work, scientific writing, regulatory material, and decision support.
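
For teams that want to build the "What is the source for this?" question into a workflow, the rule can be sketched in a few lines. This is a minimal illustration under stated assumptions, not a recommended implementation: the Claim class, the TRUSTED_SOURCES identifiers, and the status messages are all hypothetical, and a real process would also need a person to confirm that the cited source actually says what the claim says.

```python
from dataclasses import dataclass, field

# Hypothetical identifiers for documents the organisation has vetted.
TRUSTED_SOURCES = frozenset({"internal-sop-001", "product-label-2024"})


@dataclass
class Claim:
    """One factual statement taken from an AI-generated draft."""
    text: str
    sources: list = field(default_factory=list)  # cited source identifiers


def review_status(claim, trusted=TRUSTED_SOURCES):
    """Apply the rule: no clear source means the claim is not evidence."""
    if not claim.sources:
        return "no source: do not treat as evidence"
    unknown = [s for s in claim.sources if s not in trusted]
    if unknown:
        return "unverified source: check before use"
    # Even a trusted citation only earns a human look, not automatic approval.
    return "sourced: ready for human review"


draft = [
    Claim("Storage temperature is described in the SOP.", ["internal-sop-001"]),
    Claim("A 2023 trial showed a large improvement."),
    Claim("Dosage follows an external guideline.", ["some-blog-post"]),
]

for claim in draft:
    print(review_status(claim), "|", claim.text)
```

The design choice worth noting is that the best outcome here is still "ready for human review". A check like this can filter out unsupported claims, but accountability for the final output stays with a person.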

Still, this remains an extraordinary tool. Generative AI has significant potential to improve innovation and productivity across industries. But it is useful to apply a bit of critical thinking to its outputs and treat it a little like your overconfident colleague: often helpful, often impressively fast, occasionally brilliant, but not someone you would quote blindly without checking the source first.

How to use GenAI responsibly

That is why we need to strike the right balance. Generative AI is powerful and useful, but if we want to use it responsibly, we need to be honest about one thing: it can be very convincing when it is wrong.

PS. I would double-check the reference if I were you.

*Andrikyan W, Sametinger SM, Kosfeld F, et al. Artificial intelligence-powered chatbots in search engines: a cross-sectional study on the quality and risks of drug information for patients. BMJ Quality & Safety 2025;34:100-109.
Text edited with the help of M365 Copilot GPT 5.4

Published by

Marta Dobrowolska-Haywood Head of Data Science and Knowledge Management at Incotec
