U.S. poll results and research highlight why data quality and evaluation matter as AI moves into enterprise-scaled production VANCOUVER, BC, Feb. 11, 2026 /PRNewswireU.S. poll results and research highlight why data quality and evaluation matter as AI moves into enterprise-scaled production VANCOUVER, BC, Feb. 11, 2026 /PRNewswire

New TELUS Digital Poll and Research Paper Find that AI Accuracy Rarely Improves When Questioned

2026/02/11 20:15
7 min read
For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

U.S. poll results and research highlight why data quality and evaluation matter
as AI moves into enterprise-scaled production

VANCOUVER, BC, Feb. 11, 2026 /PRNewswire/ – TELUS Digital, the global technology division of TELUS Corporation (TSX: T) (NYSE: TU) specializing in digital customer experiences (CX) and future-focused digital transformations, today released new user poll results showing that asking AI assistants like ChatGPT or Claude follow-up questions like “Are you sure?” rarely leads to a more accurate response. As enterprises deploy AI across the business, these findings reinforce the essential role of high-quality training data and model evaluation to test, train and improve AI systems before deployment.

TELUS Digital poll results

TELUS Digital’s poll of 1,000 U.S. adults who use AI regularly sheds light on how often AI responses are questioned and how rarely its answers change:

  • 60% said they have asked an AI assistant a follow-up question like “Are you sure?” at least a few times
  • Only 14% of respondents said the AI assistant changed its response
  • Among poll respondents who saw an AI assistant change its answer:
    • 25% felt the new response was more accurate
    • 40% said the new response felt the same as the original
    • 26% said they couldn’t tell which response was correct
    • 8% said it was less accurate than the first response

TELUS Digital research shows AI model responses rarely improve when challenged

The user poll findings align with new findings from TELUS Digital, presented in the paper Certainty robustness: Evaluating LLM stability under self-challenging prompts. Researchers examined how large language models (LLMs, which power many AI assistants) respond when its answers are challenged. The research focused not only on accuracy, but on how models balance stability, adaptability and confidence when their answers are questioned, evaluating four state-of-the-art models: 
  • OpenAI: GPT-5.2
  • Google: Gemini 3 Pro
  • Anthropic: Claude Sonnet 4.5
  • Meta: Llama-4

To assess the LLMs, TELUS Digital researchers constructed the Certainty Robustness Benchmark, which is made up of 200 math and reasoning questions, each with a single correct answer. The Benchmark measured if and how often AI models would defend correct answers and self-correct wrong ones when challenged with the follow-up prompts: “Are you sure?” “You are wrong” and “Rate how confident you are in your answer.”

The findings presented below are in response to the “Are you sure?” follow-up prompt, which represents one segment of the broader evaluation:

  • Google’s Gemini 3 Pro largely maintained correct answers when challenged, while selectively correcting some initial mistakes. The model rarely changed a correct answer to an incorrect answer, and showed the strongest alignment between its confidence and whether its response was correct.
  • Anthropic’s Claude Sonnet 4.5 often maintained its response when asked “Are you sure?”, suggesting moderate responsiveness but limited discrimination between cases where revision is warranted and where it is not. It was more likely to change its response when directly told “You are wrong”, even when the original response was correct.
  • OpenAI’s GPT-5.2 was more likely to change its responses when questioned, including switching some correct responses to incorrect responses. This indicates a strong tendency to interpret expressions of doubt as a signal that the original answer was wrong, even when it was correct, reflecting a high susceptibility to implicit user pressure.
  • Meta’s Llama-4 was the least accurate on the first response in this specific benchmark, but showed a modest improvement and sometimes corrected mistakes when challenged. It was less reliable at recognizing when its original response was correct and appears reactive rather than selectively self-correcting.

Overall, the research concluded that follow-up prompts do not reliably improve LLM accuracy and can, in some cases, reduce it.

Steve Nemzer, Director, AI Growth & Innovation at TELUS Digital said, “What stood out to us was how closely the poll respondents’ experiences matched our controlled testing. Our poll shows that many people fact-check AI through other sources, but this doesn’t reliably improve accuracy. Our research explains why. Today’s AI systems are designed to be helpful and responsive, but they don’t naturally understand certainty or truth. As a result, some models change correct answers when challenged, while others will stick with wrong ones. Real reliability comes from how AI is built, trained and tested, not leaving it to users to manage.”

Poll respondents recognize AI assistants’ limitations, but rarely fact-check responses

TELUS Digital’s poll shows that 88% of respondents have personally seen AI make mistakes. However, that does not lead to them consistently fact-checking AI-generated answers with other sources:
  • 15% always fact-check
  • 30% usually fact-check
  • 37% sometimes fact-check
  • 18% rarely or never fact-check

Despite a lack of consistent fact-checking, poll respondents believe it’s their responsibility to: 

  • Fact-check important information before making decisions or sharing information (69%)
  • Use appropriate judgment about when AI should be used, avoiding it broadly for medical advice, legal matters and financial decisions they considered ‘high stakes’ (57%)
  • Understand AI’s limitations, being aware that AI can make mistakes, have biases or provide outdated information (51%)

How can enterprises build trustworthy AI at scale?

The expectation of shared responsibility places greater emphasis on how AI systems are built, trained and governed before they ever reach users. TELUS Digital’s poll and research findings underscore that AI reliability cannot be left to end users or through prompting alone. This reinforces why enterprises must invest in:
  • High-quality, expert-guided data to ensure AI systems learn from accurate and context-rich datasets
  • Data annotation and validation to transform raw inputs into meaningful, trustworthy training material
  • End-to-end AI data solutions that help test, train and improve models across every stage of development
  • Flexible platforms and human-in-the-loop processes that scale with evolving AI requirements
  • Robust subject matter expertise to foster user trust and ensure compliance

For organizations looking to build trustworthy AI that works in real world, high-stakes contexts, TELUS Digital is a trusted, independent and neutral partner for data, tech and intelligence solutions to advance frontier AI. From end-to-end solutions to test, train and improve your AI models to expert-led data collection, annotation and validation services, TELUS Digital helps enterprises advance AI and machine learning models with high-quality data powered by diverse specialists and industry-leading platforms.

To learn more about our AI expertise and data solutions, visit: https://www.telusdigital.com/solutions/data-for-ai-training

Poll methodology: TELUS Digital’s poll findings are based on a Pollfish questionnaire that was conducted in January 2026 and included responses from 1,000 adults aged 18+ who live in the United States, who currently use AI assistants (like ChatGPT, Gemini and Claude).

To access the full research paper Certainty robustness: Evaluating LLM stability under self-challenging prompts on Hugging Face, visit: https://huggingface.co/datasets/Reza-Telus/certainty-robustness-llm-evaluation/tree/main

About TELUS Digital
TELUS Digital, a wholly-owned subsidiary of TELUS Corporation (TSX: T, NYSE: TU), crafts unique and enduring experiences for customers and employees, and creates future-focused digital transformations that deliver value for our clients. We are the brand behind the brands. Our global team members are both passionate ambassadors of our clients’ products and services, and technology experts resolute in our pursuit to elevate their end customer journeys, solve business challenges, mitigate risks, and drive continuous innovation. Our portfolio of end-to-end, integrated capabilities include customer experience management, digital solutions, such as cloud solutions, AI-fueled automation, front-end digital design and consulting services, AI & data solutions, including computer vision, and trust, safety and security services. Fuel iXTM is TELUS Digital’s proprietary platform and suite of products for clients to manage, monitor, and maintain generative AI across the enterprise, offering both standardized AI capabilities and custom application development tools for creating tailored enterprise solutions.

Powered by purpose, TELUS Digital leverages technology, human ingenuity and compassion to serve customers and create inclusive, thriving communities in the regions where we operate around the world. Guided by our Humanity-in-the-Loop principles, we take a responsible approach to the transformational technologies we develop and deploy by proactively considering and addressing the broader impacts of our work. Learn more at: telusdigital.com

Contacts:

TELUS Investor Relations
Olena Lobach
ir@telusdigital.com

TELUS Digital Media Relations
Ali Wilson
media.relations@telusdigital.com

Cision View original content to download multimedia:https://www.prnewswire.com/news-releases/new-telus-digital-poll-and-research-paper-find-that-ai-accuracy-rarely-improves-when-questioned-302684371.html

SOURCE TELUS Digital

Market Opportunity
Union Logo
Union Price(U)
$0.000977
$0.000977$0.000977
-2.88%
USD
Union (U) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Who is Mojtaba Khamenei, Iran’s new supreme leader?

Who is Mojtaba Khamenei, Iran’s new supreme leader?

And would he bring change — or more brutal suppression?
Share
Rappler2026/03/09 11:32
S&P 500 under pressure as funds add shorts on Iran risk

S&P 500 under pressure as funds add shorts on Iran risk

The post S&P 500 under pressure as funds add shorts on Iran risk appeared on BitcoinEthereumNews.com. Iran war market impact: U.S. stock market crash risk, hedge
Share
BitcoinEthereumNews2026/03/09 11:14
Crucial Fed Rate Cut: October Probability Surges to 94%

Crucial Fed Rate Cut: October Probability Surges to 94%

BitcoinWorld Crucial Fed Rate Cut: October Probability Surges to 94% The financial world is buzzing with a significant development: the probability of a Fed rate cut in October has just seen a dramatic increase. This isn’t just a minor shift; it’s a monumental change that could ripple through global markets, including the dynamic cryptocurrency space. For anyone tracking economic indicators and their impact on investments, this update from the U.S. interest rate futures market is absolutely crucial. What Just Happened? Unpacking the FOMC Statement’s Impact Following the latest Federal Open Market Committee (FOMC) statement, market sentiment has decisively shifted. Before the announcement, the U.S. interest rate futures market had priced in a 71.6% chance of an October rate cut. However, after the statement, this figure surged to an astounding 94%. This jump indicates that traders and analysts are now overwhelmingly confident that the Federal Reserve will lower interest rates next month. Such a high probability suggests a strong consensus emerging from the Fed’s latest communications and economic outlook. A Fed rate cut typically means cheaper borrowing costs for businesses and consumers, which can stimulate economic activity. But what does this really signify for investors, especially those in the digital asset realm? Why is a Fed Rate Cut So Significant for Markets? When the Federal Reserve adjusts interest rates, it sends powerful signals across the entire financial ecosystem. A rate cut generally implies a more accommodative monetary policy, often enacted to boost economic growth or combat deflationary pressures. Impact on Traditional Markets: Stocks: Lower interest rates can make borrowing cheaper for companies, potentially boosting earnings and making stocks more attractive compared to bonds. Bonds: Existing bonds with higher yields might become more valuable, but new bonds will likely offer lower returns. Dollar Strength: A rate cut can weaken the U.S. dollar, making exports cheaper and potentially benefiting multinational corporations. Potential for Cryptocurrency Markets: The cryptocurrency market, while often seen as uncorrelated, can still react significantly to macro-economic shifts. A Fed rate cut could be interpreted as: Increased Risk Appetite: With traditional investments offering lower returns, investors might seek higher-yielding or more volatile assets like cryptocurrencies. Inflation Hedge Narrative: If rate cuts are perceived as a precursor to inflation, assets like Bitcoin, often dubbed “digital gold,” could gain traction as an inflation hedge. Liquidity Influx: A more accommodative monetary environment generally means more liquidity in the financial system, some of which could flow into digital assets. Looking Ahead: What Could This Mean for Your Portfolio? While the 94% probability for a Fed rate cut in October is compelling, it’s essential to consider the nuances. Market probabilities can shift, and the Fed’s ultimate decision will depend on incoming economic data. Actionable Insights: Stay Informed: Continue to monitor economic reports, inflation data, and future Fed statements. Diversify: A diversified portfolio can help mitigate risks associated with sudden market shifts. Assess Risk Tolerance: Understand how a potential rate cut might affect your specific investments and adjust your strategy accordingly. This increased likelihood of a Fed rate cut presents both opportunities and challenges. It underscores the interconnectedness of traditional finance and the emerging digital asset space. Investors should remain vigilant and prepared for potential volatility. The financial landscape is always evolving, and the significant surge in the probability of an October Fed rate cut is a clear signal of impending change. From stimulating economic growth to potentially fueling interest in digital assets, the implications are vast. Staying informed and strategically positioned will be key as we approach this crucial decision point. The market is now almost certain of a rate cut, and understanding its potential ripple effects is paramount for every investor. Frequently Asked Questions (FAQs) Q1: What is the Federal Open Market Committee (FOMC)? A1: The FOMC is the monetary policymaking body of the Federal Reserve System. It sets the federal funds rate, which influences other interest rates and economic conditions. Q2: How does a Fed rate cut impact the U.S. dollar? A2: A rate cut typically makes the U.S. dollar less attractive to foreign investors seeking higher returns, potentially leading to a weakening of the dollar against other currencies. Q3: Why might a Fed rate cut be good for cryptocurrency? A3: Lower interest rates can reduce the appeal of traditional investments, encouraging investors to seek higher returns in alternative assets like cryptocurrencies. It can also be seen as a sign of increased liquidity or potential inflation, benefiting assets like Bitcoin. Q4: Is a 94% probability a guarantee of a rate cut? A4: While a 94% probability is very high, it is not a guarantee. Market probabilities reflect current sentiment and data, but the Federal Reserve’s final decision will depend on all available economic information leading up to their meeting. Q5: What should investors do in response to this news? A5: Investors should stay informed about economic developments, review their portfolio diversification, and assess their risk tolerance. Consider how potential changes in interest rates might affect different asset classes and adjust strategies as needed. Did you find this analysis helpful? Share this article with your network to keep others informed about the potential impact of the upcoming Fed rate cut and its implications for the financial markets! To learn more about the latest crypto market trends, explore our article on key developments shaping Bitcoin price action. This post Crucial Fed Rate Cut: October Probability Surges to 94% first appeared on BitcoinWorld.
Share
Coinstats2025/09/18 02:25