How accurate is ChatGPT? Should you trust its responses? (2024)

How accurate is ChatGPT? Should you trust its responses? (1)

Edgar Cervantes / Android Authority

Modern chatbots like ChatGPT can output dozens of words every second, making them invaluable tools for researching and analyzing large amounts of information. With over 500GB of training data and an estimated 300 billion words under its belt, the AI language model can answer many factual questions too. But as human as ChatGPT’s responses may sound, one crucial question remains: how accurate is the information it provides?

While ChatGPT can be impressively informative most of the time, you’ve probably heard of countless controversies surrounding generative AI. From racial biases to harmful content, there’s a history of controversies to consider before trusting any AI-generated output.

Is ChatGPT accurate?

How accurate is ChatGPT? Should you trust its responses? (2)

Robert Triggs / Android Authority

Yes, ChatGPT has the potential to be accurate, especially for factual queries with clear answers. When talking about long-established information, ChatGPT can fetch relevant data from its training and deliver truthful responses. For a question like “What is the capital of France?”, you’re very likely to get the correct answer.

However, chatbots like ChatGPT often fabricate information when they encounter a novel or difficult question. This is because generative language models are designed to mimic the way humans write, not the way we think. Consequently, they have limited logical reasoning capabilities.

ChatGPT hallucinates less often than a year ago, but you still need to watch out.

The problem with ChatGPT’s accuracy runs deeper than you’d think. It often weaves in entirely fictional details and invents convincing-sounding factoids in response to certain prompts. The chatbot’s creator has placed several safeguards to prevent hallucinations, but as our tests will show later in this article, it isn’t completely effective.

If you’re after empirical data, several studies have tested ChatGPT’s accuracy extensively to reveal one clear trend. ChatGPT boasts a surprisingly high accuracy rating for typical questions. In one medical study, for example, the chatbot scored a median rating of 5.5 on a 6-point scale.

However, ChatGPT’s tendency to receive routine updates can also harm its accuracy and usefulness. Another group of UC Berkeley and Stanford University researchers found that the chatbot’s ability to identify prime numbers dropped from an impressive 84% accuracy to just 51% within three months. In short, you cannot and should not trust ChatGPT’s responses, at least not without fact-checking them first.

Having said that, updates to ChatGPT’s underlying language model have greatly improved its accuracy over the past year. The GPT-4o update, for instance, allows the chatbot to search the internet for information and cross-check its responses against external sources. However, free users only get limited access to GPT-4o, which means ChatGPT will fall back to the older and less accurate GPT-3.5 model during periods of high demand.

How to improve ChatGPT’s accuracy

How accurate is ChatGPT? Should you trust its responses? (3)

Calvin Wankhede / Android Authority

If you’re only an occasional ChatGPT user, you may have never considered upgrading to the chatbot’s paid tier. However, doing so will improve its accuracy several-fold and should top your priority list if you rely on the chatbot’s responses. This is because the $20 ChatGPT Plus subscription unlocks guaranteed access to the GPT-4o language model. As I mentioned earlier, GPT-4o is far more accurate and it can also search the internet for the latest information. You can think of it as live research since it’s similar to how we find the right answer through a Google search.

ChatGPT-4 delivers much more accurate results, but still falls behind some human experts.

The GPT-4 language model was the first to release this generation. Even that was far more capable than its predecessor, GPT-3.5. According to OpenAI, the newer model scored in the 89th percentile of SAT Math, 90th percentile of the Uniform Bar Exam, and 80th percentile of the GRE Quantitative. Almost all of these results are significantly better than that of GPT-3.5.

If you’re a free ChatGPT user, you can check if your chats use GPT-4o by looking for the model dropdown menu at the bottom of each response, as pictured below.

ChatGPT 4 accuracy tested: Free vs Plus compared

How accurate is ChatGPT? Should you trust its responses? (4)

Calvin Wankhede / Android Authority

As I mentioned earlier, ChatGPT can deliver significantly more accurate responses with a GPT-4 based model. I asked the chatbot a handful of factual questions, some particularly obscure, to test whether or not I could get a reliably accurate answer. On both, the free tier and ChatGPT Plus, you can switch between GPT-3.5 and GPT-4o.

  • Question 1: Is 17077 a prime number? Think step by step and then answer [Yes] or [No].

A recent ChatGPT update added chain-of-thought reasoning to the chatbot, allowing it to mimic human reasoning. That seems to have paid off, as both versions of ChatGPT were able to correctly identify a prime number. However, the paid version of the chatbot wrote a piece of custom Python code to perform the calculations. While it didn’t improve the result, I did feel that the answer was more trustworthy.

  • Question 2: Does the Setouchi Area Pass cover any local transport in Osaka?

With many of us using ChatGPT for travel advice, I decided to ask a relatively obscure question in that domain. Unfortunately, the base GPT-3.5 model responded inaccurately and only admitted fault when I suggested the correct answer. However, switching to ChatGPT-4 changed the outcome, immediately giving me the correct answer. Still, can the chatbot replace manual research entirely? I’m on the fence, especially since rival chatbots like Perplexity AI cite their sources.

  • Question 3: Select two random integers between 2459 and 3593 and multiply them

Asking a mathematical question will almost always trip up ChatGPT, and that’s exactly what happened with GPT-3.5 or the base version of the chatbot. It delivered a plausible-sounding response (2865×3035 = 8,697,975), but it was actually quite far off from the true answer (8,695,275). ChatGPT-4 used Python code once again to find the right answer, but chances are that it would’ve failed without outside help too.

In summary, remember that ChatGPT will almost always try to deliver a solution to your problem or question without caring much about its accuracy. It will only sometimes admit that it cannot answer a question or doesn’t know enough about the subject matter. Otherwise, it can just as easily hallucinate information without any obvious indication.

You might like

    Guides

    AIChatGPT

    How accurate is ChatGPT? Should you trust its responses? (2024)

    FAQs

    How accurate is ChatGPT? Should you trust its responses? ›

    Getting accurate answers is not a guarantee with ChatGPT, you will likely come across great sounding answers at the expense of factual accuracy. Misinterpretations or inconsistencies in the data can lead the model to exaggerate misleading conclusions.

    How accurate are ChatGPT results? ›

    This study, published in August 2023, tested ChatGPT in a variety of clinical situations. It had to make similar decisions to human healthcare professionals. Overall, its responses were 72% accurate. ChatGPT performed best at making final diagnoses, achieving 77% accuracy.

    Does ChatGPT give accurate answers? ›

    ChatGPT is not infallible; it can occasionally produce incorrect or misleading responses due to the vast amount of data it has learned from, which may contain inaccuracies or biases. However, OpenAI has implemented safety mechanisms and fine-tuning procedures to mitigate these issues and improve ChatGPT's accuracy.

    Can you trust ChatGPT information? ›

    ChatGPT is safe if you don't share sensitive data.

    OpenAI, the company behind ChatGPT, implements many security measures to help keep ChatGPT a safe and (mostly) accurate tool to use. Still, there are risks to using ChatGPT, and it's important to understand them fully before trusting the AI chatbot with your data.

    Should we trust on ChatGPT? ›

    The responses are well-organized. All these features can make it seem more trustworthy. But what users often don't realize is that ChatGPT gives generic, generally applicable answers. Jung: Participants said they trust the platforms for different reasons.

    Why is ChatGPT not credible? ›

    No, ChatGPT is not a credible source of factual information and can't be cited for this purpose in academic writing. While it tries to provide accurate answers, it often gets things wrong because its responses are based on patterns, not facts and data.

    Is ChatGPT truthful? ›

    ChatGPT tries to give truthful answers to any questions you ask it, and it typically does a good job. It never lies on purpose. But it doesn't always provide accurate information. This is because its responses are based on patterns it has seen in the text that it was trained on.

    Why is my ChatGPT always wrong? ›

    ChatGPT is trained on a mix of licensed data, data created by human trainers, and vast amounts of text from the internet. This means that while it has a broad knowledge base, it's also susceptible to the biases and inaccuracies present in that data.

    How often does ChatGPT get things wrong? ›

    DAYTON, Ohio (WDTN) — A recent study from Purdue University found that ChatGPT, a popular AI app, presents wrong answers 52% of the time. The findings follow researchers asking the program questions from a popular computer programmer website. Incorrect answers were delivered to more than half the questions.

    Do professors know when you use ChatGPT? ›

    Can Schools Detect ChatGPT? The answer is yes, again. Like individual teachers, schools at a broader level may have access to AI detection tools to detect ChatGPT-generated content (such as Turnitin or Undetectable AI).

    How much can I trust ChatGPT? ›

    7 Discussion. This exploratory study found that users do not trust ChatGPT as much as Google and Wikipedia. The most mentioned reason for the lack of trust is the absence of information referencing.

    What not to say to ChatGPT? ›

    Personal Information

    Personal information is the first thing you should never ask Chat GPT for. Personal information is any information that can be used to locate or identify a specific person, such as name, address, social security number, etc.

    Should I give ChatGPT my name? ›

    Should I use my real name on ChatGPT? You should avoid sharing any private information while interacting with ChatGPT. Consider using a pseudonym or removing your name from the queries.

    Are there any dangers to using ChatGPT? ›

    Even if the information is correct, it may display a political or other type of bias. As with any machine learning model, ChatGPT reflects the biases of its training data. If that data is biased, then its outputs may also be biased – with the potential for unfair, discriminatory, or even offensive responses.

    Is it safe to give ChatGPT your email? ›

    Is ChatGPT safe to use? While there are ChatGPT privacy concerns and examples of ChatGPT malware scams, the game-changing chatbot has many built-in guardrails and is seen as generally safe to use.

    Can I rely on ChatGPT for studying? ›

    Exam preparation can be a daunting task, but with the help of AI tools like ChatGPT, it can become a lot more efficient. With the capabilities to generate practice questions, provide detailed explanations, and devise test-taking strategies, ChatGPT can help you ace your next exam more smartly.

    Is ChatGPT a reliable source? ›

    Conclusions. The ChatGPT platform offers accurate and scientifically backed answers to inquiries about third-molar surgical extraction, making it a dependable and easy-to-use resource for both patients and the general public. However, the platform should provide references with the responses to validate the information ...

    How often does ChatGPT give wrong answers? ›

    Answer: 52 percent of the time.

    If you're turning to ChatGPT to help you with computer programming, you may want to be extra careful to double-check its answers. A new study has found that 52 percent of the popular chatbot's answers to computer programming questions contain inaccurate information.

    Is ChatGPT medically accurate? ›

    As an artificial intelligence (AI) language model, ChatGPT can be a valuable source of information on health-related topics. However, ChatGPT's responses are based on the information it has been trained on and may not always be up-to-date or fully accurate.

    How accurate is the diagnosis of ChatGPT? ›

    ChatGPT 4.0 correctly identified the diagnosis in 47 out of 63 matched case report vignettes (74.6% accuracy) compared to 54 out of 63 in the corresponding standardized sample question vignettes on the same diseases (85.7% accuracy) (Table 2).

    Top Articles
    What Does TS Mean in Movies and What’s the Quality of TS Films?
    MOVIE QUALITY EXPLAINED - WHAT DVDRip/ R5/ DVDSCR/ TC/ TS/ MEANS?
    Is Sam's Club Plus worth it? What to know about the premium warehouse membership before you sign up
    Metra Union Pacific West Schedule
    The UPS Store | Ship & Print Here > 400 West Broadway
    How To Get Free Credits On Smartjailmail
    Youtube Combe
    Ohiohealth Esource Employee Login
    Brenna Percy Reddit
    Syracuse Jr High Home Page
    Tnt Forum Activeboard
    Icommerce Agent
    Morristown Daily Record Obituary
    bode - Bode frequency response of dynamic system
    Ubg98.Github.io Unblocked
    Georgetown 10 Day Weather
    Vegito Clothes Xenoverse 2
    Ups Drop Off Newton Ks
    Air Traffic Control Coolmathgames
    Baja Boats For Sale On Craigslist
    Uncovering The Mystery Behind Crazyjamjam Fanfix Leaked
    48 Oz Equals How Many Quarts
    Kirsten Hatfield Crime Junkie
    Gunsmoke Tv Series Wiki
    Jamielizzz Leaked
    Select The Best Reagents For The Reaction Below.
    The Menu Showtimes Near Amc Classic Pekin 14
    Craigslist Free Stuff San Gabriel Valley
    Human Unitec International Inc (HMNU) Stock Price History Chart & Technical Analysis Graph - TipRanks.com
    Adecco Check Stubs
    Craigslist Com Humboldt
    Texas Baseball Officially Releases 2023 Schedule
    Kelsey Mcewen Photos
    American Bully Xxl Black Panther
    Wal-Mart 2516 Directory
    Bianca Belair: Age, Husband, Height & More To Know
    Bones And All Showtimes Near Johnstown Movieplex
    Hometown Pizza Sheridan Menu
    Yogu Cheshire
    Anguilla Forum Tripadvisor
    Wasmo Link Telegram
    Mudfin Village Wow
    Big Reactors Best Coolant
    BCLJ July 19 2019 HTML Shawn Day Andrea Day Butler Pa Divorce
    Craigslist Pet Phoenix
    The top 10 takeaways from the Harris-Trump presidential debate
    Oak Hill, Blue Owl Lead Record Finastra Private Credit Loan
    What Is The Gcf Of 44J5K4 And 121J2K6
    Palmyra Authentic Mediterranean Cuisine مطعم أبو سمرة
    Generator für Fantasie-Ortsnamen: Finden Sie den perfekten Namen
    Https://Eaxcis.allstate.com
    Latest Posts
    Article information

    Author: Rueben Jacobs

    Last Updated:

    Views: 6273

    Rating: 4.7 / 5 (77 voted)

    Reviews: 92% of readers found this page helpful

    Author information

    Name: Rueben Jacobs

    Birthday: 1999-03-14

    Address: 951 Caterina Walk, Schambergerside, CA 67667-0896

    Phone: +6881806848632

    Job: Internal Education Planner

    Hobby: Candle making, Cabaret, Poi, Gambling, Rock climbing, Wood carving, Computer programming

    Introduction: My name is Rueben Jacobs, I am a cooperative, beautiful, kind, comfortable, glamorous, open, magnificent person who loves writing and wants to share my knowledge and understanding with you.