Verbal nonsense reveals limitations of AI chatbots

by Simon Osuji
September 15, 2023
in Artificial Intelligence
Different AI language models can make different judgments about whether sentences are meaningful or nonsense. Credit: Columbia University’s Zuckerman Institute

The era of artificial-intelligence chatbots that seem to understand and use language the way we humans do has begun. Under the hood, these chatbots use large language models, a particular kind of neural network. But a new study shows that large language models remain vulnerable to mistaking nonsense for natural language. To a team of researchers at Columbia University, it’s a flaw that might point toward ways to improve chatbot performance and help reveal how humans process language.

In a paper published online in Nature Machine Intelligence, the scientists describe how they challenged nine different language models with hundreds of pairs of sentences. For each pair, people who participated in the study picked which of the two sentences they thought was more natural, meaning that it was more likely to be read or heard in everyday life. The researchers then tested the models to see if they would rate each sentence pair the same way the humans had.
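The comparison described above can be pictured as an agreement rate: for each pair, check whether the sentence a model scores higher is the one people picked. The short Python sketch below illustrates the idea only. The function name, the scoring callback, and the example inputs are hypothetical and are not taken from the paper.

```python
from typing import Callable, Sequence, Tuple


def agreement_rate(
    pairs: Sequence[Tuple[str, str]],
    human_choice: Sequence[int],
    score: Callable[[str], float],
) -> float:
    """Return the fraction of pairs where the model agrees with people.

    pairs        -- sentence pairs shown to participants
    human_choice -- 0 if people preferred the first sentence, 1 for the second
    score        -- hypothetical model scorer; higher means "more natural"
    """
    if len(pairs) != len(human_choice):
        raise ValueError("pairs and human_choice must have the same length")
    if not pairs:
        raise ValueError("at least one sentence pair is required")

    matches = 0
    for (first, second), picked in zip(pairs, human_choice):
        # The model "picks" whichever sentence it scores higher.
        model_pick = 0 if score(first) >= score(second) else 1
        matches += int(model_pick == picked)
    return matches / len(pairs)
```

Any scorer can be plugged in, for example a function that returns a sentence's log-probability under a language model. A rate near 1.0 would mean the model nearly always sides with the human participants.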

In head-to-head tests, more sophisticated AIs based on what researchers refer to as transformer neural networks tended to perform better than simpler recurrent neural network models and statistical models that just tally the frequency of word pairs found on the internet or in online databases. But all the models made mistakes, sometimes choosing sentences that sound like nonsense to a human ear.

“That some of the large language models perform as well as they do suggests that they capture something important that the simpler models are missing,” said Nikolaus Kriegeskorte, Ph.D., a principal investigator at Columbia’s Zuckerman Institute and a co-author on the paper. “That even the best models we studied still can be fooled by nonsense sentences shows that their computations are missing something about the way humans process language.”

Consider the following sentence pair that both human participants and the AIs assessed in the study:

That is the narrative we have been sold.

This is the week you have been dying.

People given these sentences in the study judged the first sentence as more likely to be encountered than the second. But according to BERT, one of the better models, the second sentence is more natural. GPT-2, perhaps the most widely known model, correctly identified the first sentence as more natural, matching the human judgments.

“Every model exhibited blind spots, labeling some sentences as meaningful that human participants thought were gibberish,” said senior author Christopher Baldassano, Ph.D., an assistant professor of psychology at Columbia. “That should give us pause about the extent to which we want AI systems making important decisions, at least for now.”

The good but imperfect performance of many models is one of the study results that most intrigues Dr. Kriegeskorte. “Understanding why that gap exists and why some models outperform others can drive progress with language models,” he said.

Another key question for the research team is whether the computations in AI chatbots can inspire new scientific questions and hypotheses that could guide neuroscientists toward a better understanding of human brains. Might the ways these chatbots work point to something about the circuitry of our brains?

Further analysis of the strengths and flaws of various chatbots and their underlying algorithms could help answer that question.

“Ultimately, we are interested in understanding how people think,” said Tal Golan, Ph.D., the paper’s corresponding author who this year segued from a postdoctoral position at Columbia’s Zuckerman Institute to set up his own lab at Ben-Gurion University of the Negev in Israel.

“These AI tools are increasingly powerful but they process language differently from the way we do. Comparing their language understanding to ours gives us a new approach to thinking about how we think.”

More information:
Testing the limits of natural language models for predicting human language judgements, Nature Machine Intelligence (2023). DOI: 10.1038/s42256-023-00718-1, www.nature.com/articles/s42256-023-00718-1

Provided by
Columbia University

Citation:
Verbal nonsense reveals limitations of AI chatbots (2023, September 14)
retrieved 14 September 2023
from https://techxplore.com/news/2023-09-nonsense-reveals-limitations-ai-chatbots.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.




