Movatterモバイル変換


[0]ホーム

URL:


Skip to main content
ScienceDaily
Your source for the latest research news
New! Sign up for our freeemail newsletter.
Science News
from research organizations

Can advanced AI can solve visual puzzles and perform abstract reasoning?

Date:
October 11, 2024
Source:
University of Southern California
Summary:
Artificial Intelligence has learned to master language, generate art, and even beat grandmasters at chess. But can it crack the code of abstract reasoning --t hose tricky visual puzzles that leave humans scratching their heads? Researchers are putting AI's cognitive abilities to the test, pushing the multi-modal large language models (MLLMs) to solve visual problems once reserved for human IQ tests. The result? A glimpse into how far AI has come -- and where it still stumbles.
Share:
FULL STORY

Artificial Intelligence has learned to master language, generate art, and even beat grandmasters at chess. But can it crack the code of abstract reasoning -- those tricky visual puzzles that leave humans scratching their heads? Researchers at USC Viterbi School of Engineering Information Sciences Institute (ISI) are putting AI's cognitive abilities to the test, pushing the multi-modal large language models (MLLMs) to solve visual problems once reserved for human IQ tests. The result? A glimpse into how far AI has come -- and where it still stumbles.

USC Viterbi ISI Research Assistants Kian Ahrabian and Zhivar Sourati recently investigated whether MLLMs can perform nonverbal abstract reasoning, tasks that require both visual perception and logical reasoning, and presented their findings at the Conference on Language Modeling (COLM 2024) in Philadelphia, PA October 7-9, 2024.

Jay Pujara, research associate professor of computer science at the USC Viterbi School of Engineering and an author on the paper said, "Every day we're bombarded with new headlines about what AI can (and can't) do, which are often very surprising. We still have such a limited understanding of what new AI models can do, and until we understand these limitations we can't make AI better, safer, and more useful. This paper helps fill in a missing piece of the story of where AI struggles."

The Challenge: Can AI See and Think?

"We wanted to see if this new generation of large models, which are able to process images, can reason on their own," Ahrabian explained. "For example, if you see a yellow circle turning into a blue triangle, can the model apply the same pattern in a different scenario?"

To answer this question, the team tested 24 different MLLMs on puzzles based on Raven's Progressive Matrices, a well-known test of abstract reasoning. They found that open-source models struggled significantly. "They were really bad. They couldn't get anything out of it," Ahrabian said plainly.

In contrast, closed-source models, such as GPT-4V -- models developed by private companies and not publicly available for modification -- performed better. These models are typically trained with more advanced resources, including larger datasets and more powerful computing systems, giving them a noticeable edge. "We saw some nontrivial results with closed-source models," Ahrabian added, "Specifically, GPT-4V was relatively good at reasoning, but it's far from perfect."

Where the AI Stumbles

A critical part of the study involved dissecting where these models were failing. One key issue was the AI's ability to accurately process visual information. "We wanted to know if the models could see the details -- like colors or lines colliding -- and whether that was where they were going wrong," Ahrabian said.

To isolate the problem, the researchers provided detailed textual descriptions of the images, ensuring the models had all the necessary information in a different format "Even when we removed the visual element and just gave them text, many models still couldn't reason effectively," Sourati explained. This revealed a crucial insight: the issue wasn't just with visual processing -- it was with the reasoning itself. Now, the team had a clearer picture of what wasn't working, which allowed them to refine their focus and guide future improvements.

The Path Forward: Improving AI's Reasoning

One promising method the researchers explored was "Chain of Thought prompting," where the AI is prompted to think step by step through reasoning tasks. This approach led to significant improvements in some cases. "By guiding the models with hints, we were able to see up to 100% improvement in performance," Ahrabian noted.

Despite the remaining challenges, the researchers are optimistic. The study's findings highlight both the current limitations of AI and the exciting possibilities for future advancements. As these models continue to develop, USC's research could pave the way for AI that not only understands but reasons -- blurring the line between machine intelligence and human cognition.

New Research at a New Conference

Ahrabian and Sourati, Ph.D students at the Thomas Lord Department of Computer Science, presented the paper, The Curious Case of Nonverbal Abstract Reasoning with Multi-Modal Large Language Models, at COLM this week, marking the conference's inaugural year.

Pujara, who is also the director of the Center on Knowledge Graphs at ISI, commented, "AI is undergoing a major shift with the advent of language models. The emergence of new conferences like COLM to support this evolution is a great way to foster collaboration and inspire students eager to contribute to this rapidly advancing field."


Story Source:

Materials provided byUniversity of Southern California.Note: Content may be edited for style and length.


Cite This Page:

University of Southern California. "Can advanced AI can solve visual puzzles and perform abstract reasoning?." ScienceDaily. ScienceDaily, 11 October 2024. <www.sciencedaily.com/releases/2024/10/241009183543.htm>.
University of Southern California. (2024, October 11). Can advanced AI can solve visual puzzles and perform abstract reasoning?.ScienceDaily. Retrieved February 17, 2026 from www.sciencedaily.com/releases/2024/10/241009183543.htm
University of Southern California. "Can advanced AI can solve visual puzzles and perform abstract reasoning?." ScienceDaily. www.sciencedaily.com/releases/2024/10/241009183543.htm (accessed February 17, 2026).

Explore More

from ScienceDaily

RELATED STORIES

Feb. 21, 2025 — Artificial Intelligence (AI), particularly large language models like GPT-4, has shown impressive performance on reasoning tasks. But does AI truly understand abstract concepts, or is it just ...
Aug. 12, 2024 — Large Language Models (LLMs) are entirely controllable through human prompts and lack 'emergent abilities'; that is, the means to form their own insights or conclusions. Increasing model ...
Dec. 21, 2023 — The introduction of artificial intelligence is a significant part of the digital transformation bringing challenges and changes to the job descriptions among management. A study shows that ...
Sep. 14, 2023 — The era of artificial-intelligence chatbots that seem to understand and use language the way we humans do has begun. Under the hood, these chatbots use large language models, a particular kind of ...
Apr. 20, 2023 — An AI strategy proven adept at board games like Chess and Go, reinforcement learning, has now been adapted for a powerful protein design program. The results show that reinforcement learning can do ...
July 18, 2022 — Like a newborn animal, a four-legged robot stumbles around during its first walking attempts. But while a foal or a giraffe needs much longer to master walking, the robot learns to move forward ...

TRENDING ATSCITECHDAILY.com

Oxford Breakthrough Reveals the Secret Ingredient Inside Lithium-Ion Batteries

Astronomers Stunned by Rocky Planet in the Wrong Place

A Massive Star Suddenly Vanished and Left a Black Hole Behind

This Unexpected Ingredient Makes Bread Much Healthier

 Print  Email  Share

Breaking

this hour

Trending Topics

this week

SPACE & TIME
NASA
Space Telescopes
Mars
MATTER & ENERGY
Albert Einstein
Materials Science
Spintronics
COMPUTERS & MATH
Computer Modeling
Computers and Internet
Internet

Strange & Offbeat

 

SPACE & TIME
Universe May End in a “big Crunch,” New Dark Energy Data Suggests
Ramanujan’s 100-Year-Old Pi Formula Is Still Revealing the Universe
Hidden Dimensions Could Explain Where Mass Comes from
MATTER & ENERGY
Fusion Reactors May Create Dark Matter Particles
NASA's Webb Finds Life’s Building Blocks Frozen in a Galaxy Next Door
Physicists Prove the Universe Isn’t a Simulation After All
COMPUTERS & MATH
“Existential Risk” – Why Scientists Are Racing to Define Consciousness
Researchers Tested AI Against 100,000 Humans on Creativity
What If AI Becomes Conscious and We Never Know


[8]ページ先頭

©2009-2026 Movatter.jp