Movatterモバイル変換


[0]ホーム

URL:


Wikipedia

Neuro-symbolic AI

This articlecontainsweasel words: vague phrasing that often accompaniesbiased orunverifiable information. Such statements should beclarified or removed.(May 2024)

Neuro-symbolic AI is a type ofartificial intelligence that integratesneural andsymbolic AI architectures to address the weaknesses of each, providing a robust AI capable ofreasoning, learning, andcognitive modeling. As argued byLeslie Valiant[1] and others,[2][3] the effective construction of rich computationalcognitive models demands the combination ofsymbolic reasoning and efficientmachine learning.

Gary Marcus argued, "We cannot construct richcognitive models in an adequate, automated way without the triumvirate of hybrid architecture, rich prior knowledge, and sophisticated techniques for reasoning."[4] Further, "To build a robust, knowledge-driven approach to AI we must have the machinery of symbol manipulation in our toolkit. Too much useful knowledge is abstract to proceed without tools that represent and manipulate abstraction, and to date, the only known machinery that can manipulate such abstract knowledge reliably is the apparatus of symbol manipulation."[5]

Angelo Dalli,[6]Henry Kautz,[7]Francesca Rossi,[8] andBart Selman[9] also argued for such a synthesis. Their arguments attempt to address the two kinds of thinking, as discussed inDaniel Kahneman's bookThinking Fast and Slow. It describes cognition as encompassing two components: System 1 is fast, reflexive, intuitive, and unconscious. System 2 is slower, step-by-step, and explicit. System 1 is used forpattern recognition. System 2 handles planning, deduction, and deliberative thinking. In this view,deep learning best handles the first kind of cognition while symbolic reasoning best handles the second kind. Both are needed for a robust, reliable AI that can learn, reason, and interact with humans to accept advice and answer questions. Such dual-process models with explicit references to the two contrasting systems have been worked on since the 1990s, both in AI and in Cognitive Science, by multiple researchers.[10]

Approaches

edit

Approaches for integration are diverse.[11]Henry Kautz's taxonomy of neuro-symbolic architectures[12] follows, along with some examples:

  • Symbolic Neural symbolic is the current approach of many neural models innatural language processing, where words or subword tokens are the ultimate input and output oflarge language models. Examples includeBERT, RoBERTa, andGPT-3.
  • Symbolic[Neural] is exemplified byAlphaGo, where symbolic techniques are used to invoke neural techniques. In this case, the symbolic approach isMonte Carlo tree search and the neural techniques learn how to evaluate game positions.
  • Neural | Symbolic uses a neural architecture to interpret perceptual data as symbols and relationships that are reasoned about symbolically. Neural-Concept Learner[13] is an example.
  • Neural: Symbolic → Neural relies on symbolic reasoning to generate or labeltraining data that is subsequently learned by a deep learning model, e.g., to train a neural model for symbolic computation by using aMacsyma-likesymbolic mathematics system to create or label examples.
  • NeuralSymbolic uses aneural net that is generated from symbolic rules. An example is the Neural Theorem Prover,[14] which constructs a neural network from anAND-OR proof tree generated from knowledge base rules and terms. Logic Tensor Networks[15] also fall into this category.
  • Neural[Symbolic] allows a neural model to directly call a symbolic reasoning engine, e.g., to perform an action or evaluate a state. An example would beChatGPT using aplugin to queryWolfram Alpha.

These categories are not exhaustive, as they do not consider multi-agent systems. In 2005, Bader andHitzler presented a more fine-grained categorization that considered, e.g., whether the use of symbols included logic and if it did, whether the logic waspropositional or first-order logic.[16] The 2005 categorization and Kautz's taxonomy above are compared and contrasted in a 2021 article.[12] Recently,Sepp Hochreiter argued thatGraph Neural Networks "...are the predominant models of neural-symbolic computing"[17] since "[t]hey describe the properties of molecules, simulate social networks, or predict future states in physical and engineering applications with particle-particle interactions."[18]

Artificial general intelligence

edit

Gary Marcus argues that "...hybrid architectures that combine learning and symbol manipulation are necessary for robust intelligence, but not sufficient",[19] and that there are

...four cognitive prerequisites for building robust artificial intelligence:

  • hybrid architectures that combine large-scale learning with the representational and computational powers of symbol manipulation,
  • large-scale knowledge bases—likely leveraging innate frameworks—that incorporate symbolic knowledge along with other forms of knowledge,
  • reasoning mechanisms capable of leveraging those knowledge bases in tractable ways, and
  • richcognitive models that work together with those mechanisms andknowledge bases.[20]

This echoes earlier calls for hybrid models as early as the 1990s.[21][22]

History

edit

Garcez and Lamb described research in this area as ongoing at least since the 1990s.[23][24] At that time, the termssymbolic and sub-symbolic AI were popular.

A series of workshops on neuro-symbolic AI has been held annually since 2005 Neuro-Symbolic Artificial Intelligence.[25] In the early 1990s, an initial set of workshops on this topic were organized.[21]

Research

edit

Key research questions remain,[26] such as:

  • What is the best way to integrate neural and symbolic architectures?
  • How should symbolic structures be represented within neural networks and extracted from them?
  • How should common-sense knowledge be learned and reasoned about?
  • How can abstract knowledge that is hard to encode logically be handled?

Implementations

edit

Implementations of neuro-symbolic approaches include:

  • AllegroGraph: an integrated Knowledge Graph based platform for neuro-symbolic application development.[27][28][29]
  • Scallop: a language based onDatalog that supports differentiable logical and relational reasoning. Scallop can be integrated inPython and with aPyTorch learning module.[30]
  • Logic Tensor Networks: encode logical formulas as neural networks and simultaneously learn term encodings, term weights, and formula weights.
  • DeepProbLog: combines neural networks with the probabilistic reasoning ofProbLog.
  • SymbolicAI: a compositional differentiable programming library.
  • Explainable Neural Networks (XNNs): combine neural networks with symbolichypergraphs and trained using a mixture of backpropagation and symbolic learning called induction.[31]

See also

edit

Citations

edit
  1. ^Valiant 2008.
  2. ^Garcez et al. 2015.
  3. ^D'Avila Garcez, Artur S.; Lamb, Luis C.; Gabbay, Dov M. (2009).Neural-symbolic cognitive reasoning. Cognitive technologies. Springer.ISBN 978-3-540-73245-7.
  4. ^Marcus 2020, p. 44.
  5. ^Marcus & Davis 2019, p. 17.
  6. ^Dalli 2025.
  7. ^Kautz 2020.
  8. ^Rossi 2022.
  9. ^Selman 2022.
  10. ^Sun 1995.
  11. ^"Disentangling visual attributes with neuro-vector-symbolic architectures, in-memory computing, and device noise".IBM Research. 2021-02-09. Retrieved2024-10-20.
  12. ^abSarker, Md Kamruzzaman; Zhou, Lu; Eberhart, Aaron;Hitzler, Pascal (2021)."Neuro-symbolic artificial intelligence: Current trends".AI Communications.34 (3):197–209.doi:10.3233/AIC-210084.S2CID 239199144.
  13. ^Mao et al. 2019.
  14. ^Rocktäschel, Tim; Riedel, Sebastian (2016)."Learning Knowledge Base Inference with Neural Theorem Provers".Proceedings of the 5th Workshop on Automated Knowledge Base Construction. San Diego, CA: Association for Computational Linguistics. pp. 45–50.doi:10.18653/v1/W16-1309. Retrieved2022-08-06.
  15. ^Serafini, Luciano; Garcez, Artur d'Avila (2016). "Logic Tensor Networks: Deep Learning and Logical Reasoning from Data and Knowledge".arXiv:1606.04422 [cs.AI].
  16. ^Bader & Hitzler 2005.
  17. ^L.C. Lamb, A.S. d'Avila Garcez, M.Gori, M.O.R. Prates, P.H.C. Avelar,M.Y. Vardi (2020). "Graph Neural Networks Meet Neural-Symbolic Computing: A Survey and Perspective." CoRR abs/2003.00330 (2020)
  18. ^Hochreiter, Sepp (April 2022)."Toward a broad AI".Communications of the ACM.65 (4):56–57.doi:10.1145/3512715.ISSN 0001-0782.
  19. ^Marcus 2020, p. 50.
  20. ^Marcus 2020, p. 48.
  21. ^abSun & Bookman 1994.
  22. ^Honavar 1995.
  23. ^Garcez & Lamb 2020, p. 2.
  24. ^Garcez et al. 2002.
  25. ^"Neuro-Symbolic Artificial Intelligence".people.cs.ksu.edu. Retrieved2023-09-11.
  26. ^Sun 2001.
  27. ^Harper, Jelani (2023-12-29)."AllegroGraph 8.0 Incorporates Neuro-Symbolic AI, a Pathway to AGI".The New Stack. Retrieved2024-06-13.
  28. ^"Neuro-Symbolic AI and Large Language Models Introduction | AllegroGraph 8.1.1".franz.com. Retrieved2024-06-13.
  29. ^"Franz Inc. Introduces AllegroGraph Cloud: A Managed Service for Neuro-Symbolic AI Knowledge Graphs".Datanami. Retrieved2024-06-13.
  30. ^Li, Ziyang; Huang, Jiani; Naik, Mayur (2023). "Scallop: A Language for Neurosymbolic Programming".arXiv:2304.04812 [cs.PL].
  31. ^"Model Induction Method for Explainable AI". USPTO. 2021-05-06.

References

edit

External links

edit

[8]ページ先頭

©2009-2025 Movatter.jp