Philosophies 5 (4):40 (2020)


Abstract

An artificial general intelligence (AGI) might have an instrumental drive to modify its utility function to improve its ability to cooperate, bargain, promise, threaten, and resist and engage in blackmail. Such an AGI would necessarily have a utility function that was at least partially observable and that was influenced by how other agents chose to interact with it. This instrumental drive would conflict with the strong orthogonality thesis since the modifications would be influenced by the AGI’s intelligence. AGIs in highly competitive environments might converge to having nearly the same utility function, one optimized to favorably influence other agents through game theory. Nothing in our analysis weakens arguments concerning the risks of AGI.


Author's Profile

Roman Yampolskiy

University of Louisville

Keywords

instrumental drives; orthogonality thesis; artificial general intelligence


DOI

10.3390/philosophies5040040


Analytics

Added to PP: 2020-12-10

Downloads: 55 (#432,821)

6 months: 10 (#383,414)


Citations of this work

Is superintelligence necessarily moral? Leonard Dung - 2024 - Analysis 84 (4):730-738.

Existential risk from AI and orthogonality: Can we have it both ways? Vincent C. Müller & Michael Cannon - 2021 - Ratio 35 (1):25-36.

Liability for Robots: Sidestepping the Gaps. Bartek Chomanski - 2021 - Philosophy and Technology 34 (4):1013-1032.

Multiple unnatural attributes of AI undermine common anthropomorphically biased takeover speculations. Preston W. Estep - forthcoming - AI and Society:1-16.

Provably Safe Artificial General Intelligence via Interactive Proofs. Kristen Carlson - 2021 - Philosophies 6 (4):83.


References found in this work

Pascal's mugging. Nick Bostrom - 2009 - Analysis 69 (3):443-445.

Intelligent machinery, a heretical theory. A. M. Turing - 1996 - Philosophia Mathematica 4 (3):256-260.

Thinking Inside the Box: Controlling and Using an Oracle AI. Stuart Armstrong, Anders Sandberg & Nick Bostrom - 2012 - Minds and Machines 22 (4):299-324.

The Superintelligent Will: Motivation and Instrumental Rationality in Advanced Artificial Agents. [REVIEW] Nick Bostrom - 2012 - Minds and Machines 22 (2):71-85.


An AGI Modifying Its Utility Function in Violation of the Strong Orthogonality Thesis
