Engineering AI for provable retention of objectives over time

Adeniyi Fasoro

Download from

onlinelibrary.wiley.com

Engineering AI for provable retention of objectives over time

Adeniyi Fasoro

AI Magazine 45 (2):1-11 (2024) Copy BIBT_EX

Abstract

I argue that ensuring artificial intelligence (AI) retains alignment with human values over time is critical yet understudied. Most research focuses on static alignment, neglecting crucial retention dynamics enabling stability during learning and autonomy. This paper elucidates limitations constraining provable retention, arguing key gaps include formalizing dynamics, transparency of advanced systems, participatory scaling, and risks of uncontrolled recursive self-improvement. I synthesize technical and ethical perspectives into a conceptual framework grounded in control theory and philosophy to analyze dynamics. I argue priorities should shift towards capability modulation, participatory design, and advanced modeling to verify enduring alignment. Overall, I argue that realizing AI safely aligned throughout its lifetime necessitates translating principles into formal methods, demonstrations, and systems integrating technical and humanistic rigor.

Cite

Plain text

BibTeX

Formatted text

Zotero

EndNote

Reference Manager

RefWorks

Options

Mark as duplicate

Find it on Scholar

Request removal from index

Revision history

Edit

Author's Profile

Adeniyi Fasoro

Keywords

Add keywords

Reprint years

My notes

Similar books and articles

The Protention-Retention Asymmetry in Husserl’s Conception of Time Consciousness.Cristian Dimitriu - 2014 - Praxis Filosófica:209-229.

HUSSERL’DE RETENSİYON VE ANIMSAMA AYRIMI.A. Suat Gozcu - 2017 - Felsefe Arkivi 44:13-23.

Arithmetical interpretations of dynamic logic.Petr Hájek - 1983 - Journal of Symbolic Logic 48 (3):704-713.

Belief Retention: A Fregean Account.Vojislav Bozickovic - 2015 - Erkenntnis 80 (3):477-486.

Proof vs Provability: On Brouwer’s Time Problem.Palle Yourgrau - 2020 - History and Philosophy of Logic 41 (2):140-153.

Engineering Ethics Education: A Comparative Study of Japan and Malaysia.Balamuralithara Balakrishnan, Fumihiko Tochinai & Hidekazu Kanemitsu - 2019 - Science and Engineering Ethics 25 (4):1069-1083.

Supplementary report: Time between pairings and short-term retention.Lloyd R. Peterson, Kenneth Hillner & Dorothy Saltzman - 1962 - Journal of Experimental Psychology 64 (5):550.

The End-Use Problem in Engineering Ethics.C. Thomas Rogers - 1980 - PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association 1980:464 - 480.

First-list retention and time and method of recall.John P. Houston - 1966 - Journal of Experimental Psychology 71 (6):839.

On the Provable Contradictions of the Connexive Logics C and C3.Satoru Niki & Heinrich Wansing - 2023 - Journal of Philosophical Logic 52 (5):1355-1383.

James Mensch: Husserl’s Account of our Consciousness of Time: Marquette University Press, 2010, 278 pp. $29.00, ISBN 13: 978-0-87462-801-2. [REVIEW]Lanei M. Rodemeyer - 2013 - Husserl Studies 29 (2):171-179.

The relative effect of a time interval upon learning and retention.L. M. Johnson - 1939 - Journal of Experimental Psychology 24 (2):169.

An Essential Definition of Engineering to Support Engineering Research in the Twenty-First Century.Orlando Lopez-Cruz - 2022 - International Journal of Philosophy 10 (4):130.

Is a provable measure of time possible-on the protophysics of time.P. Rohs - 1986 - Philosophische Rundschau 33 (1-2):133-151.

Intuitionistically provable recursive well-orderings.Harvey M. Friedman & Andre Scedrov - 1986 - Annals of Pure and Applied Logic 30 (2):165-171.

Analytics

Added to PP
2024-04-14

Downloads
5 (#1,544,164)

6 months
5 (#647,370)

Historical graph of downloads

How can I increase my downloads?

Author's Profile

Adeniyi Fasoro

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

Engineering AI for provable retention of objectives over time

Abstract

Author's Profile

Categories

Keywords

Reprint years

Links

PhilArchive

External links

Through your library

My notes

Similar books and articles

Analytics

Author's Profile

Citations of this work

References found in this work