Towards a Benchmark for Scientific Understanding in Humans and Machines

Kristian Gonzalez Barman; Sascha Caron; Tom Claassen; Henk De Regt

Download from

dx.doi.org

More download options

Towards a Benchmark for Scientific Understanding in Humans and Machines

Kristian Gonzalez Barman, Sascha Caron, Tom Claassen & Henk De Regt

Minds and Machines 34 (1):1-16 (2024) Copy BIBT_EX

Abstract

Scientific understanding is a fundamental goal of science. However, there is currently no good way to measure the scientific understanding of agents, whether these be humans or Artificial Intelligence systems. Without a clear benchmark, it is challenging to evaluate and compare different levels of scientific understanding. In this paper, we propose a framework to create a benchmark for scientific understanding, utilizing tools from philosophy of science. We adopt a behavioral conception of understanding, according to which genuine understanding should be recognized as an ability to perform certain tasks. We extend this notion of scientific understanding by considering a set of questions that gauge different levels of scientific understanding, covering information retrieval, the capability to arrange information to produce an explanation, and the ability to infer how things would be different under different circumstances. We suggest building a Scientific Understanding Benchmark (SUB), formed by a set of these tests, allowing for the evaluation and comparison of scientific understanding. Benchmarking plays a crucial role in establishing trust, ensuring quality control, and providing a basis for performance evaluation. By aligning machine and human scientific understanding we can improve their utility, ultimately advancing scientific understanding and helping to discover new insights within machines.

Cite

Plain text

BibTeX

Formatted text

Zotero

EndNote

Reference Manager

RefWorks

Options

Mark as duplicate

Find it on Scholar

Request removal from index

Revision history

Edit

Author Profiles

Kristian Gonzalez Barman

Ghent University

Henk W. de Regt

Radboud University

Keywords

Artificial Intelligence Cognitive Psychology Game Theory, Economics, Social and Behav. Sciences Philosophy of Mind Philosophy of Science Theory of Computation

Reprint years

DOI

10.1007/s11023-024-09657-1

My notes

Analytics

Added to PP
2024-04-27

Downloads
8 (#1,322,157)

6 months
8 (#368,968)

Historical graph of downloads

How can I increase my downloads?

Author Profiles

Kristian Gonzalez Barman

Ghent University

Henk W. de Regt

Radboud University

Citations of this work

No citations found.

Add more citations

References found in this work

The extended mind.Andy Clark & David J. Chalmers - 1998 - Analysis 58 (1):7-19.

Minds, brains, and programs.John Searle - 1980 - Behavioral and Brain Sciences 3 (3):417-57.

Computing machinery and intelligence.Alan M. Turing - 1950 - Mind 59 (October):433-60.

Studies in the logic of explanation.Carl Gustav Hempel & Paul Oppenheim - 1948 - Philosophy of Science 15 (2):135-175.

AI as Agency Without Intelligence: on ChatGPT, Large Language Models, and Other Generative Models.Luciano Floridi - 2023 - Philosophy and Technology 36 (1):1-7.

View all 25 references / Add more references

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

Towards a Benchmark for Scientific Understanding in Humans and Machines

Abstract

Author Profiles

Categories

Keywords

Reprint years

DOI

Links

PhilArchive

External links

Through your library

My notes

Similar books and articles

Analytics

Author Profiles

Citations of this work

References found in this work