为什么数学的最终公理会引起如此大的争议

qimuai 发布于 2026-4-30 01:01 阅读：8 一手编译

为什么数学的最终公理会引起如此大的争议

内容来源：https://www.quantamagazine.org/why-maths-final-axiom-proved-so-controversial-20260429/

内容总结：

数学“最终公理”之争：选择公理何以引发百年争议？

在数学界，判定一个命题是否为真的标准是严格的证明。每一次证明都建立在已被证实的结论之上，而这些结论又依赖于更基础的证明，层层递进，最终不得不追溯至某个无法再被证明的起点——这便是公理，数学世界的“基本法则”。如今，现代数学的基石是“泽梅洛-弗兰克尔集合论加上选择公理”（简称ZFC），它由十条基本原则构成，支撑着几乎全部现代数学大厦。

然而，这套看似铁板一块的公理体系，其诞生过程远非一帆风顺，甚至充满了争议和人性化的考量。正如加州大学欧文分校的数学哲学家佩内洛普·马迪所言，任何诚实的审视都会承认，ZFC公理的采纳是“一系列数学考量”的结果，这个过程始于一个多世纪前，至今仍在演进。

危机的起点：悖论与怀疑

19世纪末，数学界陷入了一场由悖论引发的信任危机。当时，数学家们试图寻找一个能统一所有数学分支（如几何、算术）的共同规则体系。格奥尔格·康托尔关于无穷集合的研究，为此提供了可能的方向，也带来了新的困惑。

康托尔发现，实数比整数多，这意味着无穷大也有“大小”之分。他用来比较的工具是看似简单的“集合”——由对象组成的整体。人们逐渐意识到，几乎所有复杂的数学概念都能用集合来表示，这使其成为统一数学的潜在工具。

然而，早期的集合论缺乏严格规则，允许根据任意性质定义集合，这直接导致了著名的“罗素悖论”：考虑“所有不属于自身集合的集合”，它是否包含自身？无论肯定还是否定，都会产生矛盾。这一悖论震撼了整个数学界。

关键时刻：选择公理的诞生

正是在与悖论的斗争中，ZFC体系逐渐成型。1883年，康托尔提出了“良序原理”，声称任何集合都可以被排列，使得其任意非空子集都有一个“最小元素”。这对有限集合是直观的，但对无限集合（如整数集）则不然。康托尔认为，对所有集合（包括无限集）都应如此，这是将无限集视同有限集的一种方式。

1904年，德国数学家恩斯特·泽梅洛证明了良序原理。他的证明依赖于他自己发展出的一个新原则——“选择公理”：如果你有多个（甚至无限多个）非空集合，你可以从每个集合中各选一个元素，组成一个新集合。为了完成证明，泽梅洛列出了他所需的所有假设，这构成了ZFC公理体系的最初蓝本。

从ZF到ZFC：争议与接受

泽梅洛的公理列表出现后，包括亚伯拉罕·弗兰克尔在内的许多数学家也在探索集合论基础，并提出了不同版本的公理。1930年，泽梅洛发布了“最终”列表，包含了对自己公理的修订和一些补充，但一开始并没有纳入选择公理。数学家们对其犹豫不决，因为与其他公理不同，选择公理只是断言了集合的存在，却没有给出构造它的具体方法。

尽管泽梅洛的体系（当时称为ZF）成功地消除了罗素悖论等主要矛盾，但他无法证明该体系是“一致”的（即不会产生矛盾）。他不必为此担心，因为几年后，库尔特·哥德尔证明了任何能表达基本算术的公理体系都无法证明自身的一致性；而且，任何一致的体系必然是不完备的，总存在真的命题无法被证明。

1960年代，斯坦福大学的保罗·科恩更进一步，证明了选择公理与其他公理是“独立”的——即，在ZF规则下，选择公理既无法被证明为真，也无法被证明为假。

既然逻辑无法裁决选择公理的对错，问题就变成了：它有用吗？答案显然是肯定的。它为大量数学研究（尤其是涉及无穷对象的领域）提供了可能。“没有选择公理，你的工具非常有限，就像被绑着手做数学”，集合论学家琼·巴加里亚形象地比喻。正因为其强大的“生产力”，选择公理最终被广泛接受，字母“C”被正式添加到公理列表中，形成了今天的ZFC。

争议的启示：公理并非不言自明

选择公理的争议史生动地表明，数学公理并非不言自明或显而易见。一个公理被接受，往往是因为它的“实用价值”——它能产生有趣且有影响力的定理。尽管ZFC公理被视为人类所能阐述的最普适、最坚实的真理——即使物理定律可能在另一个宇宙中被颠覆，数学定律依然恒定——但它仍然带有“选择”的烙印：它们是数学家们最终选择相信的东西。这种悖论，至今仍在延续。

中文翻译：

为何数学最终公理引发如此争议
引言
数学家如何判断某事为真？他们通过证明来确认。
通常，他们会基于现有证明展开工作，在已被证实的论断之间建立联系或加以延伸。而这些证明本身，又依赖其他证明来支撑其论点，如此层层递进。证明之上叠加证明，真理之上累积真理。但这个过程终究会抵达终点。在某个节点上，某些事物之所以为真，仅仅因为它们本就如此。
这些真理便是公理，即基本规则。人们很容易就此止步——正如加州大学尔湾分校的数学哲学家佩内洛普·马迪所言，宣称“公理是显而易见的、直观的或概念上的真理”。
毕竟，大多数数学家即便愿意承认公理的存在，也只是简单接受自己的研究依赖于一套公理体系——即“带有选择公理的策梅洛-弗兰克尔集合论”，简称ZFC。ZFC由十条基本原则构成，共同奠定了几乎整个现代数学的基础。
但深入审视后会发现，确立真理的过程远更充满变数与人的因素。“但凡诚实而清醒地审视ZFC公理是如何被采纳的人，都会承认这些决策背后涉及了广泛的数学考量，”马迪表示。
这一始于一个多世纪前的进程，至今仍在持续推进中。
悖论与质疑
19世纪末是一个充满悖论与质疑的时代，源于数学家们开始探索数学宇宙所遵循的统一法则体系。当时虽存在公理系统，但往往仅适用于特定数学领域：欧几里得的几何公设、各种算术标准化方案。但这些系统如何彼此兼容？全部数学能否源于一套共同规则？
数学家们在格奥尔格·康托尔的研究中找到了潜在答案——也引发了更多疑问。
彼时，康托尔正在研究实数（即数轴上出现的所有数字）以及它们能揭示的无穷本质。他发现实数数量多于整数，从而得出深刻结论：并非所有无穷大都是同等的规模。
为进行这种比较，康托尔使用了看似简单的工具：集合。集合是对象或元素的汇集。它可以是数字的集合（如实数），也可以是形状的集合，甚至可以是其他集合的集合。随着时间的推移，人们发现复杂而迥异的数学概念——几乎全部数学概念——都能用这些基本实体来表征。由此，集合成为消除不同数学领域间不一致性的潜在工具。
但早期集合论缺乏规范法则。人们可以依据任意性质定义集合，这恰好导致了当时困扰数学家的那类悖论。例如，考虑“所有不属于自身的集合构成的集合”。这个集合是否包含自身？无论回答是或否，都会产生矛盾——即后来被称为“罗素悖论”的困境。
当数学家们为这些难题绞尽脑汁时，ZFC在与康托尔另一思想的碰撞中逐步成形。
1883年，康托尔提出了他的“良序原理”。他声称应当能够将任意集合进行排列，使得其所有（非空）子集都存在最小元素。对于有限集合，这很直观——总能把较小元素排在前面。但对无限集合而言，这点并不明显。以整数集{…, -2, -1, 0, 1, 2, …}为例，负数构成子集，但数值会无限递减，似乎不存在最小元素。
但若将原整数集重新排列为{0, -1, 1, -2, 2, …}呢？此时可以说：任意子集中排在最前的元素即为最小元素。如此一来，-1便成为负数子集的最小元素。
康托尔的“法则”主张这对所有集合都应成立——即使无法明确构造出恰当的排序。这是论证无限集与有限集行为相似的一种方式。
1904年，德国数学家恩斯特·策梅洛证明了这一点。他通过证明康托尔法则等价于自己在探索集合性质时发展出的一个原理来完成论证。这个原理即所谓的“选择公理”：若从多个（甚至无穷多个）非空集合出发，可以从每个集合中选取一个元素构成新集合。
为了证明这种等价性，策梅洛制定了其他公理。“他只是罗列了证明所需的所有假设，”巴塞罗那大学的集合论学者琼·巴加里亚说道。这份清单包含基本概念：存在由元素定义的事物——即集合。其他公理涉及从已有集合构造新集合、以及无穷集合的存在性。
策梅洛的公理清单问世时，亚伯拉罕·弗兰克尔等众多数学家也正在探索集合论基础。他们中的许多人不约而同地得出了相似概念的不同表述——并补充了一些新内容，以解决由更大规模无穷相关理论引发的新问题。1930年，策梅洛发布了“最终”清单，包含对其原有公理的修订及若干补充——但最初并未包含选择公理。数学家们对纳入该公理更为犹豫，因为与其他公理不同，它定义了集合却没有给出明确的构造方式。
策梅洛欣慰地发现，其被称为ZF的公理体系似乎清除了集合论宇宙中的许多重大悖论（如罗素悖论）。但他遗憾未能证明该公理体系的“一致性”——即不会产生矛盾。
他本不必担忧。在ZF诞生仅数年后，库尔特·哥德尔证明：任何能表达基本算术的公理体系都无法证明自身一致性。此外，任何一致系统必然不完全——即存在无法用该体系公理证明的真数学陈述。
事实上，在20世纪60年代，斯坦福大学数学家保罗·科恩证明选择公理与其他公理“独立”——即在ZF规则下，选择公理既不能被证明为真，也不能被证明为假。
一旦明确逻辑无法以任何方式验证选择公理，问题便转为：它有用吗？答案是有用。它使得大量其他数学成为可能——尤其涉及无穷对象的数学。此后，选择公理获得了更广泛的接受。“没有选择公理，你的工具将极其有限，”巴加里亚说，“就像被绑着手做数学。”于是，C（代表“选择”）被附加到最初为其奠定基础的那套公理清单上。
选择公理揭示了“数学公理不言自明”这一信念的谬误。正如马迪所言，公理亦可因其他众多理由而被接受——例如，因其推导出有趣定理的能力。
ZFC公理常被视为人类所能阐述的最具普适性的真理——因为物理学家或许能设想物理定律彻底颠倒的宇宙，但数学定律将恒久不变。
这是一个无解的悖论：数学基础如人类所知的一切那般普适而坚实，是几乎所有数学真理的核心组成部分。然而它们终究只是我们选择相信的东西。

英文来源：

Why Math’s Final Axiom Proved So Controversial
Introduction
How do mathematicians decide that something is true? They write a proof.
Often they start with proofs that already exist, building on or drawing connections between proven claims. Each of these proofs, in turn, has relied on other proofs to make its point, and so on. Proofs upon proofs. Truths upon truths. But eventually this process must come to an end. At some point, things are true simply because they are.
These truths are the axioms, the ground rules. And it is tempting to stop there — to declare, as Penelope Maddy, a philosopher of mathematics at the University of California, Irvine, put it, “that axioms are obvious or intuitive or conceptual truths.”
After all, most mathematicians simply accept that their work relies on an axiomatic system — namely, “Zermelo-Fraenkel set theory with the axiom of choice,” or ZFC — if they bother to acknowledge the axioms at all. ZFC is a list of 10 basic principles that together form the foundation on which nearly all of modern mathematics is built.
But a closer inspection reveals a more unsettled, human process of establishing truth. “Any honest, clear-eyed examination of how the axioms of ZFC came to be adopted would have to acknowledge that a wide range of mathematical considerations went into these decisions,” Maddy said.
That process, which began over a century ago, is still very much in progress.
Paradoxes and Doubt
The late 1800s were a time of paradoxes and doubt, the result of mathematicians beginning to search for cohesive ideas about what rules the mathematical universe obeyed. There were axiomatic systems out there, but they tended to be for specific areas of math: Euclid’s postulates for geometry; various schemes for standardizing arithmetic. But how did they all fit together? Could all of math be derived from one common set of rules?
Mathematicians found a potential solution — and more doubts — in the work of Georg Cantor.
At the time, Cantor was studying the real numbers — that is, all the numbers that appear on the number line — and what they could say about the nature of infinity. He had found that there were more real numbers than whole numbers, giving rise to the profound realization that not all infinities are the same size.
To make this comparison, Cantor had used a seemingly simple tool: the set. A set is a collection of objects, or elements. It might be a collection of numbers, like the real numbers, or a collection of shapes, or even a collection of other sets. Over time, it became clear that complex and disparate mathematical ideas — nearly all of them — could be represented with these same elementary entities. As a result, the set emerged as a potential tool for ironing out any inconsistencies between different areas of math.
But early set theory lacked canonical rules. It was possible to define sets with any property, which led to exactly the sorts of paradoxes that were bothering mathematicians at the time. Consider, for example, the set of all sets that are not members of themselves. Does this set contain itself? Whether you answer yes or no, you get a contradiction now known as Russell’s paradox.
As mathematicians obsessed over these dilemmas, ZFC emerged out of a struggle with another idea of Cantor’s.
In 1883, Cantor introduced his “well-ordering principle.” He claimed that it should be possible to arrange any set so that all of its (non-empty) subsets would have a smallest element. For finite sets, this is intuitive. You can always put the lesser items first. But for infinite sets, it is less obvious. Take the set of integers {…, −2, −1, 0, 1, 2, …}. The negative numbers form a subset, but they also get lower and lower for eternity. It seems as if there can be no least element.
But what if you arrange the original set of integers like this: {0, −1, 1, −2, 2, …}? Now you can say that the smallest element is the one that comes first in any subset. In this way, −1 becomes the smallest element of the subset of negative numbers.
Cantor’s “law” was that this should be possible for all sets, even if you can’t explicitly construct the proper ordering. It was one way of arguing that infinite sets behave like finite ones.
In 1904, the German mathematician Ernst Zermelo proved it. He did so by showing that Cantor’s law was equivalent to a principle he had developed while exploring the properties of sets. This principle, the so-called axiom of choice, said that if you start with multiple (or even infinitely many) non-empty sets, you can choose one element from each of those sets to create a new set.
Zermelo developed his other axioms to prove this equivalence. “He was just listing all the assumptions that he needed to get the proof through,” said Joan Bagaria, a set theorist at the University of Barcelona. That list included the basic idea that there is such a thing as a set, which is defined by its elements. Other axioms dealt with the formation of sets from other sets, or with the existence of infinite sets.
Zermelo’s list of axioms emerged at a time when many mathematicians, such as Abraham Fraenkel, were also tinkering with set theory’s foundations. Many of them found themselves arriving at different formulations of similar ideas — and some new ones, too, that resolved problems arising from newer theories having to do with larger forms of infinity. In 1930, Zermelo released a “final” list that included revisions to his own axioms as well as a handful of additions — but not, at first, the axiom of choice. Mathematicians were more hesitant to include it, because unlike the other axioms, it defined sets without giving an explicit way to construct them.
Zermelo was pleased that his list of principles, known as ZF, appeared to cleanse the set-theoretic universe of many major paradoxes such as Russell’s. But he lamented that he was unable to prove that his axiomatic system was “consistent” — that it didn’t yield contradictions.
He need not have worried. Just a few years after the arrival of ZF, Kurt Gödel showed that no axiomatic system capable of basic arithmetic can be used to prove its own consistency. Moreover, any consistent system must also be incomplete, meaning that there are true mathematical statements that cannot be proved using the system’s axioms.
In fact, in the 1960s, the Stanford mathematician Paul Cohen proved that the axiom of choice is “independent” of the other axioms — that is, under the rules of ZF, the axiom of choice cannot be proved true or false.
Once it was clear that logic could not validate the axiom of choice one way or the other, the question became: Is it useful? And it was. It makes a great deal of other math possible — especially math related to infinite objects. After that, the axiom of choice gained much more widespread acceptance. “Without choice, your tools are very limited,” Bagaria said. “It’s like doing math with your hands tied behind your back.” And so the C (for “choice”) came to be affixed to the list of axioms that were originally developed to support it.
The axiom of choice demonstrates the folly of believing that mathematical axioms are self-evident or obvious. An axiom can also be accepted for plenty of other reasons, as Maddy put it — such as for its power to generate interesting theorems.
The ZFC axioms are often regarded as perhaps the most universal truths that humanity has managed to articulate — for while it may be possible for physicists to imagine universes in which physical laws are turned inside out, mathematical laws will remain constant.
It is a paradox without resolution: The foundations of mathematics are as universal, as solid as anything humanity knows, a core part of nearly every mathematical truth. And yet they remain simply what we choose to believe.

quanta

文章目录

📚 推荐阅读

扫描二维码，在手机上阅读