Search This Blog

Sunday, July 10, 2022

Implementation of mathematics in set theory

From Wikipedia, the free encyclopedia

This article examines the implementation of mathematical concepts in set theory. The implementation of a number of basic mathematical concepts is carried out in parallel in ZFC (the dominant set theory) and in NFU, the version of Quine's New Foundations shown to be consistent by R. B. Jensen in 1969 (here understood to include at least axioms of Infinity and Choice).

What is said here applies also to two families of set theories: on the one hand, a range of theories including Zermelo set theory near the lower end of the scale and going up to ZFC extended with large cardinal hypotheses such as "there is a measurable cardinal"; and on the other hand a hierarchy of extensions of NFU which is surveyed in the New Foundations article. These correspond to different general views of what the set-theoretical universe is like, and it is the approaches to implementation of mathematical concepts under these two general views that are being compared and contrasted.

It is not the primary aim of this article to say anything about the relative merits of these theories as foundations for mathematics. The reason for the use of two different set theories is to illustrate that multiple approaches to the implementation of mathematics are feasible. Precisely because of this approach, this article is not a source of "official" definitions for any mathematical concept.

Preliminaries

The following sections carry out certain constructions in the two theories ZFC and NFU and compare the resulting implementations of certain mathematical structures (such as the natural numbers).

Mathematical theories prove theorems (and nothing else). So saying that a theory allows the construction of a certain object means that it is a theorem of that theory that that object exists. This is a statement about a definition of the form "the x such that exists", where is a formula of our language: the theory proves the existence of "the x such that " just in case it is a theorem that "there is one and only one x such that ". (See Bertrand Russell's theory of descriptions.) Loosely, the theory "defines" or "constructs" this object in this case. If the statement is not a theorem, the theory cannot show that the object exists; if the statement is provably false in the theory, it proves that the object cannot exist; loosely, the object cannot be constructed.

ZFC and NFU share the language of set theory, so the same formal definitions "the x such that " can be contemplated in the two theories. A specific form of definition in the language of set theory is set-builder notation: means "the set A such that for all x, " (A cannot be free in ). This notation admits certain conventional extensions: is synonymous with ; is defined as , where is an expression already defined.

Expressions definable in set-builder notation make sense in both ZFC and NFU: it may be that both theories prove that a given definition succeeds, or that neither do (the expression fails to refer to anything in any set theory with classical logic; in class theories like NBG this notation does refer to a class, but it is defined differently), or that one does and the other doesn't. Further, an object defined in the same way in ZFC and NFU may turn out to have different properties in the two theories (or there may be a difference in what can be proved where there is no provable difference between their properties).

Further, set theory imports concepts from other branches of mathematics (in intention, all branches of mathematics). In some cases, there are different ways to import the concepts into ZFC and NFU. For example, the usual definition of the first infinite ordinal in ZFC is not suitable for NFU because the object (defined in purely set theoretical language as the set of all finite von Neumann ordinals) cannot be shown to exist in NFU. The usual definition of in NFU is (in purely set theoretical language) the set of all infinite well-orderings all of whose proper initial segments are finite, an object which can be shown not to exist in ZFC. In the case of such imported objects, there may be different definitions, one for use in ZFC and related theories, and one for use in NFU and related theories. For such "implementations" of imported mathematical concepts to make sense, it is necessary to be able to show that the two parallel interpretations have the expected properties: for example, the implementations of the natural numbers in ZFC and NFU are different, but both are implementations of the same mathematical structure, because both include definitions for all the primitives of Peano arithmetic and satisfy (the translations of) the Peano axioms. It is then possible to compare what happens in the two theories as when only set theoretical language is in use, as long as the definitions appropriate to ZFC are understood to be used in the ZFC context and the definitions appropriate to NFU are understood to be used in the NFU context.

Whatever is proven to exist in a theory clearly provably exists in any extension of that theory; moreover, analysis of the proof that an object exists in a given theory may show that it exists in weaker versions of that theory (one may consider Zermelo set theory instead of ZFC for much of what is done in this article, for example).

Empty set, singleton, unordered pairs and tuples

These constructions appear first because they are the simplest constructions in set theory, not because they are the first constructions that come to mind in mathematics (though the notion of finite set is certainly fundamental). Even though NFU also allows the construction of set ur-elements yet to become members of a set, the empty set is the unique set with no members:

For each object , there is a set with as its only element:

For objects and , there is a set containing and as its only elements:

The union of two sets is defined in the usual way:

This is a recursive definition of unordered -tuples for any concrete (finite sets given as lists of their elements:)

In NFU, all the set definitions given work by stratified comprehension; in ZFC, the existence of the unordered pair is given by the Axiom of Pairing, the existence of the empty set follows by Separation from the existence of any set, and the binary union of two sets exists by the axioms of Pairing and Union ().

Ordered pair

First, consider the ordered pair. The reason that this comes first is technical: ordered pairs are needed to implement relations and functions, which are needed to implement other concepts which may seem to be prior. The first definition of the ordered pair was the definition proposed by Norbert Wiener in 1914 in the context of the type theory of Principia Mathematica. Wiener observed that this allowed the elimination of types of n-ary relations for n > 1 from the system of that work. It is more usual now to use the definition , due to Kuratowski. Either of these definitions works in either ZFC or NFU. In NFU, these two definitions have a technical disadvantage: the Kuratowski ordered pair is two types higher than its projections, while the Wiener ordered pair is three types higher. It is common to postulate the existence of a type-level ordered pair (a pair which is the same type as its projections) in NFU. It is convenient to use the Kuratowski pair in both systems until the use of type-level pairs can be formally justified. The internal details of these definitions have nothing to do with their actual mathematical function. For any notion of ordered pair, the thing that matters is that it satisfies the defining condition

…and that it be reasonably easy to collect ordered pairs into sets.

Relations

Relations are sets whose members are all ordered pairs. Where possible, a relation (understood as a binary predicate) is implemented as (which may be written as ). When is a relation, the notation means .

In ZFC, some relations (such as the general equality relation or subset relation on sets) are 'too large' to be sets (but may be harmlessly reified as proper classes). In NFU, some relations (such as the membership relation) are not sets because their definitions are not stratified: in , and would need to have the same type (because they appear as projections of the same pair), but also successive types (because is considered as an element of ).

Related definitions

Let and be given binary relations. Then the following concepts are useful:

The converse of is the relation .

The domain of is the set .

The range of is the domain of the converse of . That is, the set .

The field of is the union of the domain and range of .

The preimage of a member of the field of is the set (used in the definition of 'well-founded' below.)

The downward closure of a member of the field of is the smallest set containing , and containing each for each (i.e., including the preimage of each of its elements with respect to as a subset.)

The relative product of and is the relation .

Notice that with our formal definition of a binary relation, the range and codomain of a relation are not distinguished. This could be done by representing a relation with codomain as , but our development will not require this.

In ZFC, any relation whose domain is a subset of a set and whose range is a subset of a set will be a set, since the Cartesian product is a set (being a subclass of ), and Separation provides for the existence of . In NFU, some relations with global scope (such as equality and subset) can be implemented as sets. In NFU, bear in mind that and are three types lower than in (one type lower if a type-level ordered pair is used).

Properties and kinds of relations

A binary relation is:

  • Reflexive if for every in the field of .
  • Symmetric if .
  • Transitive if .
  • Antisymmetric if .
  • Well-founded if for every set which meets the field of , whose preimage under does not meet .
  • Extensional if for every in the field of , if and only if and have the same preimage under .

Relations having certain combinations of the above properties have standard names. A binary relation is:

  • An equivalence relation if is reflexive, symmetric, and transitive.
  • A partial order if is reflexive, antisymmetric, and transitive.
  • A linear order if is a partial order and for every in the field of , either or .
  • A well-ordering if is a linear order and well-founded.
  • A set picture if is well-founded and extensional, and the field of either equals the downward closure of one of its members (called its top element), or is empty.

Functions

A functional relation is a binary predicate such that Such a relation (predicate) is implemented as a relation (set) exactly as described in the previous section. So the predicate is implemented by the set . A relation is a function if and only if It is therefore possible to define the value function as the unique object such that  – i.e.: is -related to such that the relation holds between and  – or as the unique object such that . The presence in both theories of functional predicates which are not sets makes it useful to allow the notation both for sets and for important functional predicates. As long as one does not quantify over functions in the latter sense, all such uses are in principle eliminable.

Outside of formal set theory, we usually specify a function in terms of its domain and codomain, as in the phrase "Let be a function". The domain of a function is just its domain as a relation, but we have not yet defined the codomain of a function. To do this we introduce the terminology that a function is from to if its domain equals and its range is contained in . In this way, every function is a function from its domain to its range, and a function from to is also a function from to for any set containing .

Indeed, no matter which set we consider to be the codomain of a function, the function does not change as a set since by definition it is just a set of ordered pairs. That is, a function does not determine its codomain by our definition. If one finds this unappealing then one can instead define a function as the ordered pair , where is a functional relation and is its codomain, but we do not take this approach in this article (more elegantly, if one first defines ordered triples - for example as - then one could define a function as the ordered triple so as to also include the domain). Note that the same issue exists for relations: outside of formal set theory we usually say "Let be a binary relation", but formally is a set of ordered pairs such that and .

In NFU, has the same type as , and is three types higher than (one type higher, if a type-level ordered pair is used). To solve this problem, one could define as for any set , but this is more conveniently written as . Then, if is a set and is any functional relation, the Axiom of Replacement assures that is a set in ZFC. In NFU, and now have the same type, and is two types higher than (the same type, if a type-level ordered pair is used).

The function such that is not a set in ZFC because it is "too large". is however a set in NFU. The function (predicate) such that is neither a function nor a set in either theory; in ZFC, this is true because such a set would be too large, and, in NFU, this is true because its definition would not be stratified. Moreover, can be proved not to exist in NFU (see the resolution of Cantor's paradox in New Foundations.)

Operations on functions

Let and be arbitrary functions. The composition of and , , is defined as the relative product , but only if this results in a function such that is also a function, with , if the range of is a subset of the domain of . The inverse of , , is defined as the converse of if this is a function. Given any set , the identity function is the set , and this is a set in both ZFC and NFU for different reasons.

Special kinds of function

A function is an injective (also called one-to-one) if it has an inverse function.

A function from to is a:

  • Injection from to if the images under of distinct members of are distinct members of .
  • Surjection from to if the range of is .
  • Bijection from to if is both an injection and a surjection.

Defining functions as ordered pairs or ordered triples has the advantages that we do not have to introduce the terminology of being a function "from to ", and that we can speak of "being surjective" outright as opposed to only being able to speak of "being surjective onto ".

Size of sets

In both ZFC and NFU, two sets A and B are the same size (or are equinumerous) if and only if there is a bijection f from A to B. This can be written as , but note that (for the moment) this expresses a relation between A and B rather than a relation between yet-undefined objects and . Denote this relation by in contexts such as the actual definition of the cardinals where even the appearance of presupposing abstract cardinals should be avoided.

Similarly, define as holding if and only if there is an injection from A to B.

It is straightforward to show that the relation of equinumerousness is an equivalence relation: equinumerousness of A with A is witnessed by ; if f witnesses , then witnesses ; and if f witnesses and g witnesses , then witnesses .

It can be shown that is a linear order on abstract cardinals, but not on sets. Reflexivity is obvious and transitivity is proven just as for equinumerousness. The Schröder–Bernstein theorem, provable in ZFC and NFU in an entirely standard way, establishes that

(this establishes antisymmetry on cardinals), and

follows in a standard way in either theory from the axiom of choice.

Finite sets and natural numbers

Natural numbers can be considered either as finite ordinals or finite cardinals. Here consider them as finite cardinal numbers. This is the first place where a major difference between the implementations in ZFC and NFU becomes evident.

The Axiom of Infinity of ZFC tells us that there is a set A which contains and contains for each . This set A is not uniquely determined (it can be made larger while preserving this closure property): the set N of natural numbers is

which is the intersection of all sets which contain the empty set and are closed under the "successor" operation .

In ZFC, a set is finite if and only if there is such that : further, define as this n for finite A. (It can be proved that no two distinct natural numbers are the same size).

The usual operations of arithmetic can be defined recursively and in a style very similar to that in which the set of natural numbers itself is defined. For example, + (the addition operation on natural numbers) can be defined as the smallest set which contains for each natural number and contains whenever it contains .

In NFU, it is not obvious that this approach can be used, since the successor operation is unstratified and so the set N as defined above cannot be shown to exist in NFU (it is consistent for the set of finite von Neumann ordinals to exist in NFU, but this strengthens the theory, as the existence of this set implies the Axiom of Counting (for which see below or the New Foundations article)).

The standard definition of the natural numbers, which is actually the oldest set-theoretic definition of natural numbers, is as equivalence classes of finite sets under equinumerousness. Essentially the same definition is appropriate to NFU (this is not the usual definition, but the results are the same): define Fin, the set of finite sets, as

For any set , define as . Define N as the set .

The Axiom of Infinity of NFU can be expressed as : this is enough to establish that each natural number has a nonempty successor (the successor of being for any ) which is the hard part of showing that the Peano axioms of arithmetic are satisfied.

The operations of arithmetic can be defined in a style similar to the style given above (using the definition of successor just given). They can also be defined in a natural set theoretical way: if A and B are disjoint finite sets, define |A|+|B| as . More formally, define m+n for m and n in N as

(But note that this style of definition is feasible for the ZFC numerals as well, but more circuitous: the form of the NFU definition facilitates set manipulations while the form of the ZFC definition facilitates recursive definitions, but either theory supports either style of definition).

The two implementations are quite different. In ZFC, choose a representative of each finite cardinality (the equivalence classes themselves are too large to be sets); in NFU the equivalence classes themselves are sets, and are thus an obvious choice for objects to stand in for the cardinalities. However, the arithmetic of the two theories is identical: the same abstraction is implemented by these two superficially different approaches.

Equivalence relations and partitions

A general technique for implementing abstractions in set theory is the use of equivalence classes. If an equivalence relation R tells us that elements of its field A are alike in some particular respect, then for any set x, regard the set as representing an abstraction from the set x respecting just those features (identify elements of A up to R).

For any set A, a set is a partition of A if all elements of P are nonempty, any two distinct elements of P are disjoint, and .

For every equivalence relation R with field A, is a partition of A. Moreover, each partition P of A determines an equivalence relation .

This technique has limitations in both ZFC and NFU. In ZFC, since the universe is not a set, it seems possible to abstract features only from elements of small domains. This can be circumvented using a trick due to Dana Scott: if R is an equivalence relation on the universe, define as the set of all y such that and the rank of y is less than or equal to the rank of any . This works because the ranks are sets. Of course, there still may be a proper class of 's. In NFU, the main difficulty is that is one type higher than x, so for example the "map" is not in general a (set) function (though is a set). This can be circumvented by the use of the Axiom of Choice to select a representative from each equivalence class to replace , which will be at the same type as x, or by choosing a canonical representative if there is a way to do this without invoking Choice (the use of representatives is hardly unknown in ZFC, either). In NFU, the use of equivalence class constructions to abstract properties of general sets is more common, as for example in the definitions of cardinal and ordinal number below.

Ordinal numbers

Two well-orderings and are similar and write just in case there is a bijection f from the field of to the field of such that for all x and y.

Similarity is shown to be an equivalence relation in much the same way that equinumerousness was shown to be an equivalence relation above.

In New Foundations (NFU), the order type of a well-ordering W is the set of all well-orderings which are similar to W. The set of ordinal numbers is the set of all order types of well-orderings.

This does not work in ZFC, because the equivalence classes are too large. It would be formally possible to use Scott's trick to define the ordinals in essentially the same way, but a device of von Neumann is more commonly used.

For any partial order , the corresponding strict partial order < is defined as . Strict linear orders and strict well-orderings are defined similarly.

A set A is said to be transitive if : each element of an element of A is also an element of A. A (von Neumann) ordinal is a transitive set on which membership is a strict well-ordering.

In ZFC, the order type of a well-ordering W is then defined as the unique von Neumann ordinal which is equinumerous with the field of W and membership on which is isomorphic to the strict well-ordering associated with W. (the equinumerousness condition distinguishes between well-orderings with fields of size 0 and 1, whose associated strict well-orderings are indistinguishable).

In ZFC there cannot be a set of all ordinals. In fact, the von Neumann ordinals are an inconsistent totality in any set theory: it can be shown with modest set theoretical assumptions that every element of a von Neumann ordinal is a von Neumann ordinal and the von Neumann ordinals are strictly well-ordered by membership. It follows that the class of von Neumann ordinals would be a von Neumann ordinal if it were a set: but it would then be an element of itself, which contradicts the fact that membership is a strict well-ordering of the von Neumann ordinals.

The existence of order types for all well-orderings is not a theorem of Zermelo set theory: it requires the Axiom of replacement. Even Scott's trick cannot be used in Zermelo set theory without an additional assumption (such as the assumption that every set belongs to a rank which is a set, which does not essentially strengthen Zermelo set theory but is not a theorem of that theory).

In NFU, the collection of all ordinals is a set by stratified comprehension. The Burali-Forti paradox is evaded in an unexpected way. There is a natural order on the ordinals defined by if and only if some (and so any) is similar to an initial segment of some (and so any) . Further, it can be shown that this natural order is a well-ordering of the ordinals and so must have an order type . It would seem that the order type of the ordinals less than with the natural order would be , contradicting the fact that is the order type of the entire natural order on the ordinals (and so not of any of its proper initial segments). But this relies on one's intuition (correct in ZFC) that the order type of the natural order on the ordinals less than is for any ordinal . This assertion is unstratified, because the type of the second is four higher than the type of the first (two higher if a type level pair is used). The assertion which is true and provable in NFU is that the order type of the natural order on the ordinals less than is for any ordinal , where is the order type of for any (it is easy to show that this does not depend on the choice of W; note that T raises type by one). Thus the order type of the ordinals less than with the natural order is , and . All uses of here can be replaced with if a type-level pair is used.

This shows that the T operation is nontrivial, which has a number of consequences. It follows immediately that the singleton map is not a set, as otherwise restrictions of this map would establish the similarity of W and for any well-ordering W. T is (externally) bijective and order-preserving. Because of this, the fact establishes that is a "descending sequence" in the ordinals which cannot be a set.

Ordinals fixed by T are called Cantorian ordinals, and ordinals which dominate only cantorian ordinals (which are easily shown to be cantorian themselves) are said to be strongly cantorian. There can be no set of cantorian ordinals or set of strongly cantorian ordinals.

Digression: von Neumann ordinals in NFU

It is possible to reason about von Neumann ordinals in NFU. Recall that a von Neumann ordinal is a transitive set A such that the restriction of membership to A is a strict well-ordering. This is quite a strong condition in the NFU context, since the membership relation involves a difference of type. A von Neumann ordinal A is not an ordinal in the sense of NFU, but belongs to an ordinal which may be termed the order type of (membership on) A. It is easy to show that the order type of a von Neumann ordinal A is cantorian: for any well-ordering W of order type , the induced well-ordering of initial segments of W by inclusion has order type (it is one type higher, thus the application of T): but the order types of the well-ordering of a von Neumann ordinal A by membership and the well-ordering of its initial segments by inclusion are clearly the same because the two well-orderings are actually the same relation, so the order type of A is fixed under T. Moreover, the same argument applies to any smaller ordinal (which will be the order type of an initial segment of A, also a von Neumann ordinal) so the order type of any von Neumann ordinal is strongly cantorian.

The only von Neumann ordinals which can be shown to exist in NFU without additional assumptions are the concrete finite ones. However, the application of a permutation method can convert any model of NFU to a model in which every strongly cantorian ordinal is the order type of a von Neumann ordinal. This suggests that the concept "strongly cantorian ordinal of NFU" might be a better analogue to "ordinal of ZFC" than is the apparent analogue "ordinal of NFU".

Cardinal numbers

Cardinal numbers are defined in NFU in a way which generalizes the definition of natural number: for any set A, .

In ZFC, these equivalence classes are too large as usual. Scott's trick could be used (and indeed is used in ZF), is usually defined as the smallest order type (here a von Neumann ordinal) of a well-ordering of A (that every set can be well-ordered follows from the Axiom of Choice in the usual way in both theories).

The natural order on cardinal numbers is seen to be a well-ordering: that it is reflexive, antisymmetric (on abstract cardinals, which are now available) and transitive has been shown above. That it is a linear order follows from the Axiom of Choice: well-order two sets and an initial segment of one well-ordering will be isomorphic to the other, so one set will have cardinality smaller than that of the other. That it is a well-ordering follows from the Axiom of Choice in a similar way.

With each infinite cardinal, many order types are associated for the usual reasons (in either set theory).

Cantor's theorem shows (in both theories) that there are nontrivial distinctions between infinite cardinal numbers. In ZFC, one proves In NFU, the usual form of Cantor's theorem is false (consider the case A=V), but Cantor's theorem is an ill-typed statement. The correct form of the theorem in NFU is , where is the set of one-element subsets of A. shows that there are "fewer" singletons than sets (the obvious bijection from to V has already been seen not to be a set). It is actually provable in NFU + Choice that (where signals the existence of many intervening cardinals; there are many, many urelements!). Define a type-raising T operation on cardinals analogous to the T operation on ordinals: ; this is an external endomorphism of the cardinals just as the T operation on ordinals is an external endomorphism of the ordinals.

A set A is said to be cantorian just in case ; the cardinal is also said to be a cantorian cardinal. A set A is said to be strongly cantorian (and its cardinal to be strongly cantorian as well) just in case the restriction of the singleton map to A () is a set. Well-orderings of strongly cantorian sets are always strongly cantorian ordinals; this is not always true of well-orderings of cantorian sets (though the shortest well-ordering of a cantorian set will be cantorian). A cantorian set is a set which satisfies the usual form of Cantor's theorem.

The operations of cardinal arithmetic are defined in a set-theoretically motivated way in both theories. . One would like to define as , and one does this in ZFC, but there is an obstruction in NFU when using the Kuratowski pair: one defines as because of the type displacement of 2 between the pair and its projections, which implies a type displacement of two between a cartesian product and its factors. It is straightforward to prove that the product always exists (but requires attention because the inverse of T is not total).

Defining the exponential operation on cardinals requires T in an essential way: if was defined as the collection of functions from A to B, this is three types higher than A or B, so it is reasonable to define as so that it is the same type as A or B ( replaces with type-level pairs). An effect of this is that the exponential operation is partial: for example, is undefined. In ZFC one defines as without difficulty.

The exponential operation is total and behaves exactly as expected on cantorian cardinals, since T fixes such cardinals and it is easy to show that a function space between cantorian sets is cantorian (as are power sets, cartesian products, and other usual type constructors). This offers further encouragement to the view that the "standard" cardinalities in NFU are the cantorian (indeed, the strongly cantorian) cardinalities, just as the "standard" ordinals seem to be the strongly cantorian ordinals.

Now the usual theorems of cardinal arithmetic with the axiom of choice can be proved, including . From the case the existence of a type level ordered pair can be derived: is equal to just in case , which would be witnessed by a one-to-one correspondence between Kuratowski pairs and double singletons : redefine as the c such that is associated with the Kuratowski : this is a type-level notion of ordered pair.

The Axiom of Counting and subversion of stratification

So there are two different implementations of the natural numbers in NFU (though they are the same in ZFC): finite ordinals and finite cardinals. Each of these supports a T operation in NFU (basically the same operation). It is easy to prove that is a natural number if n is a natural number in NFU + Infinity + Choice (and so and the first infinite ordinal are cantorian) but it is not possible to prove in this theory that . However, common sense indicates that this should be true, and so it can be adopted as an axiom:

  • Rosser's Axiom of Counting: For each natural number n, .

One natural consequence of this axiom (and indeed its original formulation) is

  • for each natural number n.

All that can be proved in NFU without Counting is .

A consequence of Counting is that N is a strongly cantorian set (again, this is an equivalent assertion).

Properties of strongly cantorian sets

The type of any variable restricted to a strongly cantorian set A can be raised or lowered as desired by replacing references to with references to (type of a raised; this presupposes that it is known that a is a set; otherwise one must say "the element of " to get this effect) or (type of a lowered) where for all , so it is not necessary to assign types to such variables for purposes of stratification.

Any subset of a strongly cantorian set is strongly cantorian. The power set of a strongly cantorian set is strongly cantorian. The cartesian product of two strongly cantorian sets is strongly cantorian.

Introducing the Axiom of Counting means that types need not be assigned to variables restricted to N or to P(N), R (the set of reals) or indeed any set ever considered in classical mathematics outside of set theory.

There are no analogous phenomena in ZFC. See the main New Foundations article for stronger axioms that can be adjoined to NFU to enforce "standard" behavior of familiar mathematical objects.

Familiar number systems: positive rationals, magnitudes, and reals

Represent positive fractions as pairs of positive natural numbers (0 is excluded): is represented by the pair . To make , introduce the relation defined by . It is provable that this is an equivalence relation: define positive rational numbers as equivalence classes of pairs of positive natural numbers under this relation. Arithmetic operations on positive rational numbers and the order relation on positive rationals are defined just as in elementary school and proved (with some effort) to have the expected properties.

Represent magnitudes (positive reals) as nonempty proper initial segments of the positive rationals with no largest element. The operations of addition and multiplication on magnitudes are implemented by elementwise addition of the positive rational elements of the magnitudes. Order is implemented as set inclusion.

Represent real numbers as differences of magnitudes: formally speaking, a real number is an equivalence class of pairs of magnitudes under the equivalence relation defined by . The operations of addition and multiplication on real numbers are defined just as one would expect from the algebraic rules for adding and multiplying differences. The treatment of order is also as in elementary algebra.

This is the briefest sketch of the constructions. Note that the constructions are exactly the same in ZFC and in NFU, except for the difference in the constructions of the natural numbers: since all variables are restricted to strongly cantorian sets, there is no need to worry about stratification restrictions. Without the Axiom of Counting, it might be necessary to introduce some applications of T in a full discussion of these constructions.

Operations on indexed families of sets

In this class of constructions it appears that ZFC has an advantage over NFU: though the constructions are clearly feasible in NFU, they are more complicated than in ZFC for reasons having to do with stratification.

Throughout this section assume a type-level ordered pair. Define as . The definition of the general n-tuple using the Kuratowski pair is trickier, as one needs to keep the types of all the projections the same, and the type displacement between the n-tuple and its projections increases as n increases. Here, the n-tuple has the same type as each of its projections.

General cartesian products are defined similarly:

The definitions are the same in ZFC but without any worries about stratification (the grouping given here is opposite to that more usually used, but this is easily corrected for).

Now consider the infinite cartesian product . In ZFC, this is defined as the set of all functions f with domain I such that (where A is implicitly understood as a function taking each i to ).

In NFU, this is requires attention to type. Given a set I and set valued function A whose value at in is written , Define as the set of all functions f with domain I such that : notice that is stratified because of our convention that A is a function with values at singletons of the indices. Note that the very largest families of sets (which cannot be indexed by sets of singletons) will not have cartesian products under this definition. Note further that the sets are at the same type as the index set I (since one type higher than its elements); the product, as a set of functions with domain I (so at the same type as I) is one type higher (assuming a type-level ordered pair).

Now consider the product of the cardinals of these sets. The cardinality || is one type higher than the cardinals , so the correct definition of the infinite product of cardinals is (because the inverse of T is not total, it is possible that this may not exist).

Repeat this for disjoint unions of families of sets and sums of families of cardinals. Again, let A be a set-valued function with domain : write for . The disjoint union is the set . This set is at the same type as the sets .

The correct definition of the sum is thus , since there is no type displacement.

It is possible to extend these definitions to handle index sets which are not sets of singletons, but this introduces an additional type level and is not needed for most purposes.

In ZFC, define the disjoint union as , where abbreviates .

Permutation methods can be used to show relative consistency with NFU of the assertion that for every strongly cantorian set A there is a set I of the same size whose elements are self-singletons: for each i in I.

The cumulative hierarchy

In ZFC, define the cumulative hierarchy as the ordinal-indexed sequence of sets satisfying the following conditions: ; ; for limit ordinals . This is an example of a construction by transfinite recursion. The rank of a set A is said to be if and only if . The existence of the ranks as sets depends on the axiom of replacement at each limit step (the hierarchy cannot be constructed in Zermelo set theory); by the axiom of foundation, every set belongs to some rank.

The cardinal is called .

This construction cannot be carried out in NFU because the power set operation is not a set function in NFU ( is one type higher than A for purposes of stratification).

The sequence of cardinals can be implemented in NFU. Recall that is defined as , where is a convenient set of size 2, and . Let be the smallest set of cardinals which contains (the cardinality of the set of natural numbers), contains the cardinal whenever it contains , and which is closed under suprema of sets of cardinals.

A convention for ordinal indexing of any well-ordering is defined as the element x of the field of such that the order type of the restriction of to is ; then define as the element with index in the natural order on the elements of . The cardinal is the element with index in the natural order on all infinite cardinals (which is a well-ordering, see above). Note that follows immediately from this definition. In all these constructions, notice that the type of the index is two higher (with type-level ordered pair) than the type of .

Each set A of ZFC has a transitive closure (the intersection of all transitive sets which contains A). By the axiom of foundation, the restriction of the membership relation to the transitive closure of A is a well-founded relation. The relation is either empty or has A as its top element, so this relation is a set picture. It can be proved in ZFC that every set picture is isomorphic to some .

This suggests that (an initial segment of) the cumulative hierarchy can be studied by considering the isomorphism classes of set pictures. These isomorphism classes are sets and make up a set in NFU. There is a natural set relation analogous to membership on isomorphism classes of set pictures: if is a set picture, write for its isomorphism class and define as holding if is the isomorphism class of the restriction of y to the downward closure of one of the elements of the preimage under y of the top element of y. The relation E is a set relation, and it is straightforward to prove that it is well-founded and extensional. If the definition of E is confusing, it can be deduced from the observation that it is induced by precisely the relationship which holds between the set picture associated with A and the set picture associated with B when in the usual set theory.

There is a T operation on isomorphism classes of set pictures analogous to the T operation on ordinals: if x is a set picture, so is . Define as . It is easy to see that .

An axiom of extensionality for this simulated set theory follows from E's extensionality. From its well-foundedness follows an axiom of foundation. There remains the question of what comprehension axiom E may have. Consider any collection of set pictures (collection of set pictures whose fields are made up entirely of singletons). Since each is one type higher than x (using a type-level ordered pair), replacing each element of the field of each in the collection with results in a collection of set pictures isomorphic to the original collection but with their fields disjoint. The union of these set pictures with a new top element yields a set picture whose isomorphism type will have as its preimages under E exactly the elements of the original collection. That is, for any collection of isomorphism types , there is an isomorphism type whose preimage under E is exactly this collection.

In particular, there will be an isomorphism type [v] whose preimage under E is the collection of all T[x]'s (including T[v]). Since T[v] E v and E is well-founded, . This resembles the resolution of the Burali–Forti paradox discussed above and in the New Foundations article, and is in fact the local resolution of Mirimanoff's paradox of the set of all well-founded sets.

There are ranks of isomorphism classes of set pictures just as there are ranks of sets in the usual set theory. For any collection of set pictures A, define S(A) as the set of all isomorphism classes of set pictures whose preimage under E is a subset of A; call A a "complete" set if every subset of A is a preimage under E. The collection of "ranks" is the smallest collection containing the empty set and closed under the S operation (which is a kind of power set construction) and under unions of its subcollections. It is straightforward to prove (much as in the usual set theory) that the ranks are well-ordered by inclusion, and so the ranks have an index in this well-order: refer to the rank with index as . It is provable that for complete ranks . The union of the complete ranks (which will be the first incomplete rank) with the relation E looks like an initial segment of the universe of Zermelo-style set theory (not necessarily like the full universe of ZFC because it may not be large enough). It is provable that if is the first incomplete rank, then is a complete rank and thus . So there is a "rank of the cumulative hierarchy" with an "external automorphism" T moving the rank downward, exactly the condition on a nonstandard model of a rank in the cumulative hierarchy under which a model of NFU is constructed in the New Foundations article. There are technical details to verify, but there is an interpretation not only of a fragment of ZFC but of NFU itself in this structure, with defined as : this "relation" is not a set relation but has the same type displacement between its arguments as the usual membership relation .

So there is a natural construction inside NFU of the cumulative hierarchy of sets which internalizes the natural construction of a model of NFU in Zermelo-style set theory.

Under the Axiom of Cantorian Sets described in the New Foundations article, the strongly cantorian part of the set of isomorphism classes of set pictures with the E relation as membership becomes a (proper class) model of ZFC (in which there are n-Mahlo cardinals for each n; this extension of NFU is strictly stronger than ZFC). This is a proper class model because the strongly cantorian isomorphism classes do not make up a set.

Permutation methods can be used to create from any model of NFU a model in which every strongly cantorian isomorphism type of set pictures is actually realized as the restriction of the true membership relation to the transitive closure of a set.

Distributive justice

From Wikipedia, the free encyclopedia
 

Distributive justice concerns the socially just allocation of resources. Often contrasted with just process, which is concerned with the administration of law, distributive justice concentrates on outcomes. This subject has been given considerable attention in philosophy and the social sciences.

In social psychology, distributive justice is defined as perceived fairness of how rewards and costs are shared by (distributed across) group members. For example, when some workers work more hours but receive the same pay, group members may feel that distributive justice has not occurred. To determine whether distributive justice has taken place, individuals often turn to the behavioral expectations of their group. If rewards and costs are allocated according to the designated distributive norms of the group, distributive justice has occurred.

Types of distributive norms

Five types of distributive norm are defined by Donelson R. Forsyth:

  1. Equality: Regardless of their inputs, all group members should be given an equal share of the rewards/costs. Equality supports that someone who contributes 20% of the group's resources should receive as much as someone who contributes 60%.
  2. Equity: Members' outcomes should be based upon their inputs. Therefore, an individual who has invested a large amount of input (e.g. time, money, energy) should receive more from the group than someone who has contributed very little. Members of large groups prefer to base allocations of rewards and costs on equity
  3. Power: Those with more authority, status, or control over the group should receive more than those in lower level positions.
  4. Need: Those in greatest needs should be provided with resources needed to meet those needs. These individuals should be given more resources than those who already possess them, regardless of their input.
  5. Responsibility: Group members who have the most should share their resources with those who have less.

Theories of distributive justice

To create a list of the theories of distributive justice will inevitably come with its implications. It is important to take into consideration the various nuances within each theory, as well as the development and variations in interpretations that exist for the theories presented in this article. The listed theories below are three of the most prominent Anglo-American theories within the field. With this in mind, the list is in no way to be considered exhaustive for distributive justice theory.

Justice as fairness

In his book A Theory of Justice, John Rawls outlines his famous theory about justice as fairness. The theory consists of three core components:

  1. the equality of people in rights and liberties;
  2. the equality of opportunities for all; and
  3. an arrangement of economic inequalities focused on benefit maximisation for those who are least advantaged.

The just 'basic structure'

Building a modern view on social contract theory, Rawls bases his work on an idea of justice being rooted in the basic structure, constituting the fundamental rules in society, which shape the social and economic institutions, as well as the governance. This basic structure is what shapes the citizens’ life opportunities. According to Rawls, the structure is based on principles about basic rights and duties that any self-interested, rational individual would accept in order to further his/her own interests in a context of social cooperation.

The original position

Rawls presents the concept of an original position as a hypothetical idea of how to establish "a fair procedure so that any principles agreed on will be just." In his envisioning of the original position, it is created from a judgement made through negotiations between a group of people who will decide on what a just distribution of primary goods is (according to Rawls, the primary goods include freedoms, opportunities, and control over resources). These people are assumed to be guided by self-interest, while also having a basic idea of morality and justice, and thus capable of understanding and evaluating a moral argument. Rawls then argues that procedural justice in the process of negotiation will be possible via a nullification of temptations for these people to exploit circumstances so as to favor their own position in society.

Veil of ignorance

This nullification of temptations is realised through a veil of ignorance, which these people will be behind. The veil prevents the people from knowing what particular preferences they will have by concealing their talents, objectives, and, most importantly, where in society they themselves will end up. The veil, on the other hand, does not conceal general information about the society, and the people are assumed to possess societal and economic knowledge beyond the personal level. Thereby, such veil creates an environment for negotiations where the evaluation of the distribution of goods is based on general considerations, regardless of place in society, rather than biased considerations based on personal gains for specific citizen positions. By this logic, the negotiations will be sensitive to both those who are worst off, given that a risk of being in that category yourself will incentivize protection of these people, but also the rest of society, as one would not wish to hinder maximal utilisation for these in case you would end up in higher classes.

Basic principles of a just distribution

In this original position, the main concern will be to secure the goods that are most essential for pursuing the goals of each individual, regardless of what this specific goal might be. With this in mind, Rawls theorizes two basic principles of just distribution.

The first principle, the liberty principle, is the equal access to basic rights and liberties for all. With this, each person should be able to access the most extensive set of liberties that is compatible with similar schemes of access by other citizens. Thereby, it is not only a question of positive individual access but also of negative restrictions so as to respect others’ basic rights and liberties.

The second principle, the difference principle, addresses how the arrangement of social and economic inequalities, and thus the just distribution should look. Firstly, Rawls argues that such distribution should be based on a reasonable expectation of advantage for all, but also to the greatest benefit of the least advantaged in society. Secondly, the offices and positions attached to this arrangement should be open to all.

These principles of justice are then prioritised according to two additional principles:

  1. the principles of the priority of liberty, wherein basic liberties only can be restricted if this is done for the sake of protecting liberty either:
    1. by strengthening “the total system of liberties shared by all;” or
    2. if a less than equal liberty is acceptable to those who are subject to this same lesser liberty.
  2. inequality of opportunity, and the priority of efficiency & welfare, can only be acceptable if:
    1. it enhances “the opportunities of those with lesser opportunities” in society; and/or
    2. excessive saving either balances out or lessens the gravity of hardship for those who do not traditionally benefit.

Utilitarianism

In 1789, Jeremy Bentham published his book An Introduction to the Principles of Morals and Legislation. Centred on individual utility and welfare, utilitarianism builds on the notion that any action which increases the overall welfare in society is good, and any action that decreases welfare is bad. By this notion, utilitarianism's focus lies with its outcomes and pays little attention to how these outcomes are shaped. This idea of utilisation maximisation, while being a much broader philosophical consideration, also translates into a theory of justice.

Conceptualising welfare

While the basic notion that utilitarianism builds on seems simple, one major dispute within the school of utilitarianism revolved around the conceptualisation and measurement of welfare. With disputes over this fundamental aspect, utilitarianism is evidently a broad term embracing many different sub-theories under its umbrella, and while much of the theoretical framework transects across these conceptualisations, using the different conceptualisation have clear implications for how we understand the more practical side of utilitarianism in distributive justice.

Bentham originally conceptualised this according to the hedonistic calculus, which also became the foundation for John Stuart Mill's focus on intellectual pleasures as the most beneficial contribution to societal welfare. Another path has been painted by Aristotle, based on an attempt to create a more universal list of conditions required for human prosperity. Opposite this, another path focuses on a subjective evaluation of happiness and satisfaction in human lives.

Egalitarianism

Based on a fundamental notion of equal worth and moral status of human beings, egalitarianism is concerned with equal treatment of all citizens in both respect and in concern, and in relation to the state as well as one another. Egalitarianism focuses more on the process through which distribution takes place, egalitarianism evaluates the justification for a certain distribution based on how the society and its institutions have been shaped, rather than what the outcome is. Attention is mainly given to ways in which unchosen person circumstances affect and hinder individuals and their life opportunities. As Elizabeth Anderson defines it, "the positive aim of egalitarian justice is...to create a community in which people stand in relation of equality to others."

While much academic work distinguishes between luck egalitarianism and social egalitarianism, Roland Pierik presents a synthesis combining the two branches. In his synthesis, he argues that instead of focusing on compensations for unjust inequalities in society via redistribution of primary goods, egalitarianism scholars should instead, given the fundamental notion upon which the theory is built, strive to create institutions that creates and promotes meaningful equal opportunities from the get-go. Pierik thus moves egalitarianism's otherwise reactive nature by emphasising a need for attention to the development of fundamentally different institutions that would eradicate the need for redistribution and instead focus on the initial equal distribution of opportunities from which people then themselves be able to shape their lives.

Application and outcomes

Outcomes

Distributive justice affects performance when efficiency and productivity are involved. Improving perceptions of justice increases performance. Organizational citizenship behaviors (OCBs) are employee actions in support of the organization that are outside the scope of their job description. Such behaviors depend on the degree to which an organization is perceived to be distributively just. As organizational actions and decisions are perceived as more just, employees are more likely to engage in OCBs. Perceptions of distributive justice are also strongly related to the withdrawal of employees from the organization.

Wealth

Distributive justice considers whether the distribution of goods among the members of society at a given time is subjectively acceptable.

Not all advocates of consequentialist theories are concerned with an equitable society. What unites them is the mutual interest in achieving the best possible results or, in terms of the example above, the best possible distribution of wealth.

Environmental justice

Distributive justice in an environmental context is the equitable distribution of a society's technological and environmental risks, impacts, and benefits. These burdens include exposure to hazardous waste, land appropriation, armed violence, and murder. Distributive justice is an essential principle of environmental justice because there is evidence that shows that these burdens cause health problems, negatively affect quality of life, and drive down property value.

The potential negative social impacts of environmental degradation and regulatory policies have been at the center environmental discussions since the rise of environmental justice. Environmental burdens fall disproportionately upon the Global South, while benefits are primarily accrued to the Global North.

In politics

Distributive justice theory argues that societies have a duty to individuals in need and that all individuals have a duty to help others in need. Proponents of distributive justice link it to human rights. Many governments are known for dealing with issues of distributive justice, especially in countries with ethnic tensions and geographically distinctive minorities. Post-apartheid South Africa is an example of a country that deals with issues of re-allocating resources with respect to the distributive justice framework.

Influenced figures

Distributive justice is also fundamental to the Catholic Church's social teaching, inspiring such figures as Dorothy Day and Pope John Paul II.

Hayek's criticism

Within the context of Western liberal democracies in the post-WWII decades, Friedrich von Hayek was one of the most famous opposers of the idea of distributive justice. For him, social and distributive justice were meaningless and impossible to attain, on the grounds of being within a system where the outcomes are not determined deliberately by the people but contrarily spontaneity is the norm. Therefore, distributive justice, redistribution of wealth, and the demands for social justice in a society ruled by an impersonal process such as the market are in this sense incompatible with that system.

In his famous book Road to Serfdom, there can be found considerations about social assistance from the state. There, in talking about the importance of a restrictive kind of security (the one against physical privation) in front of one that necessarily needs to control or abolish the market, Hayek poses that "there can be no doubt that some minimum of food, shelter, and clothing, sufficient to preserve health and the capacity to work, can be assured to everybody". Providing this type of security is for Hayek compatible with individual freedom as it does not involve planning. But already in this early work, he acknowledges the fact that this provision must keep the incentives and the external pressure going and not select which group enjoys security and which does not, for under these conditions "the striving for security tends to become higher than the love of freedom". Therefore, fostering a certain kind of security (the one that for him socialist economic policies follow) can entail growing insecurity as the privilege increases social differences. Notwithstanding, he concludes that "adequate security against severe privation, and the reduction of the avoidable causes of misdirected effort and consequent disappointment, will have to be one of the main goals of policy".

Despite that vague social awareness (also a sign of the times as WWII's devastating consequences were obvious), Hayek dismisses an organizational view that ascribes certain outcomes to an intentional design, which would be contrary to his worshipped spontaneous order. For this, Hayek famously firstly regards the term social (or distributive) justice as meaningless when it is applied to the results of a liberal market system that should yield spontaneous outcomes. Justice has an individual component for Hayek, is only understood in the aggregation of individual actions which follow common rules, social and distributive justice are the negative opposite as they need a command economy. Secondly, following Tebble's (2009) view, the concept of social justice is for Hayek a reminiscence of an atavistic view towards society, that has been overcome by the survival capacity of the catallactic order and its values.

The third Hayekian critique is about the unfeasibility of attaining distributive justice in a free market order and this is defended on the basis of the determinate goal that all distributive justice aims to. In a catallactic order, the individual morality should freely determine what are distributive fairness and the values that govern economic activity, and the fact that it is impossible to gather all the individual information in a single pursuit for social and distributive justice results in realizing the fact that it cannot be pursued. Lastly, Hayek claims for the incompatibility between the free market and social justice, for, in essence, they are different kinds of inequalities. The former is one determined by the interaction of free individuals and the latter by the decision of an authority. Hayek will, on ethical grounds, choose the former.

Libertarian perspective

One of the major exponents of the libertarian outlook toward distributive justice is Robert Nozick. In his book Anarchy, State and Utopia he stresses how the term distributive justice is not a neutral one. In fact, there is no central distributor that can be regarded as such. What each person gets, he or she gets from the outcomes of Lockean self-ownership (a condition that implies one's labor mixed with the world), or others who give to him in exchange for something, or as a gift. For him, "there is no more a distributing or distribution of shares than there is a distribution of mates in a society in which persons choose whom they shall marry". This means that there can be no pattern to which to conform or aim. The market and the result of individual actions provided the conditions for libertarian principles of just acquisition and exchange (contained in his Entitlement Theory) will have as a result a distribution that will be just, without the need for considerations about the specific model or standard it should follow.

Liquefied petroleum gas

From Wikipedia, the free encyclopedia ...