A Functional Hitchhiker’s Guide to Hereditarily Finite Sets, Ackermann Encodings and Pairing Functions

Unpublished draft

Paul Tarau
Department of Computer Science and Engineering
University of North Texas
tarau@cs.unt.edu

Abstract

The paper is organized as a self-contained literate Haskell program that implements elements of an executable finite set theory with focus on combinatorial generation and arithmetic encodings. The code, tested under GHC 6.6.1, is available at http://logic.csci.unt.edu/tarau/research/2008/fSET.zip.

We introduce ranking and unranking functions generalizing Ackermann’s encoding to the universe of Hereditarily Finite Sets with Urelements. Then we build a lazy enumerator for Hereditarily Finite Sets with Urelements that matches the unranking function provided by the inverse of Ackermann’s encoding and we describe functors between them resulting in arithmetic encodings for powersets, hypergraphs, ordinals and choice functions. After implementing a digraph representation of Hereditarily Finite Sets we define decoration functions that can recover well-founded sets from encodings of their associated acyclic digraphs. We conclude with an encoding of arbitrary digraphs and discuss a concept of duality induced by the set membership relation.

Keywords: hereditarily finite sets, ranking and unranking functions, executable set theory, arithmetic encodings, Haskell data representations, functional programming and computational mathematics

1. Introduction

While the Universe of Hereditarily Finite Sets is best known as a model of the Zermelo-Fraenkel Set theory with the Axiom of Infinity replaced by its negation (Takahashi 1976; Meir et al.
1983), it has been the object of renewed practical interest in various fields, from representing structured data in databases (Leontjev and Sazonov 2000) to reasoning with sets and set constraints in a Logic Programming framework (Dovier et al. 2000; Piazza and Policriti 2004; Dovier et al. 2001).

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. WXYZ ’08 date, City. Copyright © 2008 ACM [to be supplied]. . . $5.00

The Universe of Hereditarily Finite Sets is built from the empty set (or a set of Urelements) by successively applying powerset and set union operations. A surprising bijection, discovered by Wilhelm Ackermann in 1937 (Ackermann 1937; Abian and Lamacchia 1978; Kaye and Wong 2007) from Hereditarily Finite Sets to Natural Numbers, was the original trigger for our work on building, in a mathematically elegant programming language, a concise and executable hereditarily finite set theory. The arbitrary size of the data objects brought in the need for arbitrary length integers. The focus on potentially infinite enumerations brought in the need for lazy evaluation. These have made Haskell a natural choice.

We will describe our constructs in a subset of Haskell (Peyton Jones 2002, 2003a,b) seen as a concrete syntax for a generic lambda calculus based functional language (see footnote 1). We will only make the assumptions that non-strict functions (higher order included), with call-by-need evaluation and arbitrary length integers are available in the language.
While our code will conform to Haskell’s type system, we will do that without any type declarations, by ensuring that the types of our functions are all inferred. This increases the chances that the code can be ported, through simple syntax transformations, to any programming language that implements our basic assumptions.

Footnote 1: As a courtesy to the reader wondering about the title, the author confesses being a hitchhiker in the world of functional programming, coming from the not so distant galaxy of logic programming but still confused by recent hitchhiking trips in the exotic worlds of logic synthesis, foundations of mathematics, natural language processing, conversational agents and virtual reality. And not being afraid to go boldly where . . . a few others have already been before.

The paper is organized as follows: section 2 introduces the reader to combinatorial generation with the help of a bitstring example, section 3 introduces Ackermann’s encoding in the more general case when urelements are present and shows an encoding for hypergraphs as a particular case. Section 4 gives examples of transporting common set and natural number operations from one side to the other. After discussing some classic pairing functions, section 5 introduces a new pairing/unpairing on natural numbers. Section 6 discusses graph representations and decoration functions on Hereditarily Finite Sets (6.1), and provides encodings for directed graphs (6.3). Sections 7 and 8 discuss related work, future work and conclusions.

2. What’s in a Bit?

Let us observe first that the well known bitstring representation of Natural Numbers (see to_rbits and from_rbits in the Appendix and notice the reversed bit order) is a first hint at their genuinely polymorphic, “shapeshifting” nature:

    to_rbits 2008
    [0,0,0,1,1,0,1,1,1,1,1]
    from_rbits [0,0,0,1,1,0,1,1,1,1,1]
    2008

The effect is trivial here: these transformers turn a number into a list of bits and back.
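The two transformers are only listed in the Appendix, so the following is a minimal sketch of what such a pair could look like. The names to_rbits_sketch and from_rbits_sketch are hypothetical stand-ins rather than the paper's definitions, and the type signatures are added only for readability (the paper's own code relies on inferred types).

```haskell
-- Hypothetical stand-ins for the Appendix helpers to_rbits and from_rbits;
-- the paper's own definitions may differ.

-- Bits of a natural number, least significant bit first.
to_rbits_sketch :: Integer -> [Integer]
to_rbits_sketch 0 = []
to_rbits_sketch n = n `mod` 2 : to_rbits_sketch (n `div` 2)

-- Inverse direction: the bit at list position i contributes b * 2^i.
from_rbits_sketch :: [Integer] -> Integer
from_rbits_sketch = foldr (\b acc -> b + 2 * acc) 0
```

Under these assumptions, to_rbits_sketch 2008 reproduces the bit list shown in the text, and from_rbits_sketch maps that list back to 2008.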
One step further, we will now define two one-argument functions that implement the “bits” o and i:

    o x = 2*x+0
    i x = 2*x+1

One can recognize now that 2008 is just the result of composing “bits”, with a result similar to the result of from_rbits:

    (o.o.o.i.i.o.i.i.i.i.i) 0
    2008

The reader will notice that we have just “shapeshifted” to yet another view: a number is now a composition of bits, seen as transformers, where each bit does its share by left-shifting the string one position and then adding its contribution to it. Note the analogy with Church numerals, which represent numbers as iterations of function application, except that here n will only need O(log2(n)) space.

As with the usual bitstring representation, the dominant digit is always 1 and zeros after that have no effect, from which we can infer that the mapping between such bitstrings and numbers is not one-to-one. A variant of the 2-adic bijective numeral representation fixes this, and shows one of the simplest bijective mappings from natural numbers to bitstrings (i.e. the regular language {0,1}*):

    nat2bits = drop_last . to_rbits . succ where
      drop_last bs = genericTake (l-1) bs where
        l = genericLength bs

    bits2nat bs = (from_rbits (bs ++ [1]))-1

    nat2bits 42
    [1,1,0,1,0]
    bits2nat it
    42
    map nat2bits [0..15]
    [[],[0],[1],[0,0],[1,0],[0,1],[1,1],
     [0,0,0],[1,0,0],[0,1,0],[1,1,0],
     [0,0,1],[1,0,1],[0,1,1],[1,1,1],
     [0,0,0,0]]

The last example suggests that we are now able to generate the infinite stream of all possible bitstrings simply as:

    all_bitstrings = map nat2bits [0..]

We will now hitchhike with this design pattern in our toolbox to a more interesting universe.

3. Hereditarily Finite Sets and the Ackermann Encoding

The Universe of Hereditarily Finite Sets (HFS) is built from the empty set (or a set of Urelements) by successively applying powerset and set union operations. Assuming HFS extended with Urelements (i.e.
objects not having any elements), the following data type defines a recursive “rose tree” for Hereditarily Finite Sets:

    data HFS t = U t | S [HFS t] deriving (Show, Eq)

We will assume that Urelements are represented as Natural Numbers in [0..ulimit-1]. The constructor U t marks Urelements of type t (usually the arbitrary length Integer type in Haskell) and the constructor S marks a list of recursively built HFS type elements. Note that if no elements are used with the U constructor, we obtain the “pure” HFS universe by representing the empty set as S [].

3.1 Ackermann’s Encoding

A surprising bijection, discovered by Wilhelm Ackermann in 1937 (Ackermann 1937; Abian and Lamacchia 1978; Kaye and Wong 2007) maps Hereditarily Finite Sets (HFS) to Natural Numbers (Nat):

    \[ f(x) = \text{if } x = \{\} \text{ then } 0 \text{ else } \sum_{a \in x} 2^{f(a)} \]

Let us note that Ackermann’s encoding can be seen as the recursive application of a bijection set2nat from finite subsets of Nat to Nat, that associates to a set of (distinct!) natural numbers a (unique!) natural number.

A simple change to Ackermann’s mapping will accommodate a finite number of Urelements in [0..u−1], as follows:

    \[ f_u(x) = \text{if } x < u \text{ then } x \text{ else } u + \sum_{a \in x} 2^{f_u(a)} \]

PROPOSITION 1. For u ∈ Nat the function f_u is a bijection from HFS with Urelements in [0..u−1] to Nat.

The proof follows from the fact that no sets map to values smaller than u and that Urelements map into themselves.

With this representation, Ackermann’s encoding from HFS with Urelements in [0..ulimit-1] to Nat, hfs2nat, becomes:

    hfs2nat_ _ (U n) = n
    hfs2nat_ ulimit (S es) =
      ulimit + set2nat (map (hfs2nat_ ulimit) es)

    set2nat ns = sum (map (2^) ns)

where set2nat maps a set of exponents of 2 to the associated sum of powers of 2. We can now define

    hfs2nat = hfs2nat_ urelement_limit

    urelement_limit = 0

where the constant urelement_limit controls the initial segment of Nat to be mapped to Urelements.
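To see the generalized encoding at work on concrete values, here is a minimal self-contained sketch that passes the urelement limit explicitly instead of fixing it globally. The definitions of HFS, set2nat and hfs2nat_ follow the text; the type signatures and the two sample sets (pureExample and urelemExample) are assumptions added for illustration.

```haskell
-- Rose-tree type from the text, repeated so the sketch is self-contained.
data HFS t = U t | S [HFS t] deriving (Show, Eq)

-- Sum of powers of 2 for a list of distinct exponents.
set2nat :: [Integer] -> Integer
set2nat ns = sum (map (2^) ns)

-- Ackermann encoding with urelements in [0..ulimit-1].
hfs2nat_ :: Integer -> HFS Integer -> Integer
hfs2nat_ _ (U n) = n
hfs2nat_ ulimit (S es) = ulimit + set2nat (map (hfs2nat_ ulimit) es)

-- Hypothetical sample values, chosen only for illustration.
pureExample :: HFS Integer
pureExample = S [S [], S [S []]]            -- the pure set {{},{{}}}

urelemExample :: HFS Integer
urelemExample = S [U 0, U 1, U 2, S [U 1]]  -- the set {0,1,2,{1}}
```

With ulimit = 0 the pure set {{},{{}}} encodes as 2^0 + 2^1 = 3. With ulimit = 3 the set {0,1,2,{1}} encodes as 3 + 2^0 + 2^1 + 2^2 + 2^5 = 42, because its inner set {1} encodes as 3 + 2^1 = 5.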
Note that to keep our Haskell code as simple as possible we assume that urelement_limit is a global parameter that implicitly fixes the set of Urelements.

To obtain the inverse of the Ackermann encoding, let’s first define the inverse nat2set of the bijection set2nat. It decomposes a natural number into a list of exponents of 2 (seen as bit positions equaling 1 in its bitstring representation, in increasing order).

    nat2set n = nat2right_exps n 0 where
      nat2right_exps 0 _ = []
      nat2right_exps n e = add_rexp (n `mod` 2) e
        (nat2right_exps (n `div` 2) (e+1)) where
          add_rexp 0 _ es = es
          add_rexp 1 e es = (e:es)

    nat2set 42
    [1,3,5]
    set2nat [1,3,5]
    42
    nat2set 2008
    [3,4,6,7,8,9,10]
    set2nat [3,4,6,7,8,9,10]
    2008

The inverse of the (bijective) Ackermann encoding (generalized to work with urelements in [0..ulimit-1]) is defined as follows:

    nat2hfs_ ulimit n | n<ulimit = U n
    nat2hfs_ ulimit n =
      S (map (nat2hfs_ ulimit) (nat2set (n-ulimit)))

We can now define

    nat2hfs = nat2hfs_ urelement_limit

where, as before, the constant urelement_limit controls the initial segment of Nat to be mapped to Urelements.

As both nat2hfs and hfs2nat are obtained through recursive compositions of nat2set and set2nat, respectively, one can generalize the encoding mechanism by replacing these building blocks with other bijections with similar properties.

One can try out nat2hfs and its inverse hfs2nat and print out a HFS with the setShow function (given in the Appendix). With urelement_limit=0, as defined above, the empty set is S []:

    nat2hfs 42
    S [S [S []],S [S [],S [S []]],S [S [],S [S [S []]]]]
    hfs2nat (nat2hfs 42)
    42
    setShow 42
    "{{{}},{{},{{}}},{{},{{{}}}}}"

Assuming urelement_limit=3 the HFS representation becomes:

    nat2hfs 42
    S [U 0,U 1,U 2,S [U 1]]
    setShow 42
    "{0,1,2,{1}}"

Note that setShow n will build a string representation of n ∈ Nat, “shapeshifted” as a HFS with Urelements.
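The decoding direction can be sketched in the same explicit-limit style. In the block below, nat2set is written with a list comprehension rather than the accumulator used in the text, and showHfs is a hypothetical pretty-printer standing in for the Appendix function setShow, whose actual definition is not shown here and may differ.

```haskell
import Data.List (intercalate)

data HFS t = U t | S [HFS t] deriving (Show, Eq)

-- Positions of the 1 bits of n, in increasing order.
nat2set :: Integer -> [Integer]
nat2set n = [e | (e, b) <- zip [0..] (bits n), b == 1]
  where bits 0 = []
        bits m = m `mod` 2 : bits (m `div` 2)

-- Inverse Ackermann encoding with urelements in [0..ulimit-1].
nat2hfs_ :: Integer -> Integer -> HFS Integer
nat2hfs_ ulimit n
  | n < ulimit = U n
  | otherwise  = S (map (nat2hfs_ ulimit) (nat2set (n - ulimit)))

-- Hypothetical pretty-printer (an assumption, not the Appendix setShow).
showHfs :: HFS Integer -> String
showHfs (U n)  = show n
showHfs (S xs) = "{" ++ intercalate "," (map showHfs xs) ++ "}"
```

For example, showHfs (nat2hfs_ 0 42) gives "{{{}},{{},{{}}},{{},{{{}}}}}" and showHfs (nat2hfs_ 3 42) gives "{0,1,2,{1}}", matching the two string forms quoted in the text.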
Figure 1 shows the directed acyclic graph obtained by merging shared nodes in the rose tree representation of the HFS associated to a natural number (with arrows pointing from sets to their elements).

Figure 1: Hereditarily Finite Set associated to 42

3.2 Combinatorial Generation as Iteration

Using the inverse of Ackermann’s encoding, the infinite stream HFS can be generated simply by iterating over the infinite stream [0..]:

    iterative_hfs_generator = map nat2hfs [0..]

    take 5 iterative_hfs_generator
    [S [],S [S []],S [S [S []]],
     S [S [],S [S []]],S [S [S [S []]]]]

3.3 Generating the Stream of Hereditarily Finite Sets Directly

To fully appreciate the elegance and simplicity of the combinatorial generation mechanism described previously, we will also provide a “hand-crafted” recursive generator for HFS. The reader will notice that this uses some fairly high level Haskell constructs like list comprehensions and lazy evaluation, and that in a language without such features the algorithm might get significantly more intricate.

If P(x) denotes the powerset of x, the Universe of Hereditarily Finite Sets HFS is constructed inductively as follows:

1. the empty set {} is in HFS
2.
if x is in HFS then the union of its power sets P^k(x) is in HFS

To implement in Haskell a simple HFS generator conforming to this definition, we start with a powerset function working with sets represented as lists:

    list_subsets [] = [[]]
    list_subsets (x:xs) =
      [zs | ys <- list_subsets xs, zs <- [ys,(x:ys)]]

We can generate the infinite stream of “pure” hereditarily finite sets using Haskell’s lazy evaluation mechanism, as follows:

    hfs_generator = uhfs_from 0 where
      uhfs_from k = union (old_hfs k) (uhfs_from (k+1))
      old_hfs k = elements_of (hpow k (U 0))
      elements_of (U _) = []
      elements_of (S hs) = hs

    hpow 0 h = h
    hpow k h = hpow (k-1) (S (hsubsets h))

    hsubsets (U n) = []
    hsubsets (S hs) = (map S (list_subsets hs))

One can now extract a finite number of HFS from the stream

    take 5 hfs_generator
    [S [],S [S []],S [S [S []]],
     S [S [],S [S []]],S [S [S [S []]]]]

and notice the identical behavior of hfs_generator and iterative_hfs_generator.

3.4 Encoding Hypergraphs

DEFINITION 1. A hypergraph (also called set system) is a pair H = (X, E) where X is a set and E is a set of non-empty subsets of X.

By limiting recursion to one level in Ackermann’s encoding, we can derive a bijective encoding of hypergraphs, represented as sets of sets:

    nat2hypergraph = (map nat2set) . nat2set
    hypergraph2nat = set2nat . (map set2nat)

as shown in the following example:

    nat2hypergraph 2008
    [[0,1],[2],[1,2],[0,1,2],[3],[0,3],[1,3]]
    hypergraph2nat (nat2hypergraph 2008)
    2008

As in the case of HFS, combinatorial generation of the infinite stream of hypergraphs becomes simply

    map nat2hypergraph [0..]

Note also that a hypothetical application using integers, finite sets and hypergraphs can use internally the same immutable data type, with opportunities to share common structures.
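The one-level encoding above can be exercised in isolation. The following sketch repeats set2nat and gives a guard-based variant of nat2set so that it is self-contained; the helper firstHypergraphs and all type signatures are hypothetical additions, not part of the paper's code.

```haskell
-- Bit positions set to 1 in n, in increasing order.
nat2set :: Integer -> [Integer]
nat2set = go 0
  where
    go _ 0 = []
    go e n
      | n `mod` 2 == 1 = e : rest
      | otherwise      = rest
      where rest = go (e + 1) (n `div` 2)

-- Sum of powers of 2 for a list of distinct exponents.
set2nat :: [Integer] -> Integer
set2nat ns = sum (map (2^) ns)

-- One level of decoding gives the hyperedge codes, a second level their members.
nat2hypergraph :: Integer -> [[Integer]]
nat2hypergraph = map nat2set . nat2set

hypergraph2nat :: [[Integer]] -> Integer
hypergraph2nat = set2nat . map set2nat

-- Hypothetical helper: a finite prefix of the infinite stream of hypergraphs.
firstHypergraphs :: Int -> [[[Integer]]]
firstHypergraphs k = take k (map nat2hypergraph [0..])
```

The round trip hypergraph2nat (nat2hypergraph n) returns n because each of the two levels is a set2nat/nat2set pair; laziness means firstHypergraphs only decodes as many numbers as are requested.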
In the following sections we will think about Ackermann’s encoding and its inverse as Functors in Category Theory (Pierce 1991), transporting various operations from Natural Numbers to Hereditarily Finite Sets and back.

4. Shapeshifting Operations between Nat and HFS

4.1 Fold operators and functors

Given the rose tree structure of HFS, a natural fold operation (Nipkow and Paulson 2005) can be defined on them as a higher order Haskell function:

    hfold f g (U n) = g n
    hfold f g (S xs) = f (map (hfold f g) xs)

For instance, it can count how many sets occur in a given HFS, as follows:

    hsize = hfold f g where
      f xs = 1+(sum xs)
      g _ = 1

Note that recursing over nat2set has been used to build a member of HFS from a member of Nat. Thus, we can combine it with the action of a fold operator working directly on natural numbers as follows (subtracting ulimit before decoding, as in nat2hfs_):

    nfold f g n = nfold_ f g urelement_limit n

    nfold_ f g ulimit n | n<ulimit = g n
    nfold_ f g ulimit n =
      f (map (nfold_ f g ulimit) (nat2set (n-ulimit)))

For instance, nfold allows counting the elements contained in the HFS representation of a number:

    nsize = nfold f g where
      f xs = 1+(sum xs)
      g _ = 1

as if defined by

    nsize_alt n = hsize (nat2hfs n)

The action of the Ackermann encoding as a Functor from HFS to Nat on morphisms (seen as functions on a list of arguments) is defined as follows:

    toNat f = nat2hfs . f . (map hfs2nat)

The same, acting on 1 and 2 argument operations is:

    toNat1 f i = nat2hfs (f (hfs2nat i))
    toNat2 f i j = nat2hfs (f (hfs2nat i) (hfs2nat j))

The inverse Ackermann encoding acts as a Functor from Nat to HFS:

    toHFS f = hfs2nat . f .
      (map nat2hfs)

with variants acting on 1 and 2 argument functions:

    toHFS1 f x = hfs2nat (f (nat2hfs x))
    toHFS2 f x y = hfs2nat (f (nat2hfs x) (nat2hfs y))

Note that the nat2set and set2nat functions used in the Ackermann encoding and its inverse can also be seen as providing Functors connecting Nat and [Nat] (seen as a representation of finite subsets of Nat):

    toExps f = set2nat . f . (map nat2set)
    fromExps f = nat2set . f . (map set2nat)

4.2 Mappings between Arithmetic and Set Operations

After extending 2 argument set operations to lists, using foldl

    setOp f [] = []
    setOp f (x:xs) = foldl f x xs

we can define the equivalent of adduction (i.e. {i} ∪ s, see (Kaye and Wong 2007; Kirby 2007)), union, intersection etc., on natural numbers seen as (lists of) sets:

    nat_adduction i is = set2nat (union [i] (nat2set is))
    nat_singleton i = 2^i

    nat_intersect = nats_intersect . nat2set
    nats_intersect = toExps (setOp intersect)

    nat_union = nats_union . nat2set
    nats_union = toExps (setOp union)

    nat_equal i j = if i==j then 1 else 0

Similarly, we can transport from Nat to HFS operations like successor, addition, product and equality, as follows:

    hsucc = toNat1 succ
    hsum = toNat sum
    hproduct = toNat product
    hequal = toNat2 nat_equal
    hexp2 = toNat1 (2^)

with the practical idea in mind that one can pick the most efficient (or the simplest to implement) of the two representations at will.

As current computer architectures tend to support Natural Numbers and underlying arbitrary integer representations quite well, we can pick them as the hub that mediates the “shapeshiftings” between various data types. However, in an application where lazy structure building would be instrumental for performance, something like HFS (or one of the encodings described in the next sections) could be the most appropriate internal representation.

5. Pairing Functions

Pairings are bijective functions Nat × Nat → Nat.
Following the classic notation for pairings of (Robinson 1950), given the pairing function J, its left and right inverses K and L are such that

    \[ J(K(z), L(z)) = z \tag{1} \]
    \[ K(J(x, y)) = x \tag{2} \]
    \[ L(J(x, y)) = y \tag{3} \]

We refer to (Cégielski and Richard 2001) for a typical use in the foundations of mathematics and to (Rosenberg 2002) for an extensive study of various pairing functions and their computational properties.

On top of the “set operations” defined in subsection 4.2 on Nat, the classic Kuratowski ordered pair (a, b) = {{a}, {a, b}} can be implemented with adductions and singletons as follows:

    nat_kpair x y = nat_adduction sx ssxy where
      sx = nat_singleton x
      sy = nat_singleton y
      sxy = nat_adduction x sy
      ssxy = nat_singleton sxy

However, the Kuratowski pair only provides an injective function Nat × Nat → Nat, resulting in fast growing integers very quickly:

    [nat_kpair x y|x<-[0..3],y<-[0..3]]
    [2,10,34,514,12,4,68,1028,48,
     80,16,4112,768,1280,4352,256]

5.1 Cantor’s Pairing Function

We can do better by borrowing some interesting pairing functions defined on natural numbers. Starting from Cantor’s pairing function, bijections from Nat × Nat to Nat have been used for various proofs and constructions of mathematical objects (Robinson 1950, 1955, 1968a,b; Cégielski and Richard 2001). Cantor’s pairing function is defined as:

    nat_cpair x y = (x+y)*(x+y+1) `div` 2+y

Note that its range is more compact

    [nat_cpair i j|i<-[0..3],j<-[0..3]]
    [0,2,5,9,1,4,8,13,3,7,12,18,6,11,17,24]

Unfortunately, its inverse involves floating point operations that do not combine well with arbitrary length integers.

5.2 A new Pairing Function

We will introduce here a new pairing function that provides compact representations for various set theoretic constructs involving ordered pairs while only using elementary integer arithmetic operations.
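To make the pairing laws (1)-(3) concrete using only integer division and remainder, here is a minimal sketch that assumes a pairing obtained by interleaving the bits of the two arguments. The names interleave, deinterleave and pairingLawsHold are hypothetical and this arithmetic formulation is an illustration only; in the notation above, J corresponds to interleave, K to fst . deinterleave and L to snd . deinterleave.

```haskell
-- Bits of x go to even positions of the result, bits of y to odd positions.
interleave :: Integer -> Integer -> Integer
interleave 0 0 = 0
interleave x y =
  (x `mod` 2) + 2 * (y `mod` 2) + 4 * interleave (x `div` 2) (y `div` 2)

-- Split z back into the numbers formed by its even and odd bit positions.
deinterleave :: Integer -> (Integer, Integer)
deinterleave 0 = (0, 0)
deinterleave z = (z `mod` 2 + 2 * x, (z `div` 2) `mod` 2 + 2 * y)
  where (x, y) = deinterleave (z `div` 4)

-- Checks laws (1)-(3) on a finite range; a test, not a proof.
pairingLawsHold :: Integer -> Bool
pairingLawsHold bound =
  and [deinterleave (interleave x y) == (x, y) | x <- [0..bound], y <- [0..bound]]
  && and [uncurry interleave (deinterleave z) == z | z <- [0..bound * bound]]
```

Both functions consume two bits of the paired number per recursive step, so their cost is proportional to the bit length of the inputs and no floating point operation is involved.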
Our bijection bitmerge_pair from Nat × Nat to Nat and its inverse bitmerge_unpair are defined as follows:

    bitmerge_pair (i,j) =
      set2nat ((evens i) ++ (odds j)) where
        evens x = map (*2) (nat2set x)
        odds y = map succ (evens y)

    bitmerge_unpair n = (f xs,f ys) where
      (xs,ys) = partition even (nat2set n)
      f = set2nat . (map (`div` 2))

The function bitmerge_unpair works by splitting a number’s bitstring representation into its even and odd bit positions, while its inverse bitmerge_pair blends the even and odd bits back together. With the help of the function to_rbits given in the Appendix, which decomposes n ∈ Nat into a list of bits (smaller units first), one can follow what happens, step by step:

    to_rbits 2008
    [0,0,0,1, 1,0,1,1, 1,1,1]
    bitmerge_unpair 2008
    (60,26)
    to_rbits 60
    [0,0, 1,1, 1,1]
    to_rbits 26
    [0,1, 0,1, 1]
    bitmerge_pair (60,26)
    2008

Note also the significantly more compact packing, compared to Kuratowski pairs, and, like Cantor’s pairing function, similar growth in both arguments:

    map bitmerge_unpair [0..15]
    [(0,0),(1,0),(0,1),(1,1),(2,0),(3,0),
     (2,1),(3,1),(0,2),(1,2),(0,3),(1,3),
     (2,2),(3,2),(2,3),(3,3)]
    [bitmerge_pair (i,j)|i<-[0..3],j<-[0..3]]
    [0,2,8,10,1,3,9,11,4,6,12,14,5,7,13,15]

5.3 Powersets, Ordinals and Choice Functions

A concept of (finite) powerset can be associated to a number n ∈ Nat by computing the powerset of the HFS associated to it:

    nat_powset i = set2nat
      (map set2nat (list_subsets (nat2set i)))

or, directly, as in (Abian and Lamacchia 1978):

    nat_powset_alt i = product
      (map (\k->1+2^(2^k)) (nat2set i))

The von Neumann ordinal associated to a HFS, defined with interval notation as λ = [0, λ), is implemented by the function hfs_ordinal, simply by transporting it from Nat:

    nat_ordinal 0 = 0
    nat_ordinal n = set2nat
      (map nat_ordinal [0..(n-1)])

    hfs_ordinal = nat2hfs . nat_ordinal

The following example shows the transitive structure of a von Neumann ordinal’s set representation (see Fig. 2).
It also shows its fast growing Nat encoding (4 → 2059), which can be seen as a somewhat unusual injective embedding of finite ordinals in Nat, seen as the set of finite cardinals.

    hfs_ordinal 4
    S [S [],S [S []],S [S [],S [S []]],
       S [S [],S [S []],S [S [],S [S []]]]]
    nat_ordinal 4
    2059

Figure 2: 4 and its associated ordinal: as a pure HFS and its associated ordinal 2059

Finally, a choice function is implemented as an encoding of pairs of sets and their first elements with our compact Nat × Nat → Nat pairing function:

    nat_choice_fun i = set2nat xs where
      es = nat2set i
      hs = map (head . nat2set) es
      xs = zipWith (curry bitmerge_pair) es hs

As even numbers represent sets that do not contain the empty set as an element, we compute Nat representations of the choice function as follows:

    map nat_choice_fun [0,2..16]
    [0,2,64,66,32,34,96,98,16777216]

Note that nat_choice_fun computes a natural number representation, i.e. a Gödel number, for a function that picks an element of each set of any family of sets not containing the empty set. Constructing such a natural number proves that Nat, with the structure borrowed from HFS, is actually a model for the Axiom of Choice. Such models are important in the foundations of mathematics as they show that interpretations of sets and functions other than the usual ones are compatible with various axiomatizations of set theory (Kaye and Wong 2007; Kirby 2007).

6. Directed Graph Encodings

Directed Graphs are equivalent to binary relations seen as sets of ordered pairs. Equivalently (as implemented in the Haskell Data.Graph package), they can also be seen as arrays of vertices in [0..n] paired with lists of vertices of adjacent outgoing edges. We will freely alternate between these two representations in this section.
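The two views of a directed graph mentioned above can be converted into each other with functions exported by Data.Graph. The sketch below assumes the containers and array packages that ship with GHC; sampleEdges and the helper names toGraph, fromGraph, reversedEdges, outNeighbours and sameRelation are hypothetical and used only for illustration.

```haskell
import Data.Array ((!))
import Data.Graph (Graph, Edge, Vertex, buildG, edges, transposeG)
import Data.List (sort)

-- A binary relation as a list of ordered pairs (hypothetical sample data).
sampleEdges :: [Edge]
sampleEdges = [(0,1),(0,2),(1,2),(2,3)]

-- Edge list to adjacency array; vertices range over [0..maxVertex].
toGraph :: [Edge] -> Graph
toGraph es = buildG (0, maxVertex) es
  where maxVertex = maximum (0 : concat [[a, b] | (a, b) <- es])

-- Adjacency array back to an edge list (edge order may differ from the input).
fromGraph :: Graph -> [Edge]
fromGraph = edges

-- Edges of the transposed graph, i.e. the converse relation.
reversedEdges :: Graph -> [Edge]
reversedEdges = edges . transposeG

-- Outgoing neighbours of a vertex, read directly from the array.
outNeighbours :: Graph -> Vertex -> [Vertex]
outNeighbours g v = g ! v

-- Two edge lists denote the same relation if they agree up to order.
sameRelation :: [Edge] -> [Edge] -> Bool
sameRelation xs ys = sort xs == sort ys
```

Since the order of the edges returned by fromGraph is not guaranteed to match the input list, the comparison is done on sorted lists, which is the appropriate notion of equality for relations seen as sets of pairs.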
6.1 Directed Acyclic Graph representations for HFS

The rose tree representation of HFS can be seen as a set of edges, oriented to describe either set membership ∈ or its transpose, set containment.

    nat2memb = nat2pairs with_memb
    nat2contains = nat2pairs with_contains

    with_memb a x = (x,a)
    with_contains a x = (a,x)

Note that this uses the function nat2pairs (see Appendix) that provides the actual decomposition of a number into Haskell ordered pairs. The following examples show how this works:

    nat2memb 42
    [(0,1),(0,3),(0,5),(1,2),(1,3),
     (1,42),(2,5),(3,42),(5,42)]
    nat2contains 42
    [(1,0),(2,1),(3,0),(3,1),(5,0),
     (5,2),(42,1),(42,3),(42,5)]

These list-of-pairs representations can be easily converted to Haskell’s graph data type (imported from Data.Graph) as follows:

    nat2member_dag = nat2dag_ nat2memb
    nat2contains_dag = nat2dag_ nat2contains

    nat2dag_ f n = buildG (0,l) es where
      es = reverse (f n)
      l = foldl max 0 (nat2parts n)

where nat2parts, given in the Appendix, converts n to the set of Natural Numbers occurring in its HFS representation.

Moreover, the pair representation of ∈ and its inverse can be turned into a more compact graph by replacing its n distinct vertex numbers with smaller integers from [0..n−1], by progressively building a map describing this association, as shown in the function to_dag

    to_dag n = (buildG (0,l)
      (map (remap m) (nat2contains n))) where
        is = [0..l]
        ns = reverse (nat2parts n)
        l = (genericLength ns)-1
        m = (zip ns is)
        remap m (f,t) = (lf,lt) where
          (Just lf) = (lookup f m)
          (Just lt) = (lookup t m)

Dually, one can convert n ∈ Nat to the containment graph of its HFS as follows

    to_ddag = transposeG . to_dag

An interesting question arises at this point. Can we rebuild a natural number from its directed acyclic graph representation, assuming no labels are available, except 0?
Surprisingly, the answer is yes, and the function from_dag provides the conversion:

    from_dag g =
      compute_decoration g (fst (bounds g))

    compute_decoration g v =
      compute_decorations g (g!v) where
        compute_decorations _ [] = 0
        compute_decorations g es =
          sum (map ((2^) . (compute_decoration g)) es)

    to_dag 42
    array (0,5) [(0,[1,2,4]),(1,[3,5]),(2,[4,5]),
                 (3,[4]),(4,[5]),(5,[])]
    from_dag (to_dag 42)
    42
    to_ddag 42
    array (0,5) [(0,[]),(1,[0]),(2,[0]),(3,[1]),
                 (4,[3,2,0]),(5,[4,2,1])]

After implementing this function, we have found that it closely follows the decoration functions used in Aczel’s book (Aczel 1988), and renamed it compute_decoration. In the simpler case of the HFS universe, with our well-founded sets represented as DAGs, the existence and uniqueness of the result computed by from_dag follows immediately from the Mostowski Collapsing Lemma (Aczel 1988).

6.2 Extensional/Intensional Duality

What can be said about the graphs obtained by reversing the direction of the arrows representing the ∈ relation?

Intuitively, it corresponds to the fact that intensions/concepts would become the building blocks of the theory, provided that something similar to the axiom of extensionality holds. In comments related to Russell’s type theory ((Goedel 1999), pp. 457-458) Gödel mentions an axiom of intensionality with the intuitive meaning that “different definitions belong to different notions”. Gödel also notices the duality between “no two different properties belong to exactly the same things” and “no two different things have exactly the same properties” but warns that contradictions in a simple type theory would result if such an axiom is used non-constructively.

We can now look for the presence of intensional/extensional symmetry in HFS by trying to rebuild a HFS representation from the transpose of ∈, ∋:

    from_ddag g =
      compute_decoration g (snd (bounds g))

    intensional_dual_of = from_ddag .
      to_ddag

Are such representations self-dual? Let’s define as self-dual a number n ∈ Nat that equals its intensional dual and then filter self-dual numbers in an interval:

    self_idual n = n==intensional_dual_of n
    self_iduals from to =
      filter self_idual [from..to]

Unfortunately, as the following example shows, relatively few numbers are self-duals:

    self_iduals 0 1000
    [0,1,2,3,4,5,10,11,16,17,34,35,
     64,65,130,131,264,265,522,523]

Figures 3 and 4 show some HFS graphs of natural numbers equal to their intensional duals. We will leave it as a topic for future research to investigate in more depth various aspects of ∈/∋ duality in HFS, in correlation with Natural Numbers and their encodings.

Figure 3: self-dual and its intensional dual as HFS graphs: 131 and the dual of 131

6.3 Encodings of Directed Graphs as Natural Numbers

Hypersets (Aczel 1988) are defined by replacing the Foundation Axiom with the AntiFoundation axiom. Intuitively this means that the ∈-graphs can be cyclical (Barwise and Moss 1996), provided that they are minimized through bisimulation equivalence (Dovier et al. 2001). We have not (yet) found an elegant encoding of hereditarily finite hypersets as natural numbers, similar to Ackermann’s encoding. The main difficulty seems related to the fact that hypersets are modeled in HFS as equivalence classes with respect to bisimulation (Aczel 1988; Barwise and Moss 1996; Piazza and Policriti 2004).
Toward this end, an easy first step seems to be finding a bijection from directed graphs (with no isolated vertices, corresponding to their view as binary relations) to Nat:

    nat2digraph n = map bitmerge_unpair (nat2set n)
    digraph2nat ps = set2nat (map bitmerge_pair ps)

Figure 4: self-dual and its intensional dual as HFS graphs: 16393 and the dual of 16393

With digraphs represented as lists of edges, this bijection works as follows:

    nat2digraph 2008
    [(1,1),(2,0),(2,1),(3,1),
     (0,2),(1,2),(0,3)]
    digraph2nat (nat2digraph 2008)
    2008
    nat2digraph (255)
    [(0,0),(1,0),(0,1),(1,1),
     (2,0),(3,0),(2,1),(3,1)]
    digraph2nat (nat2digraph 255)
    255

As usual

    map nat2digraph [0..]

provides a combinatorial generator for the infinite stream of directed graphs (cycles included, as the edge (0,0) in the last example shows).

7. Related work

Natural Number encodings of Hereditarily Finite Sets have triggered the interest of researchers in fields ranging from Axiomatic Set Theory and Foundations of Logic to Complexity Theory and Combinatorics (Takahashi 1976; Kaye and Wong 2007; Kirby 2007; Abian and Lamacchia 1978; Meir et al. 1983; Leontjev and Sazonov 2000; Sazonov 1993; Avigad 1997). Graph representations of sets and hypersets based on the variants of the Anti Foundation Axiom have been studied extensively in (Aczel 1988; Barwise and Moss 1996). Computational and Data Representation aspects of Finite Set Theory and hypersets have been described in logic programming and theorem proving contexts in (Dovier et al. 2000; Piazza and Policriti 2004; Dovier et al. 2001; Paulson 1994). Pairing functions have been used in work on decision problems as early as (Pepis 1938; Kalmar 1939; Robinson 1950, 1955, 1968a,b). Various mappings from natural number encodings to Rational Numbers are described in (Gibbons et al. 2006), also in a functional programming framework.

8.
Conclusion and Future Work

Implementing with relative ease the encoding techniques typically used only in the foundations of mathematics recommends Haskell as a surprisingly effective tool for experimental mathematics.

We have described a variety of isomorphisms between mathematically interesting data structures, all centered around encodings as Natural Numbers. The possibility of sharing significant common parts of HFS-represented integers could be used in implementing shared stores for arbitrary length integers. Along the same lines, another application would be data compression using some “information theoretically minimal” variants of the graphs in subsection 6.1, from which larger HFS and/or natural numbers can be rebuilt.

Last but not least, making more accessible to computer science students some of the encoding techniques typically used only in the foundations of mathematics (and related reasoning techniques) suggests applications to teaching discrete mathematics and/or functional languages in the tradition of (Hall and O’Donnell 2000).

References

Alexander Abian and Samuel Lamacchia. On the consistency and independence of some set-theoretical constructs. Notre Dame Journal of Formal Logic, XIX(1):155–158, 1978.

Wilhelm Friedrich Ackermann. Die Widerspruchsfreiheit der allgemeinen Mengenlehre. Mathematische Annalen, (114):305–315, 1937.

Peter Aczel. Non-wellfounded sets. Number 14 in CSLI Lecture Notes. Center for the Study of Language and Information (CSLI), 1988.

Jeremy Avigad. The Combinatorics of Propositional Provability. In ASL Winter Meeting, San Diego, January 1997.

Jon Barwise and Lawrence Moss. Vicious Circles. Number 60 in CSLI Lecture Notes. Center for the Study of Language and Information (CSLI), 1996.

Patrick Cégielski and Denis Richard. Decidability of the theory of the natural integers with the cantor pairing function and the successor. Theor. Comput. Sci., 257(1-2):51–77, 2001.
Agostino Dovier, Carla Piazza, and Alberto Policriti. Comparing Expressiveness of Set Constructor Symbols. In Frontiers of Combining Systems, pages 275–289, 2000.
Agostino Dovier, Carla Piazza, and Alberto Policriti. A Fast Bisimulation Algorithm. In Gérard Berry, Hubert Comon, and Alain Finkel, editors, CAV, volume 2102 of Lecture Notes in Computer Science, pages 79–90. Springer, 2001. ISBN 3-540-42345-1.
Jeremy Gibbons, David Lester, and Richard Bird. Enumerating the rationals. Journal of Functional Programming, 16(4), 2006. URL http://www.comlab.ox.ac.uk/oucl/work/jeremy.gibbons/publications/rationals.pdf.
Kurt Goedel. Russell's mathematical logic. In A.D. Irvine, editor, Bertrand Russell: Critical Assessments, London, 1999. Routledge.
C. Hall and J. O'Donnell. Discrete Mathematics Using a Computer. Springer, 2000.
Laszlo Kalmar. On the reduction of the decision problem. First paper. Ackermann prefix, a single binary predicate. The Journal of Symbolic Logic, 4(1):1–9, mar 1939. ISSN 0022-4812.
Richard Kaye and Tin Lok Wong. On Interpretations of Arithmetic and Set Theory. Notre Dame J. Formal Logic, 48(4):497–510, 2007.
Laurence Kirby. Addition and multiplication of sets. Math. Log. Q., 53(1):52–65, 2007.
Alexander Leontjev and Vladimir Yu. Sazonov. Capturing LOGSPACE over Hereditarily-Finite Sets. In Klaus-Dieter Schewe and Bernhard Thalheim, editors, FoIKS, volume 1762 of Lecture Notes in Computer Science, pages 156–175. Springer, 2000. ISBN 3-540-67100-5.
Amram Meir, John W. Moon, and Jan Mycielski. Hereditarily Finite Sets and Identity Trees. J. Comb. Theory, Ser. B, 35(2):142–155, 1983.
Tobias Nipkow and Lawrence C. Paulson. Proof Pearl: Defining Functions over Finite Sets. In Joe Hurd and Thomas F. Melham, editors, TPHOLs, volume 3603 of Lecture Notes in Computer Science, pages 385–396. Springer, 2005. ISBN 3-540-28372-2.
Lawrence C. Paulson. A Concrete Final Coalgebra Theorem for ZF Set Theory. In Peter Dybjer, Bengt Nordström, and Jan M. Smith, editors, TYPES, volume 996 of Lecture Notes in Computer Science, pages 120–139. Springer, 1994. ISBN 3-540-60579-7.
Jozef Pepis. Ein Verfahren der mathematischen Logik. The Journal of Symbolic Logic, 3(2):61–76, jun 1938. ISSN 0022-4812.
Simon L. Peyton Jones. Haskell 98: Introduction. J. Funct. Program., 13(1):0–6, 2003a.
Simon L. Peyton Jones. Haskell 98: Standard prelude. J. Funct. Program., 13(1):103–124, 2003b.
Simon L. Peyton Jones, editor. Haskell 98 Language and Libraries: The Revised Report. September 2002. http://haskell.org/definition/haskell98-report.pdf.
Carla Piazza and Alberto Policriti. Ackermann Encoding, Bisimulations, and OBDDs. TPLP, 4(5-6):695–718, 2004.
Benjamin C. Pierce. Basic Category Theory for Computer Scientists. MIT Press, 1991.
Julia Robinson. General recursive functions. Proceedings of the American Mathematical Society, 1(6):703–718, dec 1950. ISSN 0002-9939.
Julia Robinson. A note on primitive recursive functions. Proceedings of the American Mathematical Society, 6(4):667–670, aug 1955. ISSN 0002-9939.
Julia Robinson. Recursive functions of one variable. Proceedings of the American Mathematical Society, 19(4):815–820, aug 1968a. ISSN 0002-9939.
Julia Robinson. Finite generation of recursively enumerable sets. Proceedings of the American Mathematical Society, 19(6):1480–1486, dec 1968b. ISSN 0002-9939.
Arnold L. Rosenberg. Efficient pairing functions - and why you should care. In IPDPS. IEEE Computer Society, 2002. ISBN 0-7695-1573-8.
Vladimir Yu. Sazonov. Hereditarily-Finite Sets, Data Bases and Polynomial-Time Computability. Theor. Comput. Sci., 119(1):187–214, 1993.
Moto-o Takahashi. A Foundation of Finite Mathematics. Publ. Res. Inst. Math. Sci., 12(3):577–708, 1976.

Appendix

To make the code in the paper fully self-contained, we list here some auxiliary functions.
Bit crunching functions

The following functions implement conversion operations between bitlists and numbers. Note that our bitlists represent binary numbers by selecting exponents of 2 in increasing order (i.e. "right to left"), so the least significant bit comes first.

    -- from decimals to binary as list of bits
    to_rbits n = to_base 2 n

    -- from bits to decimals
    from_rbits bs = from_base 2 bs

    -- conversion to base n, as list of digits
    to_base base n = d : (if q == 0 then [] else (to_base base q))
      where (q,d) = quotRem n base

    -- conversion from any base to decimal
    from_base base [] = 0
    from_base base (x:xs) = x + base * (from_base base xs)

String Representations

The function setShow provides a string representation of a natural number as a "pure" HFS.

    setShow n = sShow urelement_limit n

The function sShow provides a string representation of a natural number as an HFS with Urelements. It uses intersperse from Data.List.

    sShow 1 0 = "{}"
    sShow ulimit n | n < ulimit = show n
    sShow ulimit n =
      "{" ++ foldl (++) ""
        (intersperse "," (map (sShow ulimit) (nat2set (n-ulimit))))
      ++ "}"

Conversion to Ordered Pairs

The function nat2pairs converts a natural number to a set of Haskell ordered pairs expressing the ∈ relation on its associated HFS or its dual. It uses sort and nub from Data.List.

    nat2pairs withF n = (sort . nub) (nat2ps withF n)

    nat2ps withF 0 = []
    nat2ps withF from = ((n2rel ns) ++ (ns2rel ns)) where
      f = withF from
      n2rel = map f
      ns2rel = concatMap (nat2ps withF)
      ns = nat2set from

The function nat2parts converts n to the set of Natural Numbers occurring in the HFS representation of n.

    nat2parts = sort . nub . nat2repeated where
      nat2repeated 0 = [0]
      nat2repeated from = from : (nat2more ns) where
        nat2more = concatMap nat2repeated
        ns = nat2set from
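The bit crunching helpers of this appendix can be exercised against the digraph examples given in the section on encoding digraphs. The following is a minimal sketch rather than the paper's own code. The bodies of nat2set, set2nat, bitmerge_pair and bitmerge_unpair are defined elsewhere in the paper and are reconstructed here under two assumptions: a natural number denotes the set of positions of its 1 bits, and the pairing function interleaves bits, with the first component on even positions. The helpers evens, odds and merge are hypothetical names introduced only for illustration.

```haskell
-- Sketch only: nat2set, set2nat, bitmerge_pair and bitmerge_unpair are
-- reconstructions (assumptions), chosen to agree with the sample outputs
-- in the text. evens, odds and merge are hypothetical helper names.

-- Digits of n in the given base, least significant digit first.
to_base :: Integer -> Integer -> [Integer]
to_base base n = d : (if q == 0 then [] else to_base base q)
  where (q, d) = quotRem n base

-- Inverse of to_base: rebuild the number from its digit list.
from_base :: Integer -> [Integer] -> Integer
from_base _ [] = 0
from_base base (x:xs) = x + base * from_base base xs

to_rbits :: Integer -> [Integer]
to_rbits = to_base 2

from_rbits :: [Integer] -> Integer
from_rbits = from_base 2

-- Assumed: n denotes the set of positions of its 1 bits.
nat2set :: Integer -> [Integer]
nat2set n = [i | (i, 1) <- zip [0..] (to_rbits n)]

set2nat :: [Integer] -> Integer
set2nat = sum . map (2 ^)

-- Elements at even positions (0,2,4,..) and odd positions (1,3,5,..).
evens, odds :: [a] -> [a]
evens (x:xs) = x : odds xs
evens []     = []
odds (_:xs)  = evens xs
odds []      = []

-- Interleave two bitlists, padding the shorter one with zeros.
merge :: [Integer] -> [Integer] -> [Integer]
merge [] [] = []
merge xs ys = hd xs : hd ys : merge (drop 1 xs) (drop 1 ys)
  where hd (b:_) = b
        hd []    = 0

-- Assumed pairing: x supplies the even bits of z, y the odd bits.
bitmerge_unpair :: Integer -> (Integer, Integer)
bitmerge_unpair z = (from_rbits (evens bs), from_rbits (odds bs))
  where bs = to_rbits z

bitmerge_pair :: (Integer, Integer) -> Integer
bitmerge_pair (x, y) = from_rbits (merge (to_rbits x) (to_rbits y))

-- The digraph encoding as given in the text.
nat2digraph :: Integer -> [(Integer, Integer)]
nat2digraph n = map bitmerge_unpair (nat2set n)

digraph2nat :: [(Integer, Integer)] -> Integer
digraph2nat ps = set2nat (map bitmerge_pair ps)
```

Loaded in GHCi, nat2digraph 2008 and nat2digraph 255 yield the edge lists shown in the digraph section, and digraph2nat maps them back to 2008 and 255. This suggests that the assumed definitions agree with the original ones on these inputs, although it does not establish that they are identical.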
