Extensible Sparse Functional Arrays User's Guide

Introduction

The esfa package contains a suite of libraries and programs in Haskell and C that implement ESF arrays. This document explains what ESF arrays are, how to use them, and how they work. The examples directory contains several test programs that illustrate how ESF arrays behave.

Further resources, including papers, presentations, and the latest versions of the software and this user guide, are available on the ESFA web page.

Notes

The system has been tested with QuickCheck. An alternative to QuickCheck is the program RandOps, in the examples directory. This generates operations randomly based on the status of arrays that exist in the system at each step, and gives finer control than is easily achievable with QuickCheck. Each update calculates the "axiomatic array" using the specification of ESF arrays, as well as performing the update completely independently on the parallel circuit. After each update, the RandOps program traverses the circuit, extracts the arrays, compares with the correct-by-specification arrays, and keeps track of the results. It has performed tens of thousands of operations with no errors.
This document isn't finished yet. The software is part of a research project, and is not yet a stable production release. Some changes to the API are planned, and this document will be updated as that happens. Perhaps the token AName will be replaced by something like ESFA.
Some parts of the software aren't integrated into this version of the code yet.

What is an ESF array?

ESFA is an array data structure with pure functional access, sparse traversal, extensibility, associative searching, and memory management, yet with O(1) access time.

The significance of the ESFA system is that it has a lower time and space complexity than appears to be possible. There is good reason to conjecture that it is impossible to obtain O(1) access time for the ESFA operations using a standard computer architecture, such as the RAM theoretical model or the von Neumann architecture.

The conjecture assumes that unrestricted usage of the ESFA operations is allowed; naturally if you restrict the way the arrays are updated, a conventional implementation can give O(1) access --- as long as it's acceptable to give wrong results if you perform an update that falls outside its limitations. In contrast, the ESFA system always performs every operation in unit time, regardless of the past history, and with no restrictions on how you use the arrays.

Here is a quick motivation for the conjecture: if you make several updates to an array, you can't afford to replicate the elements of the original array even though they are now shared among several arrays, and you also can't afford to follow any pointers through a linked data structure that implements the sharing.

So how does this system implement algorithms in less time than the lower bound? The answer is that ESFA is a software/hardware codesign: the algorithms run on a digital circuit which provides a smart memory that exploits the massive parallelism of circuits to achieve a lower time and space complexity than is apparently possible on conventional hardware. Furthermore, this is not achieved by shifting the complexity from the software to the hardware: the ESFA circuit provides powerful primitives without significantly increasing the complexity of standard hardware (in gate delay or component count). Depending on the cost model used, the ESFA circuit either has the same asymptotic cost as a conventional RAM memory, or its cost is increased by a negligible amount. There is, of course, a constant-factor slowdown.

ESFA supports several kinds of operation. The most important feature is pure functional arrays, but the techniques used in the implementation support several additional features (sparsity, extensibility, associativity) as well. Here are the main characteristics of the data structure:

Every ESFA operation takes unit time; that is, it requires a fixed number of clock cycles, regardless of the past history of operations.
ESF arrays are immutable, and are accessed using incremental updates and lookups (i.e. "one element at a time", just as with conventional imperative arrays). ESF arrays are purely functional: when you update an array, the result is a completely new array, and the old one is unchanged and still available. Instead of making a side-effect to an array to change its value, a program simply performs an update to create a new array, and it can't ever change the value of an old array. This is normal in a pure functional language; for example, in Haskell you don't modify a list by zapping its first element in memory, but instead you construct a new list using the (:) function.
There are no restrictions on the use of the ESFA functions. For example, a program can make many alternative updates to the same array, resulting in massive sharing of elements among arrays.
ESF arrays may be sparse, and a program can traverse an array from one defined index to the next, skipping over indices that don't have a defined value.
The arrays are extensible: you don't need to allocate space, or specify the minimum and maximum index values. At any time the minimum index can be decreased, or the maximum index increased, simply by updating the array with a new value at a lower (higher) index.
Several operations that support storage management are provided; these make it possible either for a program to manage space explicitly, or to integrate ESFA with a heap garbage collector. There is an operation to delete an array, which automatically reclaims all elements that are inaccessible while retaining all elements that are shared with a live array -- in O(1) time!

A variety of applications using ESFA are possible. For example, the data structure provides a very flexible "undo/redo" facility, and functional arrays directly implement the environment extension and lookup used in denotational semantics. This document focuses on how to use the operations and how the implementation works, rather than on the range of possible applications.

All of the ESFA operations can be implemented using standard algorithms on a conventional architecture, but this increases the time complexity of array access. That is unfortunate, because conventional imperative arrays offer O(1) time update and lookup.

The ESFA system described here uses a hardware/software codesign. The operations are implemented in three levels; the top two levels are software, but the lowest level is a digital circuit that implements a "smart memory". The crux of the system is that the smart memory has essentially the same complexity as a conventional RAM memory (in terms of gate delay and component count), yet it makes it possible to implement the ESFA operations in a fixed number of clock cycles -- exactly what it is conjectured that the RAM cannot do.

ESFA can be executed on a conventional von Neumann architecture, by but simulating the circuit, but this increases the time complexity because a sequential simulator requires O(n) time to simulate one clock cycle of a memory with N cells. A sequential simulator is still useful for experimenting with the data structure, and it gives accurate timings for a true parallel circuit implementation.

The digital circuit is specified using a functional hardware description language, but isn't yet realised in a physical chip. However, the hardware description language constitutes an executable specification, and executing the specification simulates the circuit.

It isn't necessary to use a special VLSI chip to implement the hardware; it can also be emulated on a parallel computer, such as a multicore processor or a GPU.

ESFA operations

There are two APIs to the operations on ESF arrays: a set of pure functions and a set of monadic operations. Therefore each ESF operation (with a few exceptions) has two types; the functions whose names begin with esfm are the monadic versions. Some of the operations, including initialization and some of the diagnostics, are meaningful only in the monadic version.

The "esfm" operations are not the lowest level of operation; they collect tracing data which is stored in the heap, and they provide some extra diagnostic tools. This makes it possible to provide operations such as esfmPrintSpecArrays. The low level operations, which run either on a real or simulated digital circuit, have names beginning with "ptc".

Initialization

See Section for an example that shows how to set up and run a program.

The esfm family of functions provides some convenient features that make it easier to experiment with ESF arrays, but this is not the lowest level API, and also not the most secure. (That API isn't documented yet.) The esfm API keeps the state of the simulated circuit in a state monad:

StESFA u v c

StESFA is defined using StateT, with IO as the inner monad. The type variable a is the user's state type; v is the element value type used for ESF arrays, and c is the return type, which is often just (). The user's state is there just for convenience, but this isn't very secure.

The function simulateESFM k x script generates an instance of the circuit containing 2^k cells, and runs the simulator on the script, using x as the initial user state.

simulateESFM :: Int -> u -> StESFA u v () -> IO ()
simulateESFM k x script

Before doing any other operations, the system must be initialized.

     esfmInitialize

Update and lookup

An array has a handle or "stable pointer" which is of type AName and is represented as an Int. This is an opaque type; that is, there is no particular significance to the integer value of an AName. An AName is analogous to the machine address used to represent an ordinary imperative array. (However, there is a difference between an AName and an imperative array address. For imperative arrays, the numeric value of an array's address is used to calculate the address of an element at a given index, but, as will be seen later, that is not the case for ESF arrays.)

An ESFA consists of a set of mappings from an index to a value. An index has type Int, and a value has a polymorphic type; in this document the type variable v will be used for the value type. All elements of all ESF arrays must have the same type, so it's common to define an algebraic data type with tags to allow element values of different types. The examples in this document just use Int for all element types.

The system always has an empty array that contains no elements. This is not created by the user, and cannot be deleted. There are two names, corresponding to the functional (esf) and monadic (esfm) API, but there is no difference between them; they are both defined just for consistency in naming. (An AName is really an Int, and empty has the value -1, although the user doesn't need to know that, and should always use esfmEmpty and should never use -1 directly.)

esfEmpty, esfmEmpty :: AName

Update takes an existing array, an index, a value, and a Bool. It returns a new array which is identical to the old one, except that it has the specified value at the given index. The old array is not modified.

esfUpdate, esfUpdateNfy
  :: AName                     -- old array to be updated
  -> (Int,e)                   -- new element (index, value)
  -> Maybe AName               -- new array
esfmUpdate, esfmUpdateNfy
  :: Show e
  => String                    -- label to make dumps more readable
  -> AName                     -- old array to be updated
  -> (Int,e)                   -- new element (index, value)
  -> StESFA u a (Maybe AName)  -- result is new array

The four versions of update are almost the same. The differences among them are:

The monadic ones (esfm) return a result in the StESFA monad, while the nonmonadic ones (esf) just return the result directly.
Since the monadic API has facilities to dump the arrays, the element type needs to be in the Show class.
The monadic operations take an extra String argument, which is a symbolic label that is used to make array dumps more readable (see esfmShowArrays). Since array dumps are not supported in the non-monadic functions, the String argument is omitted.
The versions that end in Nfy indicate that the element should receive special treatment if and when an array deletion makes the element inaccessible. If an Nfy operation was used, then the host system will be notified before the element is reclaimed. This is needed to handle pointers from an ESF array back to a heap object (see the section Notification and Zombies). If an array is created with esfUpdate or esfmUpdate, then the element will be reclaimed immediately and automatically if and when it becomes inaccessible.

The lookup function is used to an element from an array at a specified index.

esfLookup
  :: AName                 -- array to look up
  -> Int                   -- index
  -> Maybe (EltVal e)      -- value if it exists
esfmLookup
  :: Show v                -- allows dumping arrays
  => AName                 -- array to look up
  -> Int                   -- index
  -> StESFA u v (Maybe v)  -- value if it exists

The behaviour of ESF arrays is defined by two laws.

Law 1 (Empty array.) If you look up any index at all in an empty array, you get nothing.

esfLookup esfEmpty i = Nothing

Law 2 (Nonempty array.) Suppose an array a2 is defined as update a j v. Now if you look up a2 at index j, the result is Just v. If you look up a2 at any index other than j, the result is the same as looking up a at that index.

esfLookup (esfUpdate a j v) i
  | i==j  = Just v
  | i/=j  = esfLookup a j

Some important points.

You don't allocate an array with a given size
There is only one way to create an array, and that's using update
When you update an array, you don't modify it - you create a new array
An array is immutable: once created, its value will never change
You can look up an array at any index; that's why the result is a Maybe type

Extensibility and sparse traversal

The following two operations take an array and return the minimum (maximum) index at which the array has a defined value, as well as the value at that index. The result is a Maybe type because Nothing is returned if the array doesn't exist.

esfmMinIndex, esfmMaxIndex
  :: AName
  -> StESFA u v (Maybe (Int,v))

The ability to find the extent of an array directly provides part of the functionality of extensible arrays. Recall that the program doesn't declare the bounds of an array; the lower bound (or minimum defined index) is simply the lowest index that was used in the creation of that array. At any time, an array can be extended simply by updating it with a lower (higher) bound than already exists (of course, this doesn't modify the old array; it creates a new array with the extended bounds). Since the extent of an array is so dynamic, we need to be able to find the minimum and maximum index.

An array may contain indices between the minimum and maximum bounds where no value has been defined. Although many algorithms using arrays actually define values at every index, there are many applications where it is more efficient to represent only the non-zero (or more generally, non-default) values. But suppose we have an array defined at indices 0, 123, 124, 2050, ... If an algorithm has to do a lookup at each index 0, 1, 2, 3, ... to find these elements, it will be quite slow. The essential feature of a sparse array data structure is the ability to traverse an array from one non-default value directly to the next, skipping over the holes. The following operations provide that capability:

esfmNextIndex, esfmPrevIndex
  :: AName
  -> Int
  -> StESFA u v (Maybe (Int,v))

As with all the ESFA operations, these take a small constant number of clock cycles.

Associative searching

search
  :: AName                       -- array to search
  -> Word32                      -- mask
  -> Word32                      -- pattern
  -> StESFA u v (Maybe (Int,v))  -- index and value, if match found

searchNext, searchPrev
  :: AName                       -- array to search
  -> Int                         -- starting index for search
  -> Word32                      -- mask
  -> Word32                      -- pattern
  -> StESFA u v (Maybe (Int,v))  -- index and value, if match found

Tracing and diagnostics

esfmPrintState, esfmPrintMachineArrays, esfmPrintSPecArrays
  :: StESFA u v ()
esfmSetOption
  :: EsfmTraceOperation
  -> StESFA u a ()
esfmAnalyzeMachineState
  :: StESFA u v MachineStatistics
data MachineStatistics
  = MachineStats
      {ncells, narrays, nelts, nzombies, nfree :: Int}

esfmPrintState prints out the state of the circuit; i.e. the contents of the cell fields. Only the cells that contain useful information are printed; empty cells are skipped.
esfmPrintMachineArrays. This examines the state of the (simulated) circuit and works out the contents of the arrays that are represented in the machine. These arrays are printed out. This operation shows what is actually represented in the machine (cf. esfmPrintSpecArrays). This is a meta-operation that relies on the ability of the circuit simulator to gain access to the internal register states.
esfmPrintSpecArrays. Whenever an esfmUpdate is performed, the result according to the specification is calculated and stored in the tracing state. The Spec arrays are using an algebraic data type to calculate the correct values of the arrays; this is actually an executable specification. This value is completely independent of the machine state. By using esfmPrintSpecArrays, you can see what the correct arrays should be, according to the specification.
esfmAnalyzeMachineState. You can find out several statistics of the machine state. Also, you can extract these fields for yourself and process them.
esfmSetOption gives the user some control over the execution of the system, including how much trace information is retained and printed. You can turn on tracing of the operations using esfmSetOption EsfmTraceOperation

Annotated examples

This section describes several example programs that illustrate how the system works. Each program listing is interleaved with its output and some commentary explaining what is going on, and all the programs appear in the examples directory. You can experiment by modifying any of the programs and running it.

The examples here use the monadic versions of the ESF operations. However, it is not necessary to encapsulate ESFA operations in a monad! Monads are often used for imperative arrays in Haskell, because single-threaded access is required for imperative arrays, but the section below on Representation and sharing shows how ESFA handles multiple threaded arrays easily. The reasons that a monad is used here are:

The monad makes it possible to illustrate how storage management works. The pure function interface requires the ESFA system to be integrated fully with the Haskell garbage collector; otherwise the system will eventually run out of memory. This integration is not yet done.
It's helpful to print out the machine state at well defined points in the program, and the monad facilitates that.

Creating and accessing arrays

The program examples/InitUpdateLookup.hs shows how to set up a program, run it, initialize the machine, and do some simple array operations, particularly update and lookup. To run it, go into the examples directory, enter ghci InitUpdateLookup, then enter main.

The program begins by importing some required modules.

-- ESFA: InitUpdateLookup
-- Demonstrate initialization, update, lookup, and printing arrays
-- This program is discussed in doc/src/esfa-users-guide.txt

module Main where

import Control.Monad.IO.Class
import Data.Array.ESFA.SimArch

The main program uses sumulateESFM to simulate the circuit, and to run the ESFA algorithm on that simulated circuit. (Parallel implementations, including multicore, GPU, and FPGA, are underway, and will use the same interface but a different run function to use in place of simulateESFM.)

The first argument is the log of the number of memory cells; thus specifying 10 gives a machine with 1024 cells. The second parameter () is the initial value of the user's state, which is not used in this particular program. The script is the user program, running in the StateT monad.

The rest of the program is the body of the script. In it, you can do IO using liftIO. The ESF machine needs to be initialized, setting all the cells in the circuit to empty. We also set a trace option, which will cause the system to print a message each time an update or lookup is performed.

The program uses the StateT monad; StESFA is defined for convenience. The program needs to begin by initializing the system state. It also sets an option, EsfmTraceOperation, that will cause the system to print a message each time an update or other ESFA operation is performed.

main :: IO ()
main = simulateESFM 10 () script

script :: StESFA () Int ()
script =
  do liftIO (putStrLn "ESFA Example InitUpdateLookup")
     esfmInitialize
     esfmSetOption EsfmTraceOperation

Now the program will create some arrays using update. The empty array is called esfmEmpty (for now, at least). It's always present: the user doesn't need to create it, can't create it, and can't destroy it. The only way to get started is by updating esfmEmpty to get a new array. The following definition updates esfmEmpty at index 100 with the value 101. The string parameter "a1" isn't really part of the ESFA system, and isn't present at all in the machine representation. But the esfm API lets you give a string label to an array, which is used by esfmPrintMachineArrays and esfmPrintSpecArrays to make it easier to see which array is which. This is the only purpose of that label. It's possible for esfmUpdate to fail, so it returns a Maybe type. There are several ways to detect and handle errors, but in this demonstration program we just pattern match so that a1 is defined to be the actual array. The update would fail if the memory is completely full, or if the array being updated doesn't exist.

-- Create a couple of arrays
     Just a1 <- esfmUpdate "a1" esfmEmpty (100, 101)
     Just a2 <- esfmUpdate "a2" a1 (200, 201)
-- Expected contents of the arrays:
--     a1 = 100~>101
--     a2 = 200~>201, 100~>101

Since a trace was requested, each operation causes a line to be printed. The first update defines an array with the label "a1", which is created by updating the empty array (whose name is -1) with value 101 at index 100. The result of the update is a new array whose AName is 0. The second operation updates a1, whose AName is 0, and produces a2 with AName = 1.

ESFA Example InitUpdateLookup
esf update: define a1 (name=0) = update -1 [100]=101
esf update: define a2 (name=1) = update 0 [200]=201

Now do some lookups.

-- Fetch and print some array elements
     Just x1 <- esfmLookup a1 100   -- expect 101
     Just x2 <- esfmLookup a2 100   -- expect 101
     Just x3 <- esfmLookup a2 200   -- expect 201
     liftIO (putStrLn (show [x1,x2,x3]))

-- Fetch and print some non-existent array elements
     y1 <- esfmLookup a1 200     -- expect Nothing
     y2 <- esfmLookup a2 300     -- expect Nothing
     liftIO (putStrLn (show [y1,y2]))

Here is the output, which agrees with the expected values.

esf lookup  (name=0)  [100] = Just 101
esf lookup  (name=1)  [100] = Just 101
esf lookup  (name=1)  [200] = Just 201
[101,101,201]
esf lookup  (name=0)  [200] = Nothing
esf lookup  (name=1)  [300] = Nothing
[Nothing,Nothing]

Finally, the program dumps out the entire contents of all arrays in its memory. There is a meta-operation called esfmPrintMachineArrays that examines the states of the cell circuits, works out the arrays that are represented, and prints them out. Each time you do an esfmUpdate, the correct result --- correct according to the executable specification --- is automatically saved in the state. This is using a conventional tree data structure in the RAM to represent the ESF arrays. The esfmPrintSpecArrays operation prints those out.

-- Print the expected and then the actual arrays
     esfmPrintSpecArrays
     esfmPrintMachineArrays

The output is given below; again it agrees with the expected values that are given in comments below the updates.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.

esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.

By comparing the results of the PrintSpecArrays and PrintMachineArrays, you can check that all is well. The results should always be identical (except that printSpecArrays currently shows esfmEmpty).

This section explains how arrays are represented, and how elements can be shared among many arrays. The program used is examples/RepSharing.hs, and it is useful to look at the sections on Representation and machine state and Implementation of update while looking at these examples.

Beginning is standard.

-- ESFA example: RepSharing
-- Show how ESF arrays are represented and demonstrate sharing of elements
-- This program is discussed in doc/src/esfa-users-guide.txt

module Main where

import Control.Monad.IO.Class
import Data.Array.ESFA.SimArch

main :: IO ()
main = simulateESFM 10 () script

There are several operations that print information contained in the system state. In this example program, several of those will be used after each update. For convenience, they are collected into one operation called showState.

esfmPrintState shows the contents of the cells; this is the low level machine representation, and it requires a detailed understanding in order to interpret
esfmPrintMachineArrays is a meta-tool that examines the contents of each cell, works out the arrays that are represented in the system, and prints them out.
esfmPrintSpecArrays is another meta-tool that prints out the arrays that should be in the system, according to the specification. When an update is performed, esfmUpdate uses the specification to define the value of the new array, and puts it in a list of specified arrays hidden in the monadic state. The esfmPrintSpecArrays operation fetches this hidden state and prints the arrays. Naturally this is not a part of the real system; it's just a diagnostic tool.
esfmAnalyzeMachineState goes through all the cells and counts the number that fall into various categories.

showState =
  do esfmPrintState
     esfmPrintMachineArrays
     esfmPrintSpecArrays
     stats <- esfmAnalyzeMachineState
     liftIO (putStrLn (show stats))
     liftIO (putStrLn (take 75 (repeat '-')))

The program begins by defining an array a1[100]=101, and then uses showState.


script :: StESFA () Int ()
script =
  do liftIO (putStrLn "ESFA Example: RepSharing")
     esfmInitialize
     esfmSetOption EsfmTraceOperation

     Just a1 <- esfmUpdate "a1" esfmEmpty (100, 101)
-- a1 = 100-101
     showState

Here is the output of the program, up to and including the showState. The state of the machine contains the array a1, and this is printed in several different ways.

First the update prints a message (requested by setting EsfmTraceOperation) which says that the array with label "a1" has been defined, its AName (stable pointer) is 0, it was created by updating the array with AName = -1 (that's esfmEmpty), and the array element has index 100 containing the value 101.
The next line is the low level cell dump. Only cells with meaningful contents are printed. The meanings of the fields on this line will be discussed below, after the second update.
The esfmPrintMachineArrays analyzes the contents of the cells, works out what is represented, and prints the arrays. This should always show the same results as esfmPrintSpecArrays (except that shows the empty array right now -- that inconsistency should be fixed). This line says that the array with label "a1" has AName = 0, and it has one element [100]=101.
The esfmPrintSpecArrays prints out the arrays as they should be, according to the specification.
The esfmAnalyzeMachineState indicates the size of the memory, the number of arrays and array elements, and a few more statistics pertaining to memory management which will be discussed later.

Loading package esfa-0.0.5 ... linking ... done.
ESFA Example: RepSharing
esf update: define a1 (name=0) = update -1 [100]=101
    0.  *  >    1     1..1     [  100] = 101
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.

MachineStats {ncells = 1024, narrays = 1, nelts = 1, nzombies = 0, nfree = 1023}
---------------------------------------------------------------------------

Now the program updates the array a1, producing a new array a2.

     Just a2 <- esfmUpdate "a2" a1 (200, 201)
-- a2 = 200-201 100-101 
     showState

Again, the set of arrays actually represented in the machine state is the same as the set of arrays according to the specification. There are two arrays; a1 is the same as before (it hasn't changed although it has been updated), and a2 has AName=1, and it contains both the new element [200]=201 and the element [100]=101 that was inherited from a1.

Here is the full output from showState; after that we will examine the internal machine representation in detail.

esf update: define a2 (name=1) = update 0 [200]=201
    0.     >    1     1..2     [  100] = 101
    1.  *  >    2     2..2     [  200] = 201
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.

MachineStats {ncells = 1024, narrays = 2, nelts = 2, nzombies = 0, nfree = 1022}
---------------------------------------------------------------------------

When the arrays are printed out, shared elements appear multiple times; thus the [100] element appears on both the lines for a1 and a2. However, each element actually appears only one place in the machine's memory, and it's shared among the arrays that contain that element. This sharing, and the way it is implemented and exploited, is the essence of ESF arrays.

A tree diagram shown with indented text illustrates the sharing. Array a1 contains the [100] element. Array a2 contains the [200] element, and it also inherits [100]=101 from a1. Thus [100]=101 is shared between a1 and a2.

a1 [100]=101
  a2 [200]=201

Now let us consider in detail the machine representation. The state of the machine is printed with one line for each non-empty cell, showing the fields in the cells. This representation takes some work to interpret, because it's designed to be used by the algorithm, not to be read by users. Here again is the first line:

    0.     >    1     1..2     [  100] = 101

First is the CellId 0. You can think of this as the address of the cell, although the circuit doesn't actually use addresses to access cells (it uses associative searching and masking instead). The update chose the first available cell to hold the new array element, and that was Cell 0.
When an array is created, an AName is chosen for it. The value of the name is arbitrary, as long as it isn't being used for another array. The system simply uses the CellId of the cell that contains the new element as the name of the new array, so the name of the result of the first update is 0 (indicated by "0. > 1" on the trace line from the first update, and the notation "a1 (0)" in the PrintArrays operations; both of these say that this array has name

(Is it always safe to use the CellId of the cell with the new element? Yes -- because it is impossible for this array a1 to be removed without also removing its element [100]=101.) When a1 was created, the update operation returned the int 0, and the value of a1 in the main program is 0. When an array is created, an arbitrary empty cell is chosen to hold its element, and if the CellId of that cell is i, then the name of the array is AName=i. Similarly, a2 has name 1 and code 2.

All the internal calculations identify arrays by their codes, not their names. Furthermore, array codes must frequently be changed, so they are not suitable for references to an array from the outside world. The array names are stable, but the codes are volatile. Therefore the system maintains the Array Name/Code map, which is distributed through the cells. The notation "0. > 1" means that AName=0 is mapped to ACode=1. In other words, array "a1" will always have name 0, and currently it has code=1 but that may change.
The notation 1..2 is the inclusion set of the element, which is represented as a pair of natural numbers low..high. An element belongs to an array with code c if and only if low <= c <= high. Cell 0 says that code 1 corresponds to name 0, which is a1. Cell 1 says that code 2 corresponds to name 1, which is a2. Therefore both a1 and a2 contain the element [100]=101. The element [200]=201 has an inclusion set of 2..2, so it belongs only to a2. Thus the [100] element has inclusion set {ACode 1, ACode 2} which is the same as {AName 0, AName 1}, or {a1, a2}. This element is shared between two arrays. The [200] element belongs only to a2.
The final part of the cell is the array element, consisting of an index and a value. In the machine state this is written as, for example, [100]=101. In some other software tools, an element is written as a mapping from index to value, such as 100~>101.

Here is a more detailed text-tree description of the machine state. It's essentially the same as the machine state printout, but indentation shows the inheritance relationships. The first line can be read as "the array with label a1 has name 0 and code 1; its inclusion set is 1 to 2, and it has value 101 at index 100".

a1 0>1 1..2 [100]=101
  a2 1>2 2..2 [200]=201

If you want to find the elements of an array, say a1, first find its AName (0), then look in cell with CellID=0 to get the ACode (it's 1), then look at all the cells where 1 lies within the low..high interval (in the state right now, the only element with that property is [100]. This identifies the candidates for the elements of the array, but if several of these elements have the same index value, then the most recent update has to be chosen; the lookup algorithm does this by finding the candidate with the specified index that has the narrowest inclusion set interval.

Next the program defines a3 as an update to a2. There is a linear chain of dependencies so far; later we will see non-linear dependencies.

     Just a3 <- esfmUpdate "a3" a2 (300, 301)
-- a3 = 300-301 200-201 100-101 
     showState

The output continues the same pattern seen from the preceding updates. The most important point here is the sharing of array elements. An inclusion set like 3..3 indicates that the element belongs to just one array, but the inclusion sets for the [100] and [200] elements have expanded because these elements now occur also in the new array (a3 with name 2 and currently with code 3).

esf update: define a3 (name=2) = update 1 [300]=301
    0.     >    1     1..3     [  100] = 101
    1.     >    2     2..3     [  200] = 201
    2.  *  >    3     3..3     [  300] = 301
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.

MachineStats {ncells = 1024, narrays = 3, nelts = 3, nzombies = 0, nfree = 1021}
---------------------------------------------------------------------------

There is a simple pattern here because so far each new array is the result of updating the most recently created array.

a1 0>1 1..3 [100]=101
  a2 1>2 2..3 [200]=201
    a3 2>3 3..3 [300]=301

Now the program makes a non-linear update; that is, the next array a4 is defined by updating an older array a2, not the most recently created array.

     Just a4 <- esfmUpdate "a4" a2 (400, 401)
-- a4 = 400-401 200-201 100-101 
     showState

The resulting machine state no longer follows the simple pattern seen before after linear updates. The new array (with name 3) has a code that is one higher than the code of the array that was updated. Since a2 was updated, and its code was 2, the new array must have code 3. But that means the code of a3 must be changed by incrementing it. This is shown in the line Cell 2: previously its mapping was 2>3, but now it is 2>4, and its inclusion set has also changed from 3..3 to 4..4.

esf update: define a4 (name=3) = update 1 [400]=401
    0.     >    1     1..4     [  100] = 101
    1.     >    2     2..4     [  200] = 201
    2.     >    4     4..4     [  300] = 301
    3.  *  >    3     3..3     [  400] = 401
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.

MachineStats {ncells = 1024, narrays = 4, nelts = 4, nzombies = 0, nfree = 1020}
---------------------------------------------------------------------------

The [200] element how has inclusion set 2..4, so this element belongs to a2, a4, and a3 (the arrays with codes 2, 3, and 4). Both the [400] and [300] elements have singleton inclusion sets, and the code for a3 has been changed consistently, so the elements in a3 remain the same.

a1 0>1 1..3 [100]=101
  a2 1>2 2..4 [200]=201
    a4 3>3 3..3 [400]=401
    a3 2>4 4..4 [300]=301

The example program goes on to make several more nonlinear updates. By following the state in detail, it is easy to verify that each array always contains the right elements, that each element is shared among the right arrays, and that the array codes change frequently but these changes are done consistently so as to maintain a correct representation.

     Just a5 <- esfmUpdate "a5" a2 (500, 501)
-- a5 = 500-501 200-201 100-101 
     showState

esf update: define a5 (name=4) = update 1 [500]=501
    0.     >    1     1..5     [  100] = 101
    1.     >    2     2..5     [  200] = 201
    2.     >    5     5..5     [  300] = 301
    3.     >    4     4..4     [  400] = 401
    4.  *  >    3     3..3     [  500] = 501
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [500] = 501.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [500] = 501.

MachineStats {ncells = 1024, narrays = 5, nelts = 5, nzombies = 0, nfree = 1019}
---------------------------------------------------------------------------

     Just a6 <- esfmUpdate "a6" a3 (600, 601)
-- a6 = 600-601 300-301 200-201 100-101 
     showState

esf update: define a6 (name=5) = update 2 [600]=601
    0.     >    1     1..6     [  100] = 101
    1.     >    2     2..6     [  200] = 201
    2.     >    5     5..6     [  300] = 301
    3.     >    4     4..4     [  400] = 401
    4.     >    3     3..3     [  500] = 501
    5.  *  >    6     6..6     [  600] = 601
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [500] = 501.
a6 (5)  =   [100] = 101.  [200] = 201.  [300] = 301.  [600] = 601.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [500] = 501.
a6 (5)  =   [100] = 101.  [200] = 201.  [300] = 301.  [600] = 601.

MachineStats {ncells = 1024, narrays = 6, nelts = 6, nzombies = 0, nfree = 1018}
---------------------------------------------------------------------------

     Just a7 <- esfmUpdate "a7" a5 (700, 701)
-- a7 = 700-701 500-501 200-201 100-101 
     showState

esf update: define a7 (name=6) = update 4 [700]=701
    0.     >    1     1..7     [  100] = 101
    1.     >    2     2..7     [  200] = 201
    2.     >    6     6..7     [  300] = 301
    3.     >    5     5..5     [  400] = 401
    4.     >    3     3..4     [  500] = 501
    5.     >    7     7..7     [  600] = 601
    6.  *  >    4     4..4     [  700] = 701
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [500] = 501.
a6 (5)  =   [100] = 101.  [200] = 201.  [300] = 301.  [600] = 601.
a7 (6)  =   [100] = 101.  [200] = 201.  [500] = 501.  [700] = 701.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [500] = 501.
a6 (5)  =   [100] = 101.  [200] = 201.  [300] = 301.  [600] = 601.
a7 (6)  =   [100] = 101.  [200] = 201.  [500] = 501.  [700] = 701.

MachineStats {ncells = 1024, narrays = 7, nelts = 7, nzombies = 0, nfree = 1017}
---------------------------------------------------------------------------

     Just a8 <- esfmUpdate "a8" a1 (800, 801)
-- a8 = 800-801 100-101
     showState

esf update: define a8 (name=7) = update 0 [800]=801
    0.     >    1     1..8     [  100] = 101
    1.     >    3     3..8     [  200] = 201
    2.     >    7     7..8     [  300] = 301
    3.     >    6     6..6     [  400] = 401
    4.     >    4     4..5     [  500] = 501
    5.     >    8     8..8     [  600] = 601
    6.     >    5     5..5     [  700] = 701
    7.  *  >    2     2..2     [  800] = 801
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [500] = 501.
a6 (5)  =   [100] = 101.  [200] = 201.  [300] = 301.  [600] = 601.
a7 (6)  =   [100] = 101.  [200] = 201.  [500] = 501.  [700] = 701.
a8 (7)  =   [100] = 101.  [800] = 801.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [500] = 501.
a6 (5)  =   [100] = 101.  [200] = 201.  [300] = 301.  [600] = 601.
a7 (6)  =   [100] = 101.  [200] = 201.  [500] = 501.  [700] = 701.
a8 (7)  =   [100] = 101.  [800] = 801.

MachineStats {ncells = 1024, narrays = 8, nelts = 8, nzombies = 0, nfree = 1016}
---------------------------------------------------------------------------

     Just a9 <- esfmUpdate "a9" esfmEmpty (900, 901)
-- a9 = 900-901
     showState

esf update: define a9 (name=8) = update -1 [900]=901
    0.     >    2     2..9     [  100] = 101
    1.     >    4     4..9     [  200] = 201
    2.     >    8     8..9     [  300] = 301
    3.     >    7     7..7     [  400] = 401
    4.     >    5     5..6     [  500] = 501
    5.     >    9     9..9     [  600] = 601
    6.     >    6     6..6     [  700] = 701
    7.     >    3     3..3     [  800] = 801
    8.  *  >    1     1..1     [  900] = 901
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [500] = 501.
a6 (5)  =   [100] = 101.  [200] = 201.  [300] = 301.  [600] = 601.
a7 (6)  =   [100] = 101.  [200] = 201.  [500] = 501.  [700] = 701.
a8 (7)  =   [100] = 101.  [800] = 801.
a9 (8)  =   [900] = 901.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [200] = 201.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [500] = 501.
a6 (5)  =   [100] = 101.  [200] = 201.  [300] = 301.  [600] = 601.
a7 (6)  =   [100] = 101.  [200] = 201.  [500] = 501.  [700] = 701.
a8 (7)  =   [100] = 101.  [800] = 801.
a9 (8)  =   [900] = 901.

MachineStats {ncells = 1024, narrays = 9, nelts = 9, nzombies = 0, nfree = 1015}
---------------------------------------------------------------------------

A tree diagram (shown with indented text) illustrates the sharing. Array a1 contains the [100]=101 element. Array a2 contains [200]=201, and it also inherits [100]=101 from a1. Thus the element [100]=101 is shared between a1 and a2.

Now consider how these arrays are represented in the machine. The crucial point is that each element is represented in just one cell, even if it is shared among many arrays. Here's a tree diagram showing the array codes and the inclusion sets.

Shadowed elements

     Just a5 <- esfmUpdate "a5" a2 (200, mkInt 501)
     Just temp <- esfmLookup "a2" a2 200
     liftIO (putStrLn ("a2[200] = " ++ show temp)) -- expect 201
     Just temp <- esfmLookup "a5" a5 200
     liftIO (putStrLn ("a5[200] = " ++ show temp)) -- expect 501
     showState
-- a5 = 500-501 200-201 100-101

esf update: define a5 (name=4) = update 1 [200]=EsfUserOrdVal 501
esf lookup a2 (name=1)  [200] = Just (EsfUserOrdVal 201)
a2[200] = EsfUserOrdVal 201
esf lookup a5 (name=4)  [200] = Just (EsfUserOrdVal 501)
a5[200] = EsfUserOrdVal 501
    0.     >    1    1..5     [  100] = EsfUserOrdVal 101
    1.  *  >    3    3..5     [  200] = EsfUserOrdVal 201
    2.     >    2    2..2     [  300] = EsfUserOrdVal 301
    3.     >    5    5..5     [  400] = EsfUserOrdVal 401
    4.  *  >    4    4..4     [  200] = EsfUserOrdVal 501
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [200] = 501.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 501.  [200] = 201.

MachineStats {ncells = 16, narrays = 5, nelts = 5, nzombies = 0, nfree = 11}

Before doing some lookups, let's create another array.

     Just a6 <- esfmUpdate "a6" a3 (600, mkInt 601)
     showState
-- a6 = 600-601 300-301 100-101

esf update: define a6 (name=5) = update 2 [600]=EsfUserOrdVal 601
    0.     >    1    1..6     [  100] = EsfUserOrdVal 101
    1.     >    4    4..6     [  200] = EsfUserOrdVal 201
    2.     >    2    2..3     [  300] = EsfUserOrdVal 301
    3.     >    6    6..6     [  400] = EsfUserOrdVal 401
    4.     >    5    5..5     [  200] = EsfUserOrdVal 501
    5.  *  >    3    3..3     [  600] = EsfUserOrdVal 601
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [200] = 501.
a6 (5)  =   [100] = 101.  [300] = 301.  [600] = 601.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a3 (2)  =   [100] = 101.  [300] = 301.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 501.  [200] = 201.
a6 (5)  =   [100] = 101.  [300] = 301.  [600] = 601.

MachineStats {ncells = 16, narrays = 6, nelts = 6, nzombies = 0, nfree = 10}

The next part of the program performs some lookups. In a real program, lookup (and the sparse operations described later) are the only way to fetch data from the machine. To do a lookup, you specify an array and index, and the result is the contents of the array at that index. Naturally this is a Maybe type.

     Just x1 <- esfmLookup "a4" a4 400   -- expect 401
     Just x2 <- esfmLookup "a5" a5 100   -- expect 101
     Just x3 <- esfmLookup "a5" a5 200   -- expect 201
     Just x4 <- esfmLookup "a6" a6 300   -- expect 301
     liftIO (putStrLn (show (map getInt [x1,x2,x3,x4])))

esf lookup a4 (name=3)  [400] = Just (EsfUserOrdVal 401)
esf lookup a5 (name=4)  [100] = Just (EsfUserOrdVal 101)
esf lookup a5 (name=4)  [200] = Just (EsfUserOrdVal 501)
esf lookup a6 (name=5)  [300] = Just (EsfUserOrdVal 301)
[401,101,501,301]

Deletion reclaims exactly the right elements

     esfmDelete a3
     showState

esf delete name=2
    0.     >    1    1..5     [  100] = EsfUserOrdVal 101
    1.     >    3    3..5     [  200] = EsfUserOrdVal 201
    2.  *            2..2     [  300] = EsfUserOrdVal 301
    3.     >    5    5..5     [  400] = EsfUserOrdVal 401
    4.     >    4    4..4     [  200] = EsfUserOrdVal 501
    5.     >    2    2..2     [  600] = EsfUserOrdVal 601
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [200] = 501.
a6 (5)  =   [100] = 101.  [300] = 301.  [600] = 601.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 501.  [200] = 201.
a6 (5)  =   [100] = 101.  [300] = 301.  [600] = 601.

MachineStats {ncells = 16, narrays = 5, nelts = 6, nzombies = 0, nfree = 10}

A cascade of elements can be reclaimed by a deletion.

     esfmDelete a6
     showState

esf delete name=5
    0.     >    1    1..4     [  100] = EsfUserOrdVal 101
    1.     >    2    2..4     [  200] = EsfUserOrdVal 201
    3.     >    4    4..4     [  400] = EsfUserOrdVal 401
    4.     >    3    3..3     [  200] = EsfUserOrdVal 501
esfmPrintMachineArrays: arrays represented in machine state:
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 201.  [200] = 501.

esfmPrintSpecArrays: arrays according to specification:
esfmEmpty (-1)  =
a1 (0)  =   [100] = 101.
a2 (1)  =   [100] = 101.  [200] = 201.
a4 (3)  =   [100] = 101.  [200] = 201.  [400] = 401.
a5 (4)  =   [100] = 101.  [200] = 501.  [200] = 201.

MachineStats {ncells = 16, narrays = 4, nelts = 4, nzombies = 0, nfree = 12}

Notification and zombies

Tagging the element values

The program also contains a few utility functions that perform tagging and untagging of Int data, converting to and from the EsfEltVal type.

mkInt :: Int -> EsfEltVal Int
mkInt n = EsfUserOrdVal n
getInt :: EsfEltVal Int -> Int
getInt (EsfUserOrdVal n) = n
getInt _ = error "element should be EsfUserOrdVal but isn't"

Implementation

Layers and platforms

ESFA is implemented in several layers.

Top layer: macro operations. Each ESFA operation is implemented as a macro-operation defined using the micro operations comprising the middle layer. Every operation is implemented by a small fixed sequence of micro operations; thus the top layer could be implemented by a simple control circuit.
Middle layer: combinators (map, fold, scan). These are operations that act in parallel on every cell in the ESF machine. A parallel map updates each cell by applying a function to the old state, yielding the new state. A parallel fold uses an associative function to combine messages produced from each cell into a single value that is read by the controller. A parallel scan is similar to a fold, but it also updates each cell based on an incoming message.
Bottom layer: parallel implementation of the combinators. The reference implementation is a digital circuit; indeed, the middle layer is chosen because (1) the three combinators are primitive enough that they can be implemented directly in hardware, and (2) they are powerful enough that they can implement all the ESFA operations. However, the combinators can be implemented on other platforms, such as a GPU, multicore CPU, or sequential CPU. Naturally, platforms that provide less parallelism will yield slower implementations of the ESFA operations.

Parallel combinators

Machine state

Each cell contains the following fields:

data PtcCellState a = PtcCellState
  { ptcCellId       :: !Int,
    ptcSelect       :: !Bool,
    ptcMark         :: !Bool,
    ptcZombie       :: !Bool,
    ptcMapDef       :: !Bool,
    ptcMapCode      :: !ACode,
    ptcEltDef       :: !Bool,
    ptcLow, ptcHigh :: !ACode,
    ptcInd          :: !EltIndex,
    ptcValIsArray   :: !Bool,
    ptcValField     :: !(EsfEltVal a) }

During initialization, the cellID fields of the cells are set to 0, 1, 2, ..., and all the other fields are set to False or 0.

An ESF array has a stable name called an AName. This is simply an integer that identifies the array (it is not necessarily the address of the first element of the array, as for conventional imperative arrays). The empty array has an AName = -1.

Each ESF array also has a numeric code, called an ACode. Like an AName, this is an integer that identifies the array. The difference is that the ESF operations frequently need to change the codes for many arrays. The purpose of having an AName is so that the "outside world" can have a stable value to refer to an array. The ACodes are kept within the ESFM and are never sent to the outside world (apart from operations that dump the state, for tracing, instrumentation, and debugging purposes).

If there is an array with AName = n and ACode = c, then cell[n].MapDef = True and cell[n].MapCode = c. Thus we can find out the code for the array with name n using the encode function. The algorithms below are given as pseudo-code; for the full definitions see the src directory.

encode n =
  if cell[n].MapDef
    then Just (cell[n].mapCode]
    else Nothing

The encode function is implemented using a parallel fold.

Update

The update function takes an AName to be updated, and an index and value. It returns the AName of a new array which is just like the argument, except that it has the specified value at the specified index. Thus if b = update a (i,v) then b will satisfy these properties:

lookup b i = v
lookup b j = lookup a j, if i /= j

Update is a macro operation that takes several steps, each performed by a combinator:

The ACode for a is calculated: acode = encode a. This is done by a parallel fold, and requires one clock cycle. The ACode for the resulting array, called acode', is defined as acode' = acode+1 (the controller may spend a clock cycle doing that addition, or could do that in parallel with the next step).
An empty cell is allocated, the Select flag in that cell is set, and the cellID is returned. This is done in one clock cycle by a parallel scan. The scan (technically this is called a "mapping scan") updates each cell by setting or clearing its Select flag, and it returns a result which is the ID of the selected cell. A simplified way to understand how this works is to imagine an accumulator value scanning across the sequence of cells from left to right, checking whether the cell is empty (EltDef = False), and setting Select in the first such cell. The subtlety in the implementation is that this has to be realised by an associative function, so that the parallel tree circuit can do the entire mapping scan in one clock cycle with a gate delay on the order of log N, where there are N cells.
The existing cells are adjusted. The value of acode is broadcast (using the tree circuit) and each cell adjusts its fields as follows; this is performed by a parallel map and takes one clock cycle:

  if EltDef && Low > acode then Low++
  if EltDef && High >= acode then High++
  if MapDef && MapCode > acode then MapCode++

The fields of the selected cell are stored. In theory, this could be done by the same parallel map used in the previous step, but it simplifies the hardware to use several clock cycles to store these fields (for example, the number of wires in the tree datapaths is reduced, which increases the number of cells that can fit on a chip).

MapDef  = True
MapCode = acode'
EltDef  = True
Low     = acode'
High    = acode'
Index   = i
Value   = v

Finally, the AName of the new array (Aname n) is returned.