-
Notifications
You must be signed in to change notification settings - Fork 217
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
swingset hierarchical object identifiers #455
Comments
I think the concept embodied by your "representative" idea is expressed by the word "plenipotentiary", defined (in the context of diplomacy) as "a person, especially a diplomat, invested with the full power of independent action on behalf of their government, typically in a foreign country", i.e., a thing that has the full power to act as the object without actually being the object. That said, "plenipotentiary" is a mouthful and also hard to type. |
I don't care for the notional constructor API, at least as it is used in the |
Hm, is there any way we could prototype the constructor/dehydrator part without first figuring out the data model / |
In today's meeting we sketched out some approaches for the user-level API. Dean's initial versionconst purse = (me, c) => ({
deposit(other) {
const otherBalance = c.get(other).balance;
c.update(me, { balance: me.balance + otherBalance});
c.update(other, { balance: 0 });
}
});
const c = liveslots.createContainer(purse);
function mint(initialBalance) {
return c.create({ balance: initialBalance});
} Dean's explicit state version, just balance🚀 🚀 🚀 Current winner const purse = (me, c) => ({
deposit(other) {
const myBalance = c.get(me);
const otherBalance = c.get(other);
c.update(me, myBalance + otherBalance);
c.update(other, 0);
}
});
const c = liveslots.createContainer(purse);
function mint(initialBalance) {
return c.create(initialBalance);
} Dean's "me" is external versionconst purse = (me, c) => ({
deposit(other) {
const otherBalance = c.get(other).balance;
const myBalance = c.get(me).balance;
c.update(me, { balance: myBalance + otherBalance});
c.update(other, { balance: 0 });
}
});
const c = liveslots.createContainer(purse);
function mint(initialBalance) {
return c.create({ balance: initialBalance});
} Promise version// Dean's explicit state version, just balance, promise
const purse = (me, c) => ({
async deposit(other) {
const o = await other;
// AWAIT /////
const myBalance = c.get(me);
const otherBalance = c.get(o);
c.update(me, myBalance + otherBalance);
c.update(o, 0);
}
});
const c = liveslots.createContainer(purse);
function mint(initialBalance) {
return c.create(initialBalance);
} |
Another idea Dean presented was how to save space by effectively compressing the kernel tables. In my original thinking, the exporting vat has a clist entry like Dean's first point was to make the kernel more aware of this scheme. The kernel object table only exists to figure out where to route messages to a given object, and we could effectively compress it by allowing kernel object IDs to be hierarchical as well. In this form, the client vat clist would say So far, the comms vat sees a zillion separate identifiers just like any other client vat (although we should be able to use this same Container scheme to keep that state out of RAM). But in the second step, we could choose to give the comms vat more power, by letting clist entries point to an entire prefix, and allowing the vat to make up whatever suffix it wants. Here, the comms vat clist would say In this approach, the comms vat can access Purses that nobody has sent through it, but we already rely upon the comms vat for a lot. The benefit is that the comms vat clist is now compressed (one entry per Issuer, not per Purse), and we aren't adding vat-mint c-lists entries for each Purse that goes out to an external machine. The comms vat must still maintain internal clists of size N, because we wouldn't want to grant the remote machine access to all purses. |
Liveslots does not yet provide any `vatGlobals`, but this ensures that any ones it does provide in the future will be added to the `endowments` on the vat's Compartment. We'll use this to populate `makeExternalStore` -type functions, which want to be globals (because threading them from `vatPowers` into modules that need them would be too annoying), but must also be in cahoots with closely-held WeakMaps and "Representative" state management code from liveslots, as described in #455 and #1846. closes #1867
Everything we talked about here was implemented in #1907 |
We had a long meeting today to discuss Dean's idea (which I tried and failed to capture back in #54 (comment)) about nested object identifiers. (for internal reference, the recording of our meeting is in the Agoric internal drive, Engineering / 2020-01-24 hierarchical object refs in swingset)
The problem this addresses is the vat-side liveslots tables. All of the kernel-side tables are presumed to live primarily on disk, so they can be of arbitrarily large size without causing memory problems. The vat-side liveslots table maps from inbound
o+4
-type reference identifiers to local Javascript objects/promises that can be the targets of inbound messages or arguments to the same. This same (bidirectional) table is also used to serialize objects/promises in outbound arguments. We only talked about object references, but I suspect promise references can be handled in basically the same way.kernel changes
Kernel-side object references (generally spelled
koNN
, and used to index a table that tracks which vat owns each, so messages can be routed to the correct vat) remain unchanged. The kernel-side c-lists map these kernel-widekoNN
identifiers to vat-specifico+NN
oro-NN
identifiers. These vat-side IDs will be changed to have a hierarchical identifier:o+NN.MM
(or beyond:o+NN.MM.SS.QQ
etc). The kernel does not parse or interpret any of that:o+1.2
ando+1.3
are entirely different objects, as far as it knows. It's important to note that one client vat having access to e.g.o+1.2
has no bearing on it getting access too+1.3
: each sub-id has an entirely separate identity and access-control meaures.inbound liveslots
Within liveslots is where things get interesting. The inbound table lookup will parse the kernel-provided identifier into an initial portion and a tail (the
car
and thecdr
, for us Lisp fans). In this simple case,o+1.2
gets split intoo+1
and2
(but more complex nested cases, likeo+1.2.3.4
, are conceivable).The inbound deserialization code then looks
o+1
up in the table to get a "container" object. It then invokes a.hydrate()
method on that container object and provides tail (2
) as an argument. (in the complex case, it would call.hydrate('2.3.4')
, and recursive lookup/construction would be performed).The container object is responsible for creating a new representative object to serve as the target for (or argument of) an inbound message. It will use some TBD database syscall to fetch the data necessary to construct this object, and will then invoke a constructor function provided back when the object identifier was created. The resulting object should be short-lived: it might be retained by a Promise
.then
or two, but it should not be held in any long-lived table.The liveslots layer will maintain a
WeakMap
from this generated object to the kernel-side object ID. If this object is sent outbound, this weakmap is used to serialize it back into the same object ID. This weakmap should remain small: only objects used by in-flight operations should appear here, whereas objects that are referenced by other vats but not actually involved in current operations should not be instantiated or tracked by the weakmap.creating containers and items
The process starts when application-level code invokes some liveslots facility to create a "Container", perhaps
c = liveslots.createContainer()
. At this time, the liveslots layer (probably) allocates an object ID for the container (e.g.o+1
).Later, some application-level code can create a new item within this container (e.g.
o = c.create(initialData, constructor)
). This allocates a sub-id (e.g..2
), uses the DB interface to store the initial data under that ID, then invokes theconstructor
function with the initial data to build the object representative. This representative is added to the WeakMap, then returned. If/when application code includes the representative in a message (or promise resolution), the WeakMap will recognize it and serialize it aso+1.2
to the kernel. When the application-level code in the vat stops referencing the representative, it will be GCed and removed. At that point, no Javascript object exists that represents the item. The kernel-side c-list will have a row that includes theo+1.2
identifier, and there will be a correspondingkoNN
kernel object table entry. The vat-side liveslots table (on disk, somehow) will have an entry mappingo+1.2
to the current data of that object. But there will be no actual JavascriptObject
for it, until a new message arrives referencingo+1.2
.The constructors could be tracked in a WeakMap (and assigned an integer), so the DB record could record the fact that
o+1.2
is associated with constructor number 3, and then map from3
to the specificFunction
object. This way we don't need to track N separate constructor functions for N virtual target objects (which would defeat the purpose). Or we could require that each container have exactly one constructor function (which might be easier to reason about, and would perhaps make upgrade / schema changes easier in the future).At some point in the future,
o+1.2
is delivered back into the vat. Liveslots mapso+1
to the container object, upon which.hydrate('2')
is called. This looks up2
in the DB, which gets us the constructor function and the current data contents. A new representative is built with the constructor, and either delivered the message (as a target) or is referenced as an argument to some other message.object identity
We very much want to avoid using
WeakRefs
(Javascript doesn't have them yet, but they've been proposed), because it is difficult to make the finalizers run in a deterministic order (necessary for consensus-based swingset hosts). As a result, we aren't going to guarantee that the same object reference appearing in two successive inbound messages will be deserialized to the same object representative. (We might decide that two copies of the same reference appearing in the same message would get the same representative, by using one WeakMap per inbound message, and discarding it at the end of the crank, but we didn't come to a firm conclusion about it).Therefore application code should not be performing EQ on these objects or using them as the keys of any tables. They are to be used for their encapsulated data, not their identity.
We didn't use this terminology during the meeting, but I'm now starting to like the idea of calling these things "representatives", in contrast to the virtual object that they represent. The virtual object lives only in the database. Each time it gets referenced, the liveslots deserialization creates a new short-lived representative to perform the object's functions.
This raises an interesting question of how (or whether) multiple representative for the same virtual object might coordinate with each other.
object constructors
Our basic assumption is that these objects will implement behavior that needs to read the object state from the DB, modify it in some way, then write the modified state back (after which point the object representative can be dropped). We talked about the "React pattern", in which the
container.create(initialData, constructor)
call would get a constructor function that looked something like this:Some ideas here:
useState()
gets a successive slot of the database recorduseState()
is creating a new record and filling it withinitialData
), or the second/etc time (in which caseuseState()
is reading the current state from the DB and ignoringinitialData
)makeCounter()
call is made for each message (one per representative)count
to not change during its operation: we define some granularity or window of time, and exactly one read happens at the beginning of that window, and any write won't happen until the endWe talked about giving the code an accessor for its state instead, so it could survive being used for multiple operations, but didn't come up with anything conclusive:
The text was updated successfully, but these errors were encountered: