Fressian is a self-describing binary serialization format developed for clojure.
This project is hammocked.
In its current state, this is an incomplete implementation of fressian. In addition to a couple missing types, part of the fressian protocol is caching, where you can hand off some data to serializer with a flag, and every subsequent time the serializer sees an "equal" object, it is written as a few bytes at most. As you can imagine for wasm consumers, this is a highly attractive feature, especially if you can agree on a cache table ahead of time and never pay the full cost of marshalling expensive data. This is the most important new feature moving forward, and will require some work.
- simple interface for caching
- custom user caches via traits
- choose your hash algorithm, hard code behavior, etc
- zero-copy, no-move cache-testing
- this may be impossible with heterogeneous types
- enable
&'static mut
caches- should be able to ship a lazy-static global cache & recycle its use
- big perf win for wasm
&mut
byte buffers- in single thread ctx, no reason to reallocate a buffer for every serializer instance
- multi-threaded can pool
- automatic caching via attribute
- serialize impls are pretty rigid. Possibly can wrap them with a Cache trait and ship default impls
- fill out type support
- equivalence between objects
This is easy in JVM & JS because everything is an object, we can use runtime reflection, and we can rely on clojure to do the work for us. For rust though, objects of different types will cannot be compared, and serde will not let us further constrain types with traits (Hash etc) for use in Serializers.
- serde wants to be stateless
Serde's Serialize
want to treat every object the same, and you cannot introduce new trait restrictions on serializable types. Short of thread-local global variables, you cannot write a Serialize
impl that accesses state somewhere to conditionally change serialization strategies.
adhoc reflection, serializing trait objects- tried & failed :-(
- serde-state?
- enable caching only for types that can be converted to a common
Value
enum, but with the constraint of using a dedicated serialization api for the types you intend to cache- This was the original impl for caching. I am going to revert it back
- completely wrap serde with a set of serialization traits designed with caching in mind
- this is probably the most robust way forward, but will break from serde idioms
Am I overlooking something?
When compiled for WebAssembly, serde-fressian can be used to convey rich values to and from clojurescript.
- fress
- an implementation of Fressian for clojurescript designed to interface with serde-fressian's wasm api from javascript.
- cargo-cljs
- a clojurescript library for scripting cargo via nodejs
- No records, BIGINT, BIGDEC, OBJECT_ARRAY, char
- No caching except for the types that require it
- No checksum/validation
- serde::fressian::value needs own Deserializer/Serializer impls, indexing, identity predicates
- plenty of wasm specific optimizations yet to implement
serde-fressian tries to follow the standard serde conventions. Deserializers can accept readers, vecs, and slices. Serializers at this time however only support writing to vecs.
use serde_fressian::ser;
use serde_fressian::de;
let write_data: Vec<String> = vec!["some".to_string(), "strings".to_string()];
let bytes: Vec<u8> = ser::to_vec(&data).unwrap();
// this is strongly typed deserialization
let read_data: Vec<String> = de::from_vec(&bytes).unwrap();
assert_eq!(write_data,read_data)
If you know ahead of time what the bytes are going to contain, you can use strongly typed deserialization to extract your values. This is less flexible but is very fast. If you are unsure of the content, serde_fressian::value::Value
is an enum encompassing all fressian types and will deserialize values as they are described.
use serde_fressian::ser;
use serde_fressian::de;
use serde_fressian::value::{Value};
let write_data: Vec<String> = vec!["some".to_string(), "strings".to_string()];
let bytes: Vec<u8> = ser::to_vec(&data).unwrap();
// this is weakly typed deserialization
let read_data: Value = de::from_vec(&bytes).unwrap();
// Value::LIST(vec![Value::STRING("some".to_string()),Value::STRING("strings".to_string())])
assert_eq!(read_data, Value::from(write_data))
The serde_fressian::wasm
module is designed to interop with fress.wasm
use serde_fressian::error::{Error as FressError};
use serde_fressian::value::{self, Value};
use serde_fressian::wasm::{self};
// called by javascript with ptr to written bytes
#[no_mangle]
pub extern "C" fn echo(ptr: *mut u8, len: usize) -> *mut u8
{
// read a value from javascript
let val: Result<Value, FressError> = wasm::from_ptr(ptr, len);
// from_ptr borrows, Value copies. So must own and free bytes separately
wasm::fress_dealloc(ptr, len);
//serializes the result, hands ownership of resulting bytes over to js
wasm::to_js(val)
}
For my own sanity, most documentation will be over at fress
Fressian was designed by Rich Hickey. There is a clojure wrapper here. There is a design talk here.