Remove @dataclass, add slots #144

Gobot1234 · 2020-08-27T12:18:33Z

Adds a custom metaclass for Message to quickly create dataclasses for Messages whilst also slotting them. Internally should still be basically the same as a dataclass.

Benefits:

This will almost certainly be faster than the current approach, I will need to run speed tests on it before I am 100% sure.
Removes an import from all generated messages, and improves visual clarity.

This is incorporates 2 breaking changes:

Removing @dataclass from all generated outputs, since this is for V2 however I think it is fine, there are a couple of options if you want to give warning like monkey patching the dataclass decorator to do nothing for Message subclasses.
Fields cannot now be access outside of instances of themselves, pretty minor change, this shouldn't really affect anyone as they (I think) would always be betterproto.Placeholder, but if necessary it could be remedied by implementing MessageMeta.__getattribute__.

Closes #50

abn · 2020-08-27T14:22:29Z

I am not sure if optimising for performance over usability by default is a good thing. While, I can understand the motivation for this change, I feel that dropping dataclasses means you are loosing a lot of niceties that it brings to the table. I am also unclear on how much of a gain we are talking here. We might also want to talk about making it optional, ie. dataclasses vs slots at compile time.

As an additional note, the alternative is to move from dataclasses to something that already fascilitates slots. The attrs project is a good option for that. Obviously, everything has trade-offs. :)

Gobot1234 · 2020-08-27T14:28:54Z

It isn't dropping anything internally, it just removes the decorator.

abn · 2020-08-27T14:30:13Z

It isn't dropping anything internally, it just removes the decorator.

Ah, I might have spoke too soon then. Will go through the changes. Ignore my reactionary comment. :)

Gobot1234 · 2020-08-27T23:52:04Z

Some tests

Results

Tabled results are for a Message of:

class Message(betterproto.Message):
    foo: int = betterproto.uint32_field(0)
    bar: int = betterproto.uint32_field(1)
    baz: int = betterproto.uint32_field(2)

unless specified otherwise.

Important results:

Test	Old implementation	New slotted implementation
Size of	576 bytes	256 bytes
Runtime Creation	741 usec	38.4 usec
Attribute Access	71.4 nsec	43.8 nsec
Instantiation	10 usec	11 usec

Size of Instance

Tested with 0, 3, 5, 10, 50 field large message instances. Using pympler.asizesof due to sys.getsizeof not being recursive.

Size	0	3	5	10	50
Old	152	576	728	1096	3848
New	208	256	272	312	632

Slots make messages significantly more linear in the amount of memory usage they end up using

Runtime overhead

Old raw times: 74.1 sec, 90.5 sec, 74.1 sec, 82 sec, 87.8 sec

100000 loops, best of 5: 741 usec per loop

Slotted raw times: 4.96 sec, 4.72 sec, 4.17 sec, 3.84 sec, 4.1 sec

100000 loops, best of 5: 38.4 usec per loop

This is ridiculously faster with MessageMeta at around 20x

Attribute Access

Old raw times: 13.2 msec, 8.81 msec, 7.48 msec, 7.28 msec, 7.14 msec

100000 loops, best of 5: 71.4 nsec per loop

Slotted raw times: 4.38 msec, 6.24 msec, 5.9 msec, 4.46 msec, 5.67 msec

100000 loops, best of 5: 43.8 nsec per loop

So it's about 40% faster with slots

Instantiation

Old raw times: 2.34 sec, 1.03 sec, 1 sec, 1.61 sec, 2.55 sec

100000 loops, best of 5: 10 usec per loop

Slotted raw times: 1.25 sec, 1.1 sec, 1.14 sec, 1.12 sec, 1.2 sec

100000 loops, best of 5: 11 usec per loop

Currently instantiation with the current implementation is faster, however only marginally and I think the instantiation can be improved on the new model, or if not we can just use the dataclasses implementation, if it doesn't hurt runtime overhead particularly.

Gobot1234 · 2020-08-29T20:04:22Z

I've optimized init more and it's now faster (by a couple of usec) than the dataclasses approach.

Gobot1234 · 2020-08-29T21:17:40Z

Tests again will fail until #130 is merged

# Conflicts: # src/betterproto/__init__.py # src/betterproto/templates/template.py.j2

src/betterproto/__init__.py

Gobot1234 · 2020-11-07T16:20:12Z

poe bench currently returns

All benchmarks:

       before           after         ratio
     [f10bec47]       [222a105b]
     <master>         <dataslots>
      1.99±0.09μs      1.93±0.2μs      0.96 benchmarks.BenchMessage.time_attribute_access
      10.5±0.4μs       9.01±0.2μs      0.86 benchmarks.BenchMessage.time_attribute_setting
      17.3±1μs         13.1±0.2μs      0.76 benchmarks.BenchMessage.time_init_with_values
      17.4±0.5μs       7.96±0.6μs      0.46 benchmarks.BenchMessage.time_instantiation
      722±30μs          293±9μs        0.41 benchmarks.BenchMessage.time_overhead
      22.5±2μs          22.2±1μs       0.99 benchmarks.BenchMessage.time_serialize

Gobot1234 · 2020-11-07T16:56:43Z

Not too sure why the getattr call isn't much of an improvement as it should be the same just __slot__ boosted?

Gobot1234 · 2021-01-03T16:23:40Z

I'm going to close this for now. I'm sure this can be done better but it requires a lot of work.

Gobot1234 and others added 10 commits August 24, 2020 13:39

First go at dataslots

64da80a

Implement Message.__bool__ for danielgtaylor#130

0d6ced5

Add a test for it

7746b91

Custom Metaclass that removes need for @DataClass

42cd924

Revert private attributes

955bbb3

Delete utils.py

1597c48

Remove brackets

f81d8cd

Cleanup

1b66467

See if this fixes stuff

5056997

Remove debug print

62de49c

Gobot1234 added 3 commits August 27, 2020 21:09

Update template.py.j2

397c6b0

Some bugfixes/optimizations

16cdaa8

Misc comments

8353b24

Fixes

5373229

Gobot1234 added 6 commits August 29, 2020 21:06

Optimize init

b08a036

Fix typing

86a9726

More fixes

18a0ea0

Remove dataclass

f0fb3d1

Bug fixes

18d5ec9

Blacken

13fc75d

Gobot1234 and others added 4 commits August 29, 2020 22:40

Couple of small tweaks

1a07d3b

Merge branch 'master' into dataslots

adc7eba

Blacken + tweak

3ff8213

Blacken again

2d757fc

Gobot1234 and others added 15 commits October 20, 2020 15:54

Blacken + tweak

29c3ad9

Blacken again

8b35c1e

Fixes

bdc1f46

Fixes

23728fe

Fix arg/kwarg handling

4e99449

Fix recursive messages & blacken

008c0d3

Final fix?

6f74c5d

Final final fix

8ed2805

Respond to self comments

35c4ee4

More speed improvements and final changes

68ae143

Add __bool__ to special members

86d7c30

Respond to some of the comments

a304628

Merge remote-tracking branch 'origin/dataslots' into dataslots

14748ae

# Conflicts: # src/betterproto/__init__.py # src/betterproto/templates/template.py.j2

Update __init__.py

5c8e926

Hmm

fec636a

nat-n reviewed Oct 25, 2020

View reviewed changes

src/betterproto/__init__.py Outdated Show resolved Hide resolved

Gobot1234 and others added 9 commits October 25, 2020 20:42

Fix broken branch

31579f6

Simplify bool

f10bec4

Fix broken branch P2 but with changes and a better structure

f48080b

Fix function

5e005d3

Update stuff for slack in case anyone looks

ff712f7

Whoops

529c488

Merge branch 'master' into dataslots

50949b5

Fix tests and benchmarks

3d0fca6

Fix tests

222a105

Gobot1234 closed this Jan 3, 2021

Gobot1234 deleted the dataslots branch January 10, 2021 18:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove @dataclass, add slots #144

Remove @dataclass, add slots #144

Gobot1234 commented Aug 27, 2020

abn commented Aug 27, 2020

Gobot1234 commented Aug 27, 2020

abn commented Aug 27, 2020

Gobot1234 commented Aug 27, 2020 •

edited

Loading

Gobot1234 commented Aug 29, 2020

Gobot1234 commented Aug 29, 2020

Gobot1234 commented Nov 7, 2020 •

edited

Loading

Gobot1234 commented Nov 7, 2020

Gobot1234 commented Jan 3, 2021

Remove @dataclass, add slots #144

Remove @dataclass, add slots #144

Conversation

Gobot1234 commented Aug 27, 2020

abn commented Aug 27, 2020

Gobot1234 commented Aug 27, 2020

abn commented Aug 27, 2020

Gobot1234 commented Aug 27, 2020 • edited Loading

Results

Important results:

Size of Instance

Runtime overhead

Attribute Access

Instantiation

Gobot1234 commented Aug 29, 2020

Gobot1234 commented Aug 29, 2020

Gobot1234 commented Nov 7, 2020 • edited Loading

Gobot1234 commented Nov 7, 2020

Gobot1234 commented Jan 3, 2021

Gobot1234 commented Aug 27, 2020 •

edited

Loading

Gobot1234 commented Nov 7, 2020 •

edited

Loading