Unlock utxos when TxPublish fails #1827

t-bast · 2021-05-27T06:49:35Z

This PR refactors the TxPublisher and adds some new features:

We now spawn a FSM for each tx publication: this makes the publish steps more clear and easier to extend
We return a TxPublishResult, that will in the future allow the parent TxPublisher to manage the list of pending txs and RBF when needed
We don't spend our anchor output when a commit tx is already confirmed
We unlock utxos when tx publishing fails (which happens when we restart eclair, which re-sends channel txs to the publisher who will try to publish them but fail because they're already in the mempool)

It can be useful to review commit by commit to see each change individually.

Isolate the tx publishing logic inside a dedicated actor. One actor will be created for each tx that should be published.

Use distinct behaviors for each phase of tx publishing. This will let us handle errors more easily and apply custom clean-up depending on what phase we were in (e.g. unlock utxos after funding).

If our commit tx or the remote commit tx has been confirmed, there's no need to claim our anchor output.

When we fail to publish a transaction to which we added wallet inputs, we must unlock these utxos. This can happen for various reasons, the most frequent one is a node restart (all txs are sent to the TxPublisher again, but they may already be in the mempool).

pm47 · 2021-06-07T12:53:24Z

Regarding the first commit (9ed346e):

have you considered handing the delayed txs in the TxPublish actor? Just my first reaction when looking at the code
it seems the new asynchronous publication effectively bypasses the singleThreadExecutionContext and we lose the sequentiality guarantee

t-bast · 2021-06-07T13:00:56Z

have you considered handing the delayed txs in the TxPublish actor? Just my first reaction when looking at the code

Yes, all txs will be handled by the TxPublish actor in the future, but one step at a time (I start with the ones that need fee-bumping and will later add the ones for which it's optional).

it seems the new asynchronous publication effectively bypasses the singleThreadExecutionContext and we lose the sequentiality guarantee

That's true. But I really think moving to dedicated actors is important before adding more features...
The only tx that needs to be broadcast first is the commit tx, maybe we can explicitly re-broadcast it in the first step and completely get rid of singleThreadExecutionContext?

pm47 · 2021-06-07T13:01:58Z

eclair-core/src/main/scala/fr/acinq/eclair/channel/TxPublisher.scala

+            // We retry when the next block has been found, we may have more funds available.
+            val nextBlockCount = nodeParams.currentBlockHeight + 1
+            val cltvDelayedTxs1 = cltvDelayedTxs + (nextBlockCount -> (cltvDelayedTxs.getOrElse(nextBlockCount, Seq.empty) :+ result.cmd))


That's a bit hacky, the transaction isn't really cltv-delayed here. Also, there is a potential risk of creating a herd effect if our wallet balance is low. That would be a nice stress test of our ability to concurrently fund many transactions and manages locks, but I'm not sure it's a good thing.

That will be improved later, when we'll introduce deadlines (block heights at which we should republish/bump some txs), but that requires more work. In the meantime I'd rather err on the safe side and republish aggressively to ensure we don't forget to publish an important transaction because of an unknown error.

pm47 · 2021-06-07T13:03:11Z

it seems the new asynchronous publication effectively bypasses the singleThreadExecutionContext and we lose the sequentiality guarantee

That's true. But I really think moving to dedicated actors is important before adding more features...
The only tx that needs to be broadcast first is the commit tx, maybe we can explicitly re-broadcast it in the first step and completely get rid of singleThreadExecutionContext?

IIRC, the most probable scenario would be local-commit and htlc-success

t-bast · 2021-06-07T13:04:57Z

The only scenario where it's an issue is the local commit tx, it must be published before its children, otherwise the rest is CSV-delayed so it's not impacted.

We ensure that the behavior we rely on works as expected.

We treat each transaction separately, which works fine in most cases and improves parallelism. But there are cases where we must ensure some sequentiality: - for standard commitments: commit tx must be published before htlc txs - for anchor output commitments: commit tx must be published before its anchor We previously relied on a custom execution context, but that doesn't work now that we create one actor per transaction.

t-bast · 2021-06-09T10:23:33Z

it seems the new asynchronous publication effectively bypasses the singleThreadExecutionContext and we lose the sequentiality guarantee

This is fixed in 81a1b51
I chose to optimistically publish txs and react on failures by publishing the parent first.
The most impacted case if htlc txs in non-anchor output commitments (because they aren't csv-delayed).

t-bast · 2021-06-22T15:19:02Z

Closed in favor of #1844

t-bast added 5 commits May 27, 2021 08:34

Create per-tx child actor

9ed346e

Isolate the tx publishing logic inside a dedicated actor. One actor will be created for each tx that should be published.

Separate FSM states inside TxPublish

f2d54d3

Use distinct behaviors for each phase of tx publishing. This will let us handle errors more easily and apply custom clean-up depending on what phase we were in (e.g. unlock utxos after funding).

Only use anchor when commit is unconfirmed

1365625

If our commit tx or the remote commit tx has been confirmed, there's no need to claim our anchor output.

Move TxPublish to separate file

3dc677f

t-bast requested a review from pm47 May 27, 2021 06:49

pm47 reviewed Jun 7, 2021

View reviewed changes

t-bast added 2 commits June 9, 2021 10:34

Enrich isTransactionOutputSpendable tests

5c93056

We ensure that the behavior we rely on works as expected.

t-bast mentioned this pull request Jun 18, 2021

Rework TxPublisher #1844

Merged

t-bast closed this Jun 22, 2021

t-bast deleted the tx-publisher-unlock-utxos branch June 22, 2021 15:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unlock utxos when TxPublish fails #1827

Unlock utxos when TxPublish fails #1827

t-bast commented May 27, 2021

pm47 commented Jun 7, 2021

t-bast commented Jun 7, 2021 •

edited

Loading

pm47 Jun 7, 2021

t-bast Jun 8, 2021 •

edited

Loading

pm47 commented Jun 7, 2021

t-bast commented Jun 7, 2021

t-bast commented Jun 9, 2021 •

edited

Loading

t-bast commented Jun 22, 2021

Unlock utxos when TxPublish fails #1827

Unlock utxos when TxPublish fails #1827

Conversation

t-bast commented May 27, 2021

pm47 commented Jun 7, 2021

t-bast commented Jun 7, 2021 • edited Loading

pm47 Jun 7, 2021

Choose a reason for hiding this comment

t-bast Jun 8, 2021 • edited Loading

Choose a reason for hiding this comment

pm47 commented Jun 7, 2021

t-bast commented Jun 7, 2021

t-bast commented Jun 9, 2021 • edited Loading

t-bast commented Jun 22, 2021

t-bast commented Jun 7, 2021 •

edited

Loading

t-bast Jun 8, 2021 •

edited

Loading

t-bast commented Jun 9, 2021 •

edited

Loading