Unbreak datafusion #1482

AhmedSoliman · 2024-04-29T17:24:15Z

Unbreak datafusion

Stack created with Sapling. Best reviewed with ReviewStack.

github-actions · 2024-04-29T17:57:29Z

Test Results

99 files +4 99 suites +4 8m 39s ⏱️ +52s
84 tests +2 82 ✅ +2 2 💤 ±0 0 ❌ ±0
216 runs +4 210 ✅ +4 6 💤 ±0 0 ❌ ±0

Results for commit 348b18a. ± Comparison against base commit 05eb4ba.

♻️ This comment has been updated with latest results.

igalshilman

Thanks @AhmedSoliman ! The changes look good to me, I'll test the PR locally in a bit, but otherwsie everything looks great!

igalshilman · 2024-04-30T07:50:21Z

crates/storage-query-datafusion/src/state/table.rs

-            for_each_state(schema, tx, rows);
-            Ok(())
-        };
-        stream_builder.spawn_blocking(background_task);


@AhmedSoliman what are your thoughts about drooping the spawn_blocking for these scans?
Is it a part of your plan mentioned offline of having eventually a dedicated runtime for df/rocksdb?

I'd assume these background threads will take awhile for large enough partitions.

Yes, we can move to normal async tasks once we move make the underlying db operations async. Whether this runs on its own runtime or sharing the runtime with the rest of the system is adjacent though.

igalshilman · 2024-04-30T08:05:02Z

crates/storage-query-datafusion/src/table_providers.rs

+}
+
+#[async_trait]
+impl<T, S> TableProvider for PartitionedTableProvider<T, S>


💯 very nice, it reads very well

tillrohrmann

Great work @AhmedSoliman. It is impressive how quickly you restructured our RocksDB layout! Really happy about how you changed it.

This PR looks good to me. +1 for merging.

tillrohrmann · 2024-04-30T07:54:58Z

crates/storage-query-datafusion/src/invocation_state/table.rs

    ) -> SendableRecordBatchStream {
+        let range = PartitionKey::MIN..=PartitionKey::MAX;
        let status = self.0.clone();


nit and outside of this PR: It seems that were are using status and state interchangeably in this module. Maybe something to pull straight at some point to avoid confusion with the invocation_status table.

tillrohrmann · 2024-04-30T07:56:47Z

crates/storage-query-datafusion/src/inbox/table.rs

+        let mut transaction = partition_store.transaction();
+        let rows = transaction.all_inboxes(range);


Out of curiosity, why do create a transaction here where for other table implementations this is not done?

I don't really know the origin but possibly to get a stable snapshot of the entire state?

tillrohrmann · 2024-04-30T08:00:28Z

crates/storage-query-datafusion/src/invocation_status/table.rs

+    async fn scan_partition_store(
+        partition_store: PartitionStore,
+        tx: Sender<Result<RecordBatch, datafusion::error::DataFusionError>>,
        range: RangeInclusive<PartitionKey>,
        projection: SchemaRef,
-    ) -> SendableRecordBatchStream {
-        let db = self.0.clone();
-        let schema = projection.clone();
-        let mut stream_builder = RecordBatchReceiverStream::builder(projection, 16);
-        let tx = stream_builder.tx();
-        let background_task = move || {
-            let rows = db.all_invocation_status(range);
-            for_each_status(schema, tx, rows);
-            Ok(())
-        };
-        stream_builder.spawn_blocking(background_task);
-        stream_builder.build()
+    ) {
+        let rows = partition_store.all_invocation_status(range);
+        for_each_status(projection, tx, rows).await;


Why is it ok to run this operation on the calling thread instead of spawning a task on the blocking thread pool as before? Maybe related question: Why are some tables spawning tasks on a blocking thread pool and others not?

Ok, it seems that the previous implementation of the df tables were not consistent (this one using a blocking send while others used non-blocking send, some implementations using transactions for reads, others read directly from the storage).

because it's now async, the caller wraps it in a task.

tillrohrmann · 2024-04-30T08:15:09Z

crates/storage-query-datafusion/src/table_providers.rs

+}
+
+impl<T, S> PartitionedTableProvider<T, S> {
+    pub(crate) fn new(processors_manager: S, schema: SchemaRef, partition_scanner: T) -> Self {


maybe s/processors_manager/partition_selector/?

tillrohrmann · 2024-04-30T12:31:36Z

crates/storage-query-datafusion/src/partition_store_scanner.rs

+
+            Ok(())
+        };
+        stream_builder.spawn(background_task);


We are not spawn blocking because we assume that the background_task won't do too much blocking I/O, right?

**IMPORTANT:** This breaks queries through datafusion until we workout how data fusion will shard queries across partitions.

This was referenced Apr 29, 2024

Introducing per-partition PartitionStore #1475

Merged

Rename storage-rocksdb to partition-store #1476

Merged

PartitionProcessorManager as long-living service #1481

Merged

AhmedSoliman force-pushed the pr1482 branch from 7e3482e to ccb7dc3 Compare April 29, 2024 17:26

AhmedSoliman marked this pull request as ready for review April 29, 2024 17:27

AhmedSoliman force-pushed the pr1482 branch from ccb7dc3 to e7350bc Compare April 29, 2024 17:30

AhmedSoliman requested review from tillrohrmann and igalshilman April 29, 2024 17:30

AhmedSoliman force-pushed the pr1482 branch 2 times, most recently from 688194b to cbf3e90 Compare April 30, 2024 07:12

AhmedSoliman mentioned this pull request Apr 30, 2024

PartitionId as NewType #1483

Merged

igalshilman approved these changes Apr 30, 2024

View reviewed changes

AhmedSoliman force-pushed the pr1482 branch from cbf3e90 to 348b18a Compare April 30, 2024 10:38

AhmedSoliman mentioned this pull request Apr 30, 2024

Betters shutdown logging #1484

Merged

tillrohrmann approved these changes Apr 30, 2024

View reviewed changes

tillrohrmann reviewed Apr 30, 2024

View reviewed changes

AhmedSoliman added 4 commits April 30, 2024 13:56

Introducing per-partition PartitionStore

65a20aa

**IMPORTANT:** This breaks queries through datafusion until we workout how data fusion will shard queries across partitions.

Rename storage-rocksdb to partition-store

7bf3d22

PartitionProcessorManager as long-living service

25e45df

Unbreak datafusion

8ee4120

AhmedSoliman force-pushed the pr1482 branch from 348b18a to 8ee4120 Compare April 30, 2024 13:02

AhmedSoliman merged commit 8ee4120 into main Apr 30, 2024
10 checks passed

AhmedSoliman deleted the pr1482 branch April 30, 2024 13:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unbreak datafusion #1482

Unbreak datafusion #1482

AhmedSoliman commented Apr 29, 2024 •

edited

Loading

github-actions bot commented Apr 29, 2024 •

edited

Loading

igalshilman left a comment

igalshilman Apr 30, 2024

AhmedSoliman Apr 30, 2024

igalshilman Apr 30, 2024

tillrohrmann left a comment

tillrohrmann Apr 30, 2024

tillrohrmann Apr 30, 2024

AhmedSoliman Apr 30, 2024

tillrohrmann Apr 30, 2024

tillrohrmann Apr 30, 2024

AhmedSoliman Apr 30, 2024

tillrohrmann Apr 30, 2024

tillrohrmann Apr 30, 2024

		let mut transaction = partition_store.transaction();
		let rows = transaction.all_inboxes(range);

Unbreak datafusion #1482

Unbreak datafusion #1482

Conversation

AhmedSoliman commented Apr 29, 2024 • edited Loading

github-actions bot commented Apr 29, 2024 • edited Loading

Test Results

igalshilman left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tillrohrmann left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

AhmedSoliman commented Apr 29, 2024 •

edited

Loading

github-actions bot commented Apr 29, 2024 •

edited

Loading