Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

chore: update rowset sharding to use split points as start keys #1218

Merged
merged 2 commits into from
Apr 14, 2022

Conversation

igorbernstein2
Copy link
Contributor

RowSetUtil has 2 methods to split a RowSet: split and shard.
Split is used for ReadRows resumption, so the split needs to be included in the segment.
Shard is used for map/reduce style frameworks to parallelize the work. In this case the split points should be treated as start keys (since they usually come from SampleRowKeys and align with the start of a tablet. This PR fixes the shard behavior to use splits as start keys. It introduces some code duplication but overall makes it easier to reason about the 2 behaviors

@igorbernstein2 igorbernstein2 requested a review from a team as a code owner April 1, 2022 14:43
@product-auto-label product-auto-label bot added the api: bigtable Issues related to the googleapis/java-bigtable API. label Apr 1, 2022
@igorbernstein2 igorbernstein2 requested a review from mutianf April 1, 2022 14:45
Change-Id: I148c508de1abf43ebf6ecd0933cc3f13d9fa7fb4
@igorbernstein2 igorbernstein2 added the automerge Merge the pull request once unit tests and other checks pass. label Apr 1, 2022
@gcf-merge-on-green
Copy link

Merge-on-green attempted to merge your PR for 6 hours, but it was not mergeable because either one of your required status checks failed, one of your required reviews was not approved, or there is a do not merge label. Learn more about your required status checks here: https://help.github.com/en/github/administering-a-repository/enabling-required-status-checks. You can remove and reapply the label to re-run the bot.

@gcf-merge-on-green gcf-merge-on-green bot removed the automerge Merge the pull request once unit tests and other checks pass. label Apr 2, 2022
@igorbernstein2 igorbernstein2 added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Apr 7, 2022
@yoshi-kokoro yoshi-kokoro removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Apr 7, 2022
@igorbernstein2 igorbernstein2 added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Apr 13, 2022
@yoshi-kokoro yoshi-kokoro removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Apr 13, 2022
@product-auto-label product-auto-label bot added the size: l Pull request size is large. label Apr 14, 2022
@igorbernstein2 igorbernstein2 added the owlbot:run Add this label to trigger the Owlbot post processor. label Apr 14, 2022
@gcf-owl-bot gcf-owl-bot bot removed the owlbot:run Add this label to trigger the Owlbot post processor. label Apr 14, 2022
@igorbernstein2 igorbernstein2 added the automerge Merge the pull request once unit tests and other checks pass. label Apr 14, 2022
@gcf-merge-on-green gcf-merge-on-green bot merged commit 1ed6755 into googleapis:main Apr 14, 2022
@gcf-merge-on-green gcf-merge-on-green bot removed the automerge Merge the pull request once unit tests and other checks pass. label Apr 14, 2022
mpeddada1 pushed a commit that referenced this pull request Apr 22, 2022
RowSetUtil has 2 methods to split a RowSet: split and shard.
Split is used for ReadRows resumption, so the split needs to be included in the segment.
Shard is used for map/reduce style frameworks to parallelize the work. In this case the split points should be treated as start keys (since they usually come from SampleRowKeys and align with the start of a tablet. This PR fixes the shard behavior to use splits as start keys. It introduces some code duplication but overall makes it easier to reason about the 2 behaviors
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
api: bigtable Issues related to the googleapis/java-bigtable API. size: l Pull request size is large.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants