-
Notifications
You must be signed in to change notification settings - Fork 6.8k
Change CUB submodule to track Nvidia CUB project. #13322
Conversation
@mxnet-label-bot add [pr-work-in-progress] |
@frankfliu please take a look at the failed CI build and update PR accordingly |
677ba9b
to
eccb00f
Compare
@mxnet-label-bot update [pr-awaiting-review] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good!
Thanks for moving off the copy to the Nvidia cub!
@frankfliu did you do some sanity perf tests on GPU? we probably should for operators using cub. |
@frankfliu Thanks for the contribution. Your description indicates why we're using the name '3rdparty/nvidia_cub' but not why we're switching submodules, which in my opinion is the more important piece of information. Can you give some details around why we're switching back to the NVIDIA repo? If I remember correctly the reason we switched away from using the NVIDIA cub repo was because we had a bunch of force pushes and history re-writes in that repo that were causing us issues. Looking at the current repo it seems like it's in really good shape now. They describe their release process clearly and are making use of release branches. Given they're using release branching, I'd suggest we track a specific stable branch using git 1.8 (and higher) submodule branch feature. I hope this will help to protect us against future force-pushes. |
I've looked into this a little further and I suspect that tracking a branch is still not going to get us any protection against our submodules force pushing. I've opened a question on StackOverflow to see if anyone knows the details around what happens in this case: https://stackoverflow.com/questions/53438716/does-tracking-a-branch-with-a-submodule-protect-you-against-force-pushes |
Looks like adding a branch ref to the submodule isn't necessary / won't alleviate the problem I have in mind. Pending questions for this PR are only
|
@mxnet-label-bot update [pr-awaiting-response] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The submodule is pointing to a release tag v1.8.0
which I think is good.
@lebeg Any idea how to conduct a test on this? It seemed to be an improvement on the performance. |
@frankfliu - Can you please provide next steps here? and few open questions from Kellen. |
This PR is looking good to me, and I don't want to be a blocker. Consider me approving modulo short testing description. I think my "What problems does it solve / feature does it provide" has been answered well in the edited PR description. |
@frankfliu ping for update :) |
@mxnet-label-bot update [pr-awaiting-response] |
Directly change submodule URL will impact every developer. "git submodule update" won't work, developer has to use "git submodule sync" first.
@mxnet-label-bot add [pr-awaiting-merge] remove [pr-awaiting-response] |
@mxnet-label-bot remove [pr-awaiting-response] |
@KellenSunderland Could you please review/merge this PR after the latest changes? |
@frankfliu could you have a look at the incomplete CI? may be it needs to be re-triggered |
@frankfliu Could you please look at the incomplete CI? It seems no updates from last couple of weeks. Can you please look into it? Thanks! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thanks for your contribution!
The PR has been merged. Thank you! |
* Change CUB submodule to track Nvidia CUB project. Directly change submodule URL will impact every developer. "git submodule update" won't work, developer has to use "git submodule sync" first. * retrigger CI
* Change CUB submodule to track Nvidia CUB project. Directly change submodule URL will impact every developer. "git submodule update" won't work, developer has to use "git submodule sync" first. * retrigger CI
* Change CUB submodule to track Nvidia CUB project. Directly change submodule URL will impact every developer. "git submodule update" won't work, developer has to use "git submodule sync" first. * retrigger CI
…)" This reverts commit b6eac1d.
Description
Current cub submodule is pointing to DLMC's cub repo, the repo doesn't have customization of nvidia's cub project. The historic reason to use DLMC's cub is:
Since both above issue has been resolved on Nvidia side, we should move back to Nvidia's cub project to reduce upgrade/merge effort.
The proposal has been discussed on: https://lists.apache.org/thread.html/6c10ab4746ac15b08fb5ac7281abdfdfcd5b534fd3838200941cd2ca@%3Cdev.mxnet.apache.org%3E.
Directly change submodule URL will impact every developer.
"git submodule update" won't work, developer has to use
"git submodule sync" first.
To avoid impact every developer, remove CUB and add a new
"nvidia_cub" to workaround this issue.
Checklist
Essentials
Please feel free to remove inapplicable items for your PR.