auto data sync between different devices #6549

Closed
jacquesqiao opened this issue Dec 13, 2017 · 4 comments · Fixed by #6720


jacquesqiao commented Dec 13, 2017

project: #6403
In a sequence of operators, some run on the GPU while others can only run on the CPU. We need to automatically insert operators that copy data between host memory and device memory, so that the whole graph can run in a multi-device environment.

Some problems:

  1. Currently we look up variables in the scope by name, and the copies of a variable on different devices should have different names. So if we insert an operator to sync data between devices, we also have to rewrite the input names of the downstream operators (a sketch follows).
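To make problem 1 concrete, here is a rough sketch of the kind of graph pass this implies. The types and names (OpDesc, InsertCopyOps, the "@GPU" suffix) are made up for illustration and are not Paddle's actual API:

```cpp
#include <map>
#include <string>
#include <vector>

enum class Place { kCPU, kGPU };

// Hypothetical, simplified op description: real OpDesc has named slots etc.
struct OpDesc {
  std::string type;
  std::vector<std::string> inputs;
  std::vector<std::string> outputs;
  Place place;
};

std::vector<OpDesc> InsertCopyOps(const std::vector<OpDesc>& ops) {
  std::vector<OpDesc> result;
  std::map<std::string, Place> var_place;  // where the latest copy of each var lives
  for (OpDesc op : ops) {  // take a copy so we can rewrite its input names
    for (auto& in : op.inputs) {
      auto it = var_place.find(in);
      if (it != var_place.end() && it->second != op.place) {
        // Insert a copy op whose output is the renamed variable, then point
        // the consumer at the new name -- this renaming is problem 1 above.
        std::string copied = in + (op.place == Place::kGPU ? "@GPU" : "@CPU");
        result.push_back({"copy", {in}, {copied}, op.place});
        var_place[copied] = op.place;
        in = copied;
      }
    }
    for (const auto& out : op.outputs) var_place[out] = op.place;
    result.push_back(op);
  }
  return result;
}
```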

jacquesqiao commented Dec 13, 2017

After discussion, there are several ways to implement this:

  1. Automatically copy data in OperatorWithKernel::Run() according to the device.
  2. Automatically copy data in Tensor::data(const Device& device) (a sketch follows the list).
  3. Add an op to copy data (not a good approach).
  4. Use CUDA Unified Memory (https://devblogs.nvidia.com/parallelforall/unified-memory-in-cuda-6/).

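As a rough illustration of option 2, here is a minimal sketch of a tensor whose data(place) lazily copies its buffer to the requested place, so kernels never see memory on the wrong device. The class and field names are hypothetical, not Paddle's real Tensor:

```cpp
#include <cuda_runtime.h>
#include <cstdlib>

enum class Place { kCPU, kGPU };

class Tensor {
 public:
  explicit Tensor(size_t bytes) : bytes_(bytes) {
    cpu_ptr_ = std::malloc(bytes_);
    place_ = Place::kCPU;
  }
  ~Tensor() {
    std::free(cpu_ptr_);
    if (gpu_ptr_) cudaFree(gpu_ptr_);
  }

  // Returns a pointer valid on `place`, copying across devices if the
  // most recent copy currently lives elsewhere.
  void* data(Place place) {
    if (place != place_) {
      if (place == Place::kGPU) {
        if (!gpu_ptr_) cudaMalloc(&gpu_ptr_, bytes_);
        cudaMemcpy(gpu_ptr_, cpu_ptr_, bytes_, cudaMemcpyHostToDevice);
      } else {
        cudaMemcpy(cpu_ptr_, gpu_ptr_, bytes_, cudaMemcpyDeviceToHost);
      }
      place_ = place;
    }
    return place == Place::kCPU ? cpu_ptr_ : gpu_ptr_;
  }

 private:
  size_t bytes_;
  void* cpu_ptr_ = nullptr;
  void* gpu_ptr_ = nullptr;
  Place place_;
};
```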
So I will do a survey on CUDA Unified Memory first (#6549).
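For reference, a minimal Unified Memory example (standard CUDA runtime API, independent of Paddle): a single pointer from cudaMallocManaged is usable from both host and device, and the driver migrates pages on demand, so no explicit host/device copies are needed:

```cpp
#include <cuda_runtime.h>
#include <cstdio>

__global__ void scale(float* x, int n, float a) {
  int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n) x[i] *= a;
}

int main() {
  const int n = 1 << 20;
  float* x = nullptr;
  cudaMallocManaged(&x, n * sizeof(float));     // one pointer for host and device
  for (int i = 0; i < n; ++i) x[i] = 1.0f;      // written on the host
  scale<<<(n + 255) / 256, 256>>>(x, n, 2.0f);  // read/written on the device
  cudaDeviceSynchronize();                      // wait before touching x on the host again
  printf("x[0] = %f\n", x[0]);                  // prints 2.0
  cudaFree(x);
  return 0;
}
```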


tensor-tang commented Dec 13, 2017

I think this is important, and it should not be limited to plain copies.

This sync mechanism should cover many kinds of conversions across devices, like CPUPlace and GPUPlace, and maybe MKLDNN and FPGA later.

I hope it can reserve an interface that lets developers implement this sync between different Places.
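For example, something along these lines (purely hypothetical names, just to illustrate the extension point, not an existing Paddle interface): a registry of converters keyed by (source Place, destination Place), so MKLDNN or FPGA back-ends can plug in their own memory/layout conversions later:

```cpp
#include <functional>
#include <map>
#include <utility>

enum class Place { kCPU, kGPU, kMKLDNN, kFPGA };

struct Tensor { /* opaque payload for this sketch */ };

using Converter = std::function<void(const Tensor& src, Tensor* dst)>;

class ConverterRegistry {
 public:
  void Register(Place from, Place to, Converter fn) {
    converters_[{from, to}] = std::move(fn);
  }

  // Looks up and runs the converter for a (from, to) pair; returns false if
  // no converter is registered, so callers can add a fallback or error path.
  bool Convert(Place from, Place to, const Tensor& src, Tensor* dst) const {
    auto it = converters_.find({from, to});
    if (it == converters_.end()) return false;
    it->second(src, dst);
    return true;
  }

 private:
  std::map<std::pair<Place, Place>, Converter> converters_;
};
```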

@jacquesqiao

@tensor-tang cool, that is a good suggestion!


luotao1 commented Dec 13, 2017

MKLDNN's memory is different from Paddle's; you can refer to /~https://github.com/PaddlePaddle/Paddle/tree/develop/doc/design/mkldnn#layers
