This repository has been archived by the owner on Aug 15, 2019. It is now read-only.

Optimize addN op by the multiple upload of textures #1538

Merged · 9 commits · Apr 15, 2019

Conversation

@Lewuathe (Contributor) commented Feb 5, 2019:

PERF

Following the same approach as the concat op optimization, we can optimize the addN op by uploading multiple textures per shader pass, up to WEBGL_MAX_TEXTURES_IN_SHADER, which reduces the overhead of texture uploads.
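A minimal sketch of the chunked-reduction idea in plain JavaScript (a sketch only: `addChunk` and the `maxInputs` parameter are hypothetical stand-ins for a single shader pass and WEBGL_MAX_TEXTURES_IN_SHADER, and plain arrays stand in for tensors):

```javascript
// Instead of adding tensors pairwise (one shader pass and one texture
// upload per input), sum them in groups of up to `maxInputs` inputs
// per pass, repeating until a single result remains.

function addChunk(arrays) {
  // One "shader pass": element-wise sum of up to maxInputs arrays.
  const out = new Array(arrays[0].length).fill(0);
  for (const a of arrays) {
    for (let i = 0; i < a.length; i++) out[i] += a[i];
  }
  return out;
}

function addN(arrays, maxInputs) {
  let current = arrays;
  while (current.length > 1) {
    const next = [];
    for (let i = 0; i < current.length; i += maxInputs) {
      next.push(addChunk(current.slice(i, i + maxInputs)));
    }
    current = next;
  }
  return current[0];
}

const inputs = Array.from({length: 10}, () => [1, 2, 3]);
console.log(addN(inputs, 4)); // [10, 20, 30]
```

With `maxInputs = 4`, ten inputs collapse in two rounds (10 → 3 → 1) instead of nine pairwise additions.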

Here is a benchmark of the optimized addN op. It shows a significant performance improvement once more than 4000 tensors are added.

Environment

  • macOS: 10.13.6
  • Chrome: 71.0.3578.98
// Time tf.addN over batches of 1000..9000 tensors of shape [100, 100].
const nums = [1, 2, 3, 4, 5, 6, 7, 8, 9];
const results = nums.map(n => {
  const tensors = [];
  for (let i = 0; i < n * 1000; i++) {
    tensors.push(tf.ones([100, 100]));
  }
  const start = performance.now();
  const res = tf.addN(tensors);
  res.dataSync();  // force execution and readback so the work is timed
  return performance.now() - start;
});
console.log(results);  // milliseconds per batch size

[chart: addN benchmark results]

See: tensorflow/tfjs#989



@Lewuathe changed the title from "Optimize addN op by limit of the number of textures" to "Optimize addN op by the multiple upload of textures" on Feb 5, 2019
@@ -0,0 +1,51 @@
/**
* @license
* Copyright 2017 Google Inc. All Rights Reserved.
@nsthorat (Contributor) commented on the diff:

2019 LLC

@nsthorat (Contributor):

Looks good overall; one small note is that this isn't going to be supported when packing is enabled. If you feel motivated you can add the packed version of this; otherwise we can do it in a follow-up, and this LGTM :)

@Lewuathe (Author):

@nsthorat Thanks! Is there any good reference (or code) for implementing the packed version?

@nsthorat (Contributor):

Check out the code here for binary ops that are packed: /~https://github.com/tensorflow/tfjs-core/blob/master/src/kernels/webgl/binaryop_packed_gpu.ts

@Lewuathe (Author):

@nsthorat Thank you! I'll take a look.

@nsthorat (Contributor) commented Apr 8, 2019:

Let me know when this is ready to review :)

@Lewuathe (Author) commented Apr 9, 2019:

@nsthorat Ah, sorry. It's basically ready for review.

@nsthorat (Contributor) left a review:

Reviewed 1 of 2 files at r1, 2 of 3 files at r2.
Reviewable status: 0 of 1 approvals obtained (waiting on @Lewuathe)


src/kernels/webgl/addn_gpu.ts, line 3 at r1 (raw file):

Previously, nsthorat (Nikhil Thorat) wrote…

2019 LLC

This license still needs updating to 2019 Google LLC.


src/kernels/webgl/addn_packed_gpu.ts, line 3 at r2 (raw file):

/**
 * @license
 * Copyright 2017 Google Inc. All Rights Reserved.

2019 Google LLC


src/kernels/webgl/backend_webgl.ts, line 1474 at r2 (raw file):

  addN<T extends Tensor>(tensors: T[]): T {
    if (tensors.length === 1) {
      return tensors[0];

return tensors[0].clone()
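The clone matters because callers own the tensor an op returns: if addN handed back its input unchanged, disposing the result would also destroy the caller's original tensor. A standalone sketch of the hazard, with a hypothetical `FakeTensor` standing in for a real tensor:

```javascript
// Why return tensors[0].clone() rather than tensors[0]:
// ops are expected to return a new tensor the caller owns.
class FakeTensor {
  constructor(data) { this.data = data; this.disposed = false; }
  clone() { return new FakeTensor(this.data.slice()); }
  dispose() { this.disposed = true; }
}

function addNReturningInput(tensors) { return tensors[0]; }   // buggy
function addNReturningClone(tensors) { return tensors[0].clone(); }  // safe

const t = new FakeTensor([1, 2, 3]);
addNReturningInput([t]).dispose();
console.log(t.disposed);  // true: the caller's input was destroyed

const u = new FakeTensor([1, 2, 3]);
addNReturningClone([u]).dispose();
console.log(u.disposed);  // false: only the result was disposed
```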


src/kernels/webgl/backend_webgl.ts, line 1490 at r2 (raw file):

    const shapes = tensors.map(t => t.shape);
    // We can make sure shapes are identical in op level.
    const usePackedOp = ENV.getBool('WEBGL_PACK_BINARY_OPERATIONS');

this isn't really a binary operation; I think we might want to make a new flag for this

@annxingyuan (Collaborator) replied:

Let's just check WEBGL_PACK here rather than checking WEBGL_PACK_BINARY_OPERATIONS or creating a new flag.
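In other words, the packed path would be gated on the general WEBGL_PACK flag. A standalone sketch of that dispatch (the `ENV` object here fakes the flag lookup, and the program names merely echo those in the PR):

```javascript
// Pick the packed or unpacked addN program based on one general flag.
const ENV = {
  flags: { WEBGL_PACK: true },
  getBool(name) { return !!this.flags[name]; },
};

function chooseAddNProgram(shapes) {
  const usePackedOp = ENV.getBool('WEBGL_PACK');
  return usePackedOp
      ? `AddNPackedProgram(${shapes.length} inputs)`
      : `AddNProgram(${shapes.length} inputs)`;
}

console.log(chooseAddNProgram([[2, 2], [2, 2]]));  // AddNPackedProgram(2 inputs)
```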

@Lewuathe (Author) replied:

Thanks for pointing that out!

@Lewuathe (Author):

@nsthorat @annxingyuan I updated it accordingly. Please take a look when you get a chance.

@nsthorat (Contributor) left a review:

Reviewed 4 of 4 files at r4.
Reviewable status: :shipit: complete! 1 of 1 approvals obtained (waiting on @annxingyuan and @Lewuathe)

@annxingyuan annxingyuan merged commit 32b05de into tensorflow:master Apr 15, 2019