Skip to content
This repository has been archived by the owner on Aug 15, 2019. It is now read-only.

Optimize CPU conv2dDerInput #995

Merged
merged 1 commit into from
Apr 25, 2018
Merged

Optimize CPU conv2dDerInput #995

merged 1 commit into from
Apr 25, 2018

Conversation

piscisaureus
Copy link
Contributor

@piscisaureus piscisaureus commented Apr 25, 2018

This makes conv2dTranspose about 100x times faster on v8 (node v8.9.0).
There are more convolutional ops to optimize - doing it one at a time.


This change is Reviewable

This makes conv2dTranspose about 100x times faster on v8 (node v8.9.0).
There are more convolutional ops to optimize - doing it one at a time.
@dsmilkov
Copy link
Contributor

:lgtm_strong: Thank you Bert!


Reviewed 2 of 2 files at r1.
Review status: all files reviewed at latest revision, all discussions resolved.


Comments from Reviewable

@dsmilkov dsmilkov merged commit ac95bf3 into tensorflow:master Apr 25, 2018
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants