Skip to content
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

[Clojure] Add methods based on NDArrayAPI/SymbolAPI #14195

Merged
merged 12 commits into from
Apr 13, 2019

Conversation

kedarbellare
Copy link
Contributor

@kedarbellare kedarbellare commented Feb 18, 2019

Description

  • This PR generates new methods based on the newer NDArrayAPI and SymbolAPI.
  • It uses the scala's Base/_LIB with direct access to JNI in order to generate the ndarray and symbol methods along with named arguments and documentation. Although the current PR doesn't add specs these can be added over time.

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage:
  • Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
  • Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
  • Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
  • Code is well-documented:
  • For user-facing API changes, API doc string has been updated.
  • For new C++ functions in header files, their functionalities and arguments are documented.
  • For new examples, README.md is added to explain the what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
  • Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
  • To the my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

  • NDArrayAPI methods, tests, (and when applicable, API doc)
  • SymbolAPI methods, tests, (and when applicable, API doc)

Reviewers

@gigasquid

@kedarbellare kedarbellare changed the title [Clojure] Add methods based on NDArrayAPI/SymbolAPI [WIP][Clojure] Add methods based on NDArrayAPI/SymbolAPI Feb 18, 2019
@roywei
Copy link
Member

roywei commented Feb 19, 2019

@mxnet-label-bot add [Clojure, pr-awaiting-review]

@marcoabreu marcoabreu added Clojure pr-awaiting-review PR is waiting for code review labels Feb 19, 2019
@gigasquid
Copy link
Member

Thanks for looking into this. It will be interesting to see how it comes out. The new API might be more difficult to interop with than the regular data structures of the current one.

@kedarbellare
Copy link
Contributor Author

@gigasquid it does seem like the new API is somewhat more cumbersome in terms of interop. the main benefit i see is the type checking but while in scala you can call the functions with named optional arguments (e.g. Dropout(ndarray, p = Some(0.25), out = Some(outputArray))) it's not so straightforward to have this in clojure.

@gigasquid
Copy link
Member

@kedarbellare Thanks so much for looking into this. I thought it might be the case that it might not be as nice to use. That's totally fine. We can stick to the other API for now and if for some reason we need to move off of it (like the Scala package deprecates it) , we can think of a different strategy, like moving a level down for NDArrary and Symbol and using the Scala Generator functions to generate our own functions directly using the JNI calls instead.

For example - getting all the backend functions by calling the JNI
/~https://github.com/apache/incubator-mxnet/blob/master/scala-package/macros/src/main/scala/org/apache/mxnet/GeneratorBase.scala#L63

and these to create the functions
/~https://github.com/apache/incubator-mxnet/blob/master/scala-package/macros/src/main/scala/org/apache/mxnet/GeneratorBase.scala#L73

Let me know what you think.

@kedarbellare
Copy link
Contributor Author

Makes sense. I'm working on unit tests for NDArray API right now which should give an idea of how we could start using it. I won't put too much effort into doing the same for Symbol API. It'll be great if we can start using JNI directly but I don't have much experience on that front.

@gigasquid
Copy link
Member

@kedarbellare Thanks for tackling the toughest part of the whole Clojure package 💯 I think you have made some really nice progress and we can definitely leverage some of this work.

Since we are starting a new ndarray api, we have a nice opportunity to improve upon the current api and make it more useful for Clojure users. I agree with your assessment that the typing makes things better, but all the positional arguments and options make it more cumbersome. This is due to the philosophical differences between Clojure and Scala, where Clojure loves data structures and Scala loves types :) I'm thinking we should take a step back and think about what we (the clojure package) would want and then try to shape what you have into it. After looking at what you have, I think there is a path forward with maybe generating only the arity with all the parameters and then taking a data structure of vectors or maps to get there.

I think it would be nice to outline what that API should look like with a couple examples from the NDArray and Symbol API before we code it up too much. Since you have experience and good feeling for what is possible now with your work, are you up for helping to draft up the future look of the API on the wiki?

@kedarbellare
Copy link
Contributor Author

Yep. I wholeheartedly agree about the philosophical differences between Scala and Clojure and I completely see the value in doing what's best for the Clojure package. I can try to take a stab at creating an outline so that we can add it to the wiki and ask for comments. Note, however, that I mostly work on this project during the weekends or when I have free time so I'll try my best to put something together soon (over the next week or so). Please let me know if you want to have something out there ASAP, in which case it may not make sense for me to work on it.

@gigasquid
Copy link
Member

@kedarbellare Definitely no rush - whatever pace works well for you. I appreciate your efforts and your contributions to improve the Clojure project.

I don't have rights to create accounts on the wiki so I'll ask out on dev for access for you.

Thanks again!

@gigasquid
Copy link
Member

@kedarbellare You should have access now - ping me if you have any trouble 😸

@vandanavk
Copy link
Contributor

@mxnet-label-bot update [Clojure, pr-work-in-progress]

@marcoabreu marcoabreu added pr-work-in-progress PR is still work in progress and removed pr-awaiting-review PR is waiting for code review labels Mar 4, 2019
@kedarbellare kedarbellare changed the title [WIP][Clojure] Add methods based on NDArrayAPI/SymbolAPI [Clojure] Add methods based on NDArrayAPI/SymbolAPI Apr 7, 2019
@kedarbellare
Copy link
Contributor Author

@gigasquid i've improved the api generation for both ndarrays and symbols. PTAL thanks!

@kedarbellare kedarbellare force-pushed the kedarb--ndarray-symbol-api branch from 37e86b3 to ffb73f8 Compare April 7, 2019 16:40
@gigasquid
Copy link
Member

Awesome work!!! I look forward to reviewing this 😸

Copy link
Contributor

@Chouffe Chouffe left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Amazing work Kedar!
I left some minor comments. I will review more in depth if needed. @gigasquid will have better feedback for you :)
Looking forward to having this merged!

Copy link
Member

@gigasquid gigasquid left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @kedarbellare for tackling this and making such great progress!

After @Chouffe 's feedback is taken care of, I think this is fine to merge and will give us a good base to work on.

@gigasquid
Copy link
Member

@kedarbellare - I think the next step after this is to try to get /~https://github.com/apache/incubator-mxnet/blob/master/contrib/clojure-package/test/org/apache/clojure_mxnet/conv_test.clj to use the new symbol-api. It will probably shake out some further refinements.

It certainly is a much nicer api to use and would love to get it to the point were we could deprecate the old symbol and ndarray way of doing things 😸

@gigasquid
Copy link
Member

I'm also not sure why the unix-gpu keeps failing from CI, might want to try to rebase on master and see if that helps

@gigasquid
Copy link
Member

Oh @kedarbellare - one more thing... Would you mind marking the new api namespaces as Experimental ? That way users know that it is in active work and might change and not be stable yet.

Thanks!

@kedarbellare
Copy link
Contributor Author

@Chouffe @gigasquid : thanks for the detailed feedback! i'll take care of these issues soon (hopefully by end of the week).

@gigasquid : where should i mark the namespaces as Experimental? in the docs at the top or in the individual docs of the functions or somewhere else?

@gigasquid
Copy link
Member

@kedarbellare marking it as Experimental in the namespace docstring of contrib/clojure-package/src/org/apache/clojure_mxnet/ndarray_api.clj - or a comment should be fine. And in the corresponding symbol-api namespace. Thanks!

@Chouffe
Copy link
Contributor

Chouffe commented Apr 11, 2019

We should probably start rewriting some of the examples with this once it gets merged.
I can help with this @kedarbellare @gigasquid :)

@kedarbellare kedarbellare force-pushed the kedarb--ndarray-symbol-api branch from 3d2c01b to dc3a5fa Compare April 13, 2019 14:30
@kedarbellare
Copy link
Contributor Author

@gigasquid @Chouffe i've updated the PR with the following:

  1. addressed cleanup comments
  2. tagged the new apis as "Experimental"
  3. modified the conv_test.clj to use the new symbol-api and fixed a small bug that i discovered during that
  4. tweaked the coerce-param for symbol-api (especially for the attr param)
  5. rebased to the latest upstream/master to see if that fixes the unix-gpu CI failure

PTAL thanks!

@gigasquid gigasquid merged commit a5db391 into apache:master Apr 13, 2019
kedarbellare added a commit to kedarbellare/incubator-mxnet that referenced this pull request Apr 13, 2019
* [Clojure] Add methods based on NDArrayAPI/SymbolAPI

* Add symbol API methods and ndarray API unit tests

* Some more ndarray API unit tests

* Explore direct use of JNI

* Use library info directly instead of reflection

* Add tests for generation op info

* Fix ordering of keys using array-map

* Ignore generated test files

* Minor style changes

* Refactor code for better readability

* Address comments

* Small tweaks to symbol api coercion
haohuanw pushed a commit to haohuanw/incubator-mxnet that referenced this pull request Jun 23, 2019
* [Clojure] Add methods based on NDArrayAPI/SymbolAPI

* Add symbol API methods and ndarray API unit tests

* Some more ndarray API unit tests

* Explore direct use of JNI

* Use library info directly instead of reflection

* Add tests for generation op info

* Fix ordering of keys using array-map

* Ignore generated test files

* Minor style changes

* Refactor code for better readability

* Address comments

* Small tweaks to symbol api coercion
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Clojure pr-work-in-progress PR is still work in progress
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants