Showing content from https://github.com/apache/mxnet/releases/tag/0.11.0 below:
Release MXNet 0.11.0 · apache/mxnet · GitHub
This repository was archived by the owner on Nov 17, 2023. It is now read-only.
- Releases
- 0.11.0
nswamy
released this
05 Sep 18:56 0.11.0 Major Features
API Changes
- Added
CachedOp
. You can now cache the operators that’s called frequently with the same set of arguments to reduce overhead.
- Added sample_multinomial for sampling from multinomial distributions.
- Added
trunc
operator for rounding towards zero.
- Added linalg_gemm, linalg_potrf, ... operators for lapack support.
- Added verbose option to Initializer for printing out initialization details.
- Added DeformableConvolution to contrib from the Deformable Convolutional Networks paper.
- Added float64 support for dot and batch_dot operator.
allow_extra
is added to Module.set_params to ignore extra parameters.
- Added
mod
operator for modulo.
- Added
multi_precision
option to SGD optimizer to improve training with float16. Resnet50 now achieves the same accuracy when trained with float16 and gives 50% speedup on Titan XP.
Performance Improvements
- ImageRecordIter now stores data in pinned memory to improve GPU memcopy speed.
Bugfixes
- Fixed a bug in Adam that causes weight decay to be handled incorrectly. If you are using Adam, you may need to tune learning rate a little to get the same performance as previous versions.
- Remove WaitToRead in dist-kvstore: Improves performance 20-30% for distributed training.
- Cython interface is fixed.
make cython
and python setup.py install --with-cython
should install the cython interface and reduce overhead in applications that use imperative/bucketing.
- Fixed various bugs in Faster-RCNN example: #6486
- Fixed various bugs in SSD example.
- Fixed
out
argument not working for zeros
, ones
, full
, etc.
expand_dims
now supports backward shape inference.
- Fixed a bug in rnn. BucketingSentenceIter that causes incorrect layout handling on multi-GPU.
- Fixed context mismatch when loading optimizer states.
- Fixed a bug in ReLU activation when using MKL.
- Fixed a few race conditions that causes crashes on shutdown.
- Fixed image-classification example code.
Refactors
- Refactored TShape/TBlob to use int64 dimensions and DLTensor as internal storage. Getting ready for migration to DLPack. As a result TBlob::dev_mask_ and TBlob::stride_ are removed.
Known Issues
- Inception-V3 model can be converted into CoreML format but is unable to run on Xcode.
You can’t perform that action at this time.
RetroSearch is an open source project built by @garambo
| Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4