Commit graph

2 commits

Author SHA1 Message Date
Yuwen Hu
bfca337d09 [Nano] Add how-to guide for TensorFlow Keras training using bf16 (#7169)
* Add basic doc structure for bf16 tf training how-to guide, and change the incorrect order of tf inference guides in toc

* Add how-to guide for tf bf16 training

* Add warning box for tf bf16 hardware limitations

* Add a print message to show the default policy of model after unpatch

* Small fixes

* Small github action fixes for tf bf16 training how-to guide

* diable action test for tf bf16 train for now, due to the core dumped problem on platforms without AVX512

* Updated based on comments
2023-01-05 17:59:43 +08:00
Yuwen Hu
331c3054d9 [Nano] How-To Guides: Training - TensorFlow (#5836)
* Add basic guides structure of Training - TensorFlow

* Add how-to guides: How to accelerate a TensorFlow Keras application on training workloads through multiple instances

* Change import order and add pip install for tensorflow-dataset

* Diable other nano tests for now

* Add github action tests for how-to guides Tensorflow training

* Use jupyter nbconvert to test notebooks for training tensorflow instead to avoid errors

* Add how-to guide: How to optimize your model with a sparse Embedding layer and SparseAdam optimizer

* Enable other nano tests again

* Small Revision: fix typos

* Small Revision: refactor some sentences

* Revision: refactor contents based on comments

* Add How-to guides: How to choose the number of processes for multi-instance training

* Small Revision: fix typos and refactor some sentences

* Make timeout time for github action longer for TensorFlow, 600s->700s
2022-09-26 15:40:22 +08:00