Skip to content
Snippets Groups Projects
  • Mashiro's avatar
    b7866021
    [Refactor] Refactor the accumulate gradient implemention of OptimWrapper (#284) · b7866021
    Mashiro authored
    * merge context
    
    * update unit test
    
    * add docstring
    
    * fix bug in AmpOptimWrapper
    
    * add docstring for backward
    
    * add warning and docstring for accumuate gradient
    
    * fix docstring
    
    * fix docstring
    
    * add params_group method
    
    * fix as comment
    
    * fix as comment
    
    * make default_value of loss_scale to dynamic
    
    * Fix docstring
    
    * decouple should update and should no sync
    
    * rename attribute in OptimWrapper
    
    * fix docstring
    
    * fix comment
    
    * fix comment
    
    * fix as comment
    
    * fix as comment and add unit test
    b7866021
    History
    [Refactor] Refactor the accumulate gradient implemention of OptimWrapper (#284)
    Mashiro authored
    * merge context
    
    * update unit test
    
    * add docstring
    
    * fix bug in AmpOptimWrapper
    
    * add docstring for backward
    
    * add warning and docstring for accumuate gradient
    
    * fix docstring
    
    * fix docstring
    
    * add params_group method
    
    * fix as comment
    
    * fix as comment
    
    * make default_value of loss_scale to dynamic
    
    * Fix docstring
    
    * decouple should update and should no sync
    
    * rename attribute in OptimWrapper
    
    * fix docstring
    
    * fix comment
    
    * fix comment
    
    * fix as comment
    
    * fix as comment and add unit test