For further details regarding the algorithm, we refer to Decoupled Weight Decay Regularization (Loshchilov & Hutter, arXiv:1711.05101).
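As a brief sketch of the decoupled update described in that paper (standard AdamW notation, not taken from this page): with gradient g_t, moment coefficients beta_1 and beta_2, learning rate lr, stability term eps, and decay coefficient lambda (the weight_decay argument), the update is

$$
m_t = \beta_1 m_{t-1} + (1-\beta_1)\, g_t, \qquad
v_t = \beta_2 v_{t-1} + (1-\beta_2)\, g_t^2,
$$
$$
\hat m_t = \frac{m_t}{1-\beta_1^t}, \qquad
\hat v_t = \frac{v_t}{1-\beta_2^t},
$$
$$
\theta_t = \theta_{t-1} - \mathrm{lr}\left(\frac{\hat m_t}{\sqrt{\hat v_t} + \epsilon} + \lambda\, \theta_{t-1}\right).
$$

With amsgrad = TRUE, the running maximum of the bias-corrected second moment is used in the denominator instead.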
Usage

optim_ignite_adamw(
  params,
  lr = 0.001,
  betas = c(0.9, 0.999),
  eps = 1e-08,
  weight_decay = 0.01,
  amsgrad = FALSE
)
Arguments

params: (iterable) iterable of parameters to optimize or lists defining parameter groups
lr: (float, optional) learning rate (default: 1e-3)
betas: (Tuple[float, float], optional) coefficients used for computing running averages of the gradient and its square (default: (0.9, 0.999))
eps: (float, optional) term added to the denominator to improve numerical stability (default: 1e-8)
weight_decay: (float, optional) decoupled weight decay coefficient (default: 0.01)
amsgrad: (boolean, optional) whether to use the AMSGrad variant of this algorithm from the paper On the Convergence of Adam and Beyond (default: FALSE)
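A minimal construction sketch tying the arguments above to R code; the nn_linear model and the hyperparameter values are illustrative assumptions, not taken from this page:

library(torch)

# Illustrative model whose parameters will be optimized (assumption, not from this page).
model <- nn_linear(10, 1)

opt <- optim_ignite_adamw(
  model$parameters,       # iterable of parameters to optimize
  lr = 1e-3,              # learning rate
  betas = c(0.9, 0.999),  # running-average coefficients for the gradient and its square
  eps = 1e-8,             # numerical-stability term added to the denominator
  weight_decay = 0.01,    # decoupled weight decay
  amsgrad = FALSE         # set TRUE for the AMSGrad variant
)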
Value

See OptimizerIgnite.

See also

OptimizerIgnite
Examples

if (torch_is_installed()) {
  if (FALSE) { # not run: assumes model, loss_fn, input, and target are already defined
    optimizer <- optim_ignite_adamw(model$parameters, lr = 0.1)
    optimizer$zero_grad()
    loss_fn(model(input), target)$backward()
    optimizer$step()
  }
}
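The example above relies on a pre-existing model, loss_fn, input, and target. A self-contained sketch under assumed choices (nn_linear, nnf_mse_loss, and random tensors, none of which are prescribed by this page):

library(torch)

if (torch_is_installed()) {
  # Assumed setup: a small linear-regression model and random data.
  model   <- nn_linear(4, 1)
  input   <- torch_randn(32, 4)
  target  <- torch_randn(32, 1)
  loss_fn <- nnf_mse_loss

  optimizer <- optim_ignite_adamw(model$parameters, lr = 0.1)

  for (step in 1:5) {
    optimizer$zero_grad()                  # clear gradients from the previous step
    loss <- loss_fn(model(input), target)  # forward pass and loss
    loss$backward()                        # backpropagate
    optimizer$step()                       # apply the AdamW update
  }
}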