optim_sgd

Implements stochastic gradient descent (optionally with momentum).
The Nesterov momentum variant is based on the formula from
"On the importance of initialization and momentum in deep learning"
(Sutskever et al., 2013).

Provides functionality to define and train neural networks similar to
'PyTorch' by Paszke et al (2019) <arXiv:1912.01703> but written entirely in R
using the 'libtorch' library. Also supports low-level tensor operations and
'GPU' acceleration.

Daniel Falbel

torch

Tensors and Neural Networks with 'GPU' Acceleration

Javier Luraschi

Dmitriy Selivanov

Athos Damiani

Christophe Regouby

Krzysztof Joachimiak

Hamada S. Badr

RStudio

optim_sgd function

<dl><dt>params</dt>
<dd>(iterable): iterable of parameters to optimize, or a list of named lists
("dicts" in PyTorch terms) defining parameter groups</dd>
<dt>lr</dt>
<dd>(float): learning rate</dd>
<dt>momentum</dt>
<dd>(float, optional): momentum factor (default: 0)</dd>
<dt>dampening</dt>
<dd>(float, optional): dampening for momentum (default: 0)</dd>
<dt>weight_decay</dt>
<dd>(float, optional): weight decay (L2 penalty) (default: 0)</dd>
<dt>nesterov</dt>
<dd>(bool, optional): enables Nesterov momentum (default: FALSE)</dd></dl>
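
How these arguments interact can be sketched in base R on plain numeric vectors. The sketch assumes the update follows the scheme used by PyTorch's SGD (weight decay folded into the gradient, then the momentum buffer, then the optional Nesterov correction); `sgd_step` is a hypothetical helper written for illustration, not a function exported by torch.

```r
# One SGD step on plain numeric vectors (no torch required).
# Assumption: this mirrors the update used by PyTorch's SGD; `sgd_step` is a
# hypothetical helper, not part of the torch package.
sgd_step <- function(param, grad, buf = NULL, lr,
                     momentum = 0, dampening = 0,
                     weight_decay = 0, nesterov = FALSE) {
  # L2 penalty: add weight_decay * param to the gradient.
  if (weight_decay != 0) grad <- grad + weight_decay * param
  if (momentum != 0) {
    # The first step seeds the buffer with the gradient itself;
    # later steps blend the old buffer with the dampened gradient.
    buf <- if (is.null(buf)) grad else momentum * buf + (1 - dampening) * grad
    # Nesterov adds a look-ahead along the buffer; plain momentum uses the buffer.
    grad <- if (nesterov) grad + momentum * buf else buf
  }
  list(param = param - lr * grad, buf = buf)
}

# Minimize f(x) = x^2 (gradient 2 * x), starting from x = 1.
state <- list(param = 1, buf = NULL)
for (i in 1:100) {
  state <- sgd_step(state$param, 2 * state$param, state$buf,
                    lr = 0.1, momentum = 0.9)
}
```

With `momentum = 0` the helper reduces to plain gradient descent, `param - lr * grad`, which makes it easy to compare the effect of each argument in isolation.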

Arguments

If you need to move a model to GPU via <code>$cuda()</code>, please do so before
constructing optimizers for it. Parameters of a model after <code>$cuda()</code>
will be different objects from those before the call. In general, you
should make sure that the objects pointed to by model parameters subject
to optimization remain the same over the whole lifecycle of optimizer
creation and usage.
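
The ordering described in this warning might look as follows. This is a minimal sketch, assuming the torch package and its backend are installed; the layer sizes and learning rate are arbitrary illustration values.

```r
# Sketch: move the model to the GPU *before* creating the optimizer,
# so the optimizer holds references to the parameters that are trained.
if (requireNamespace("torch", quietly = TRUE) && torch::torch_is_installed()) {
  library(torch)

  model <- nn_linear(10, 1)

  # Only attempt the move when a CUDA device is actually available.
  if (cuda_is_available()) {
    model$cuda()
  }

  # Construct the optimizer last, from the parameters as they are now.
  optimizer <- optim_sgd(model$parameters, lr = 0.01)
}
```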

Warning

SGD optimizer — optim_sgd

Examples
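
A typical training loop with `optim_sgd` might look like the sketch below. It assumes the torch package and its backend are installed; the random data, the single linear layer, and the hyperparameter values are placeholders chosen for illustration only.

```r
if (requireNamespace("torch", quietly = TRUE) && torch::torch_is_installed()) {
  library(torch)
  torch_manual_seed(1)

  # Toy regression data (hypothetical): 32 samples with 10 features each.
  x <- torch_randn(32, 10)
  y <- torch_randn(32, 1)

  model <- nn_linear(10, 1)
  optimizer <- optim_sgd(model$parameters, lr = 0.1, momentum = 0.9)

  for (epoch in 1:5) {
    optimizer$zero_grad()               # clear gradients from the previous step
    loss <- nnf_mse_loss(model(x), y)   # forward pass and loss
    loss$backward()                     # compute gradients
    optimizer$step()                    # apply the SGD update
  }
}
```

Setting `nesterov = TRUE` in the call to `optim_sgd` switches the same loop to Nesterov momentum; it is only meaningful together with a non-zero `momentum`.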