Linear regression (Gaussian GLM), logistic regression, and poisson
regression model fitters.
Usage
reg.fit(x, y, maxiter = 100)
logistic.fit(x, y, maxiter = 100)
poisson.fit(x, y, maxiter = 100)
Arguments
x, y
The input data x and response y. Each must be a shaq, and
each must be distributed in an identical fashion. See the details section
for more information.
maxiter
The maximum number of iterations.
Value
The return is the output of an optim() call.
Communication
The communication consists of an allreduce of 1 double (the local
cost/objective function value) at each iteration of the optimization.
Details
Each function is implemented with gradient descent using the conjugate
gradients method ("CG") of the optim() function.
Both of x and y must be distributed in an identical fashion.
This means that the number of rows owned by each MPI rank should match, and
the data rows x and response rows y should be aligned.
Additionally, each MPI rank should own at least one row. Ideally they should
be load balanced, so that each MPI rank owns roughly the same amount of data.
References
McCullagh, P. and Nelder, J.A., 1989. Generalized Linear Models, no. 37 in
Monograph on Statistics and Applied Probability.
Duda, R.O., Hart, P.E. and Stork, D.G., 1973. Pattern classification (pp.
526-528). Wiley, New York.