Learn R Programming

LogicReg (version 1.6.6)

logreg.anneal.control: Control for Logic Regression

Description

Control of simulated annealing parameters needed in logreg.

Usage

logreg.anneal.control(start=0, end=0, iter=0, earlyout=0, update=0)

Value

A list with arguments start, end, iter, earlyout, and update, that can be used as the value of the argument anneal.control oflogreg.

Arguments

start

the upper temperature (on a log10 scale) in the annealing chain. I.e. if start = 3, the annealing chain starts at temperature 1000. The acceptance function is the usual min(1,exp(-diff(scores)/temp)), so any temperature larger than what would be expected as possible differences between any two models pretty much generates a random walk in the beginning, and means that you need to wait longer on results. A too low starting temperature means that the chain may end up in a local optimal (rather than global optimal) solution. If you select both start and end the default of 0, the program will attempt to find reasonable numbers itself (it is known to be only moderately successful in this though).

end

the lower temperature (on a log10 scale) in the annealing chain. I.e. if end is -2, the annealing chain ends at temperature 0.01. If this temperature is very low one can use the early out possibility listed below, as otherwise the chain may run longer than desired!

iter

the total number of iterations in the annealing chain. This is the total over all annealing chains, not the number of iterations of a chain at a given temperature. If this number is too small the chain may not find a good (the best) solution, if the chain is too long the program may take long...

earlyout

if the end temperature is very low, the simulated annealing algorithm may not move any more, but one still needs to wait on all possible moves being evaluated (and rejected)! An early out possibility is offered. If during consecutive five blocks of earlyout iterations, in each block 10 or fewer moves are accepted (for which the score changes), the program terminates. This is a desirable option after one is convinced the program otherwise runs fine: it can be dangerous on the first run.

update

every how many iterations there should be an update of the scores. I.e. if update = 1000, a score will get printed every 1000 iterations. So if iter = 100000 iterations, there will be 100 updates on your screen. If you update = 0, a one line summary for each fitted model is printed. If update = -1, there is virtually no printed output.

Author

Ingo Ruczinski ingo@jhu.edu and Charles Kooperberg clk@fredhutch.org.

Details

Missing arguments take defaults. If the argument start is a list with arguments start, end, iter, earlyout, and update, those values take precedent of directly specified values.

This is a rough outline how the automated simulated annealing works: The algorithm starts running at a very high temperature, and decreases the temperature until the acceptance ratio of moves is below a certain threshold (in the neighborhood of 95%). At this point we run longer chains at fixed temperatures, and stop the search when the last "n" consecutive moves have been rejected. If you think that the search was either not sufficiently long or excessively long (both of which can very well happen since it is pretty much impossible to specify default values that are appropriate for all sorts of data and models), you can over-write the default values.

If you want more detailed information continue reading....

These are some more detailed suggestions on how to set the parameters for the beginning temperature, end temperature and number of iterations for the Logic Regression routine. Note that if start temperature and end temperature are both zero, the routine uses its default values. The number of iterations iter is irrelevant in this case. In our opinion, the default values are OK, but not great, and you can usually do better if you're willing to invest time in learning how to set the parameters.

The starting temperature is the log(10) value of start - i.e., if start is 2 it means iterations start at a temperature of 100. The end temperature is again the log(10) value. The number of iterations are equidistant on a log-scale.

Considerations in setting these parameters.....

1) start temperature. If this is too high you're "wasting time", as the algorithm is effectively just making a random walk at high temperatures. If the starting temperature is too low, you may already be in a (too) localized region of the search space, and never reach a good solution. Typically a starting temperature that gives you 90% or so acceptances (ignoring the rejected attempts, see below) is good. Better a bit too high than too low. But don't waste too much time.

2) end temperature. By the time that you reach the end temperature the number of accepted iterations should be only a few per 1000, and the best score should no longer change. Even zero acceptances is fine. If there are many more acceptances, lower end. If there are zero acceptances for many cycles in a row, raise it a bit. You can set a lower end temperature than needed using the earlyout test: if in 5 consecutive cycles of 1000 iterations there are fewer than a specified number of acceptances per cycle, the program terminates.

3) number of iterations. What really counts is the number of iterations in the "crunch time", when the number of acceptances is, say, more than 5% but fewer than 40% of the iterations. If you print summary statistics in blocks of 1000, you want to see as many blocks with such acceptance numbers as possible. Obviously within what is reasonable.

Here are two examples, with my analysis....

(A) logreg.anneal.control(start = 2, end = 1, iter = 50000, update = 1000)

The first few lines are (cutting of some of the last columns...)

log-tempcurrent scorebest scoreacc /rej /singcurrent parameters
2.0001198.7851198.7850000.508 -0.368 -0.144
1.9801197.9621175.311719(18)342291.273 -0.275 -0.109
1.9601197.9091168.159722(11)382290.416 -0.345 -0.173
1.9401181.5451168.159715(19)352310.416 -0.345 -0.173
...
1.0201198.2581167.578663(16)1281931.685 -0.216 -0.024
1.0001198.7561167.578641(23)1042321.685 -0.216 -0.024
1.0001198.7561167.5781( 0)001.685 -0.216 -0.024

Ignore the last line. This one is just showing a refitting of the best model. Otherwise, this suggests (i) end is ***way*** too high, as there are still have more than 600 acceptances in blocks of 1000. It is hard to judge what end should be from this run. (ii) The initial number of acceptances is really high (719+18)/(719+18+34))=95% - but when 1.00 is reached it's at about 85%. One could change start to 1, or keep it at 2 and play it save.

(B) logreg.anneal.control(start = 2, end = -2, iter = 50000, update = 1000) - different dataset/problem

The first few lines are

log-tempcurrent scorebest scoreacc /rej /singcurrent parameters
2.0001198.7851198.7850( 0)000.50847 -0.36814
1.9181189.9511172.615634(23)223220.38163 -0.28031
1.8371191.5421166.739651(24)322931.75646 -0.22451
1.7551191.9071162.902613(30)203371.80210 -0.32276

The last few are

log-tempcurrent scorebest scoreacc /rej /singcurrent parameters
-1.8371132.7311131.8660(18)7012810.00513 -0.45994
-1.9181132.7311131.8660(25)6762990.00513 -0.45994
-2.0001132.7311131.8660(17)7182650.00513 -0.45994
-2.0001132.7311131.8660( 0)010.00513 -0.45994

But there really weren't any acceptances since

log-tempcurrent scorebest scoreacc /rej /singcurrent parameters
-0.4491133.6221131.8664(21)8751000.00513 -0.45994
-0.5311133.6221131.8660(19)8291520.00513 -0.45994
-0.6121133.6221131.8660(33)8081590.00513 -0.45994

Going down from 400 to fewer than 10 acceptances went pretty fast....

log-tempcurrent scorebest scoreacc /rej /singcurrent parameters
0.7761182.1561156.354464(31)2582471.00543 -0.26602
0.6941168.5041150.931306(17)3553221.56695 -0.43351
0.6121167.7471150.931230(38)3833491.56695 -0.43351
0.5311162.0851145.920124(12)5712931.15376 -0.15223
0.4491143.8411142.32163(20)5903272.20150 -0.43795
0.3671176.1521142.321106(21)6492242.20150 -0.43795
0.2861138.3841131.86662(18)7311890.00513 -0.45994
0.2041138.2241131.86611(27)8231390.00513 -0.45994
0.1221150.3701131.86615(12)7222510.00513 -0.45994
0.0411144.5361131.86630(19)7891620.00513 -0.45994
-0.0411137.8981131.86621(25)911430.00513 -0.45994
-0.1221139.4031131.86612(30)883750.00513 -0.45994

What does this tell me - (i) start was probably a bit high - no real harm done, (ii) end was lower than needed. Since there really weren't any acceptances after 10log(T) was about (-0.5), an ending log-temperature of (-1) would have been fine, (iii) there were far too few runs. The crunch time didn't take more than about 10 cycles (10000 iterations). You see that this is the time the "best model" decreased quite a bit - from 1156 to 1131. I would want to spend considerably more than 10000 iterations during this period for a larger problem (how many depends very much on the size of the problem). So, I'd pick (A)logreg.anneal.control(start = 2, end = -1, iter = 200000, update = 5000). Since the total range is reduced from 2-(-2)=4 to 2-(-1)=3, over a range of 10log temperatures of 1 there will be 200000/3=67000 rather than 50000/4=12500 iterations. I would repeat this run a couple of times.

In general I may sometimes run several models, and check the scores of the best models. If those are all the same, I'm very happy, if they're similar but not identical, it's OK, though I may run one or two longer chains. If they're very different, something is wrong. For the permutation test and cross-validation I am usually less picky on convergence.

References

Ruczinski I, Kooperberg C, LeBlanc ML (2003). Logic Regression, Journal of Computational and Graphical Statistics, 12, 475-511.

Ruczinski I, Kooperberg C, LeBlanc ML (2002). Logic Regression - methods and software. Proceedings of the MSRI workshop on Nonlinear Estimation and Classification (Eds: D. Denison, M. Hansen, C. Holmes, B. Mallick, B. Yu), Springer: New York, 333-344.

Selected chapters from the dissertation of Ingo Ruczinski, available from https://research.fredhutch.org/content/dam/stripe/kooperberg/ingophd-logic.pdf

See Also

logreg, logreg.mc.control, logreg.tree.control

Examples

Run this code
myannealcontrol <- logreg.anneal.control(start = 2, end = -2, iter = 50000, update = 1000)

Run the code above in your browser using DataLab