MASON-INTEREST-L Archives

February 2009

MASON-INTEREST-L@LISTSERV.GMU.EDU

Options: Use Monospaced Font
Show Text Part by Default
Show All Mail Headers

Message: [<< First] [< Prev] [Next >] [Last >>]
Topic: [<< First] [< Prev] [Next >] [Last >>]
Author: [<< First] [< Prev] [Next >] [Last >>]

Print Reply
Subject:
From:
Sean Luke <[log in to unmask]>
Reply To:
Date:
Tue, 3 Feb 2009 19:43:33 -0500
Content-Type:
text/plain
Parts/Attachments:
text/plain (67 lines)
Pelle Evensen wrote:

> Randy Casstevens wrote:
>> Do some random seeds give better quality results for
>> MersenneTwisterFast? I have heard that random seeds should be odd
>> numbers, large numbers, and/or prime numbers, but does it matter for
>> MersenneTwisterFast?
>>
> In short, for any good quality PRNG (the MersenneTwister would be one)
> the choice of seed does not matter at all as far as "quality" goes.

Actually, many very high-grade RNGs are known to have certain
pathological seed cases. The crucial issue is: do we know what those
cases are? If so, we're fine -- just avoid those seed choices.

Mersenne Twister is no exception: and it's considered among best of the
best! MT's internal state is an array of about 600 32-bit integers.
Its seeding pathology is to fill that array initially with lots of
zeros. It takes Mersenne Twister a while to work its way out of that
state, during which time it tends to produce lots of relatively poorly
random numbers.

This turned out to be a problem for early seeding algorithms common
among the Mersenne Twister community. The current standard seeding
algorithm is to take a seed and hand it off to a simple but relatively
random RNG (in this case, a particular linear congruential generator in
one of Knuth's books), and pump that generator 600-ish times. Each
output of the generator gets put into a different slot in the Mersenne
Twister array. That gives you a nice random collection of numbers, and
an acceptably small number of initial zeros.

FWIW, my MersenneTwisterFast code follows the current standards.


> The only thing to possibly worry about is if you start many different
> simulations with nearby seeds. For some generators (depending on how the
> seed is expanded to the internal state of the generator) this will be a
> problem and for some others, it won't be. What the state of the Mersenne
> Twister as implemented in MASON is, I don't know.

Generally MT doesn't have this problem, partly because as an algorithm
it's pretty robust to that issue, and partly because the Knuth seeding
procedure described above creates very different initial internal array
states from very similar random numbers. Overall, most high grade RNGs
are robust against similar seeds.


> A better (or at least safer) approach than the Mersenne Twister would be
> to have a generator that generates distinct streams. This article
> discusses the desirability of distinct streams;
> http://www.iro.umontreal.ca/~lecuyer/myftp/papers/rstream.pdf

Having a generator which is forkable is a nice feature. But I would be
very hesitant to rely on an algorithm that doesn't have fairly provable
features: and the paper you cited is actually a technical report for a
code library, not an algorithmic description. My suggestion about
random numbers: BE VERY CAREFUL ABOUT DEVIATING FROM WELL-STUDIED
ALGORITHMS. I don't get a good feeling about the lack of formality in
this paper.

Stream protocols etc. are often implementable directly on top of
well-regarded generators. MT would be particularly good for this given
its famously gigantic period. It might be worthwhile to investigate the
current RNG literature for formal protocols.

Sean

ATOM RSS1 RSS2