This SRFI seems to mix together two somewhat separate domains:
1. Generate random variables with various discrete and continuous
distributions. This programming is mathematical.
2. Choose among a selection or range of objects according to results
returned by a random number generator. This programming supports
various application domains.
That's true. I should probably put some headers in, but I don't see any particular need to divide the SRFI. It's not as if the sample implementation is huge and you might not want all of it.
I should add that the first author chose what to put in the Gauche library on which the SRFI iss based, though I took some things out.
1: Basic generator distributions:
(make-normal-generator [ s ] [ mean [ deviation ] ])
I'd prefer to have a N(0,1) generator that is later translated and
scaled by a general transform to fit application needs.
That makes sense to me, and a set of generator operations that perform appropriate transformations on their input values would be a Very Good Thing, either for this SRFI or a future SRFI. This applies to a number of the other functions you discuss below.
Missing useful standard distributions:
Bernoulli(p)
Definitely worthwhile; added. I've also added make-u1-generator for the special case of equiprobable bits.
Binomial(n,p)
I have no objection to adding this. Does anyone have a handy implementation in Scheme?
General finite discrete generator: Given probabilities p(i), for i in
[0,N), summing to 1, generate discrete random variable in [0,N) with the
given p(i) as probabilities.
This makes sense as well.
More esoteric distributions of varying difficulty and utility:
Gamma
Beta
t
Again, I have no objection: implementations are needed, though.
(make-random-integer-generator [ s ] lower-bound upper-bound)
(make-random-u8-generator [ s ])
All these just translate the result of random-integer from SRFI 27.
Absolutely. But they admit a particularly efficient implementation given access to the low-level generator below SRFI 27.
(make-random-string-generator [ s ] string)
I don't understand what this one does.
The argument k is missing from the prototype. The idea is to generate random strings whose contents are randomly chosen from string and whose length is bounded by k.
(gsampling [ s ] generator ...)
I don't understand the statement "When all the generators are exhausted
or no generators are specified, the new generator returns an end-of-file
object."
That makes it work with finite generators as well. Thus calling gsampling on a generator for days of the week and another generator for dwarves
will produce a finite generator of 7+13 (or 7+7, depending on your mythos) elements. These two functions are quite unlike the rest: they use random numbers to control their behavior rather than generating them.
Choosing among the generators with equal probability is an
unnecessary limitation.
It's an important special case that can be made more efficient.
(gweighted-sampling) [ s ] obj ..)
You probably want to choose randomly from N generators based on the
output of an N+1st generator that has range [0,N). Specifying
probabilities (basically the probability distribution function of a
discrete random variable) is too difficult in general.
I don't understand this statement. The idea is that if you have generators of bad widgets and good widgets, and you want 12% bad and 88% good in your output stream (IIRC this is optimal for human quality-control inspection), you specify (gweighted-sampling 0.12 bad-widgets 0.88 good-widgets). Or equivalently specify 12 and 88, or 3 and 22, or 12/100 and 88/100.
John Cowan
http://vrici.lojban.org/~cowan xxxxxx@ccil.orgNo, John. I want formats that are actually useful, rather than over-
featured megaliths that address all questions by piling on ridiculous
internal links in forms which are hideously over-complex.
--Simon St. Laurent on xml-dev