Wikipedia talk:WikiProject Probability
/Archive1 02:23, 18 August 2005 (UTC)
Specification
Capitalization
We should decide whether to capitalize the names of the distributions when we refer to them in passing, e.g. do we talk about the Gamma distribution or the gamma distribution? I have been capitalizing them because each name seems like a proper name. Acuster 07:54, 19 August 2005 (UTC)
- We generally have to go with what's most common in the relevant literature. If there are several equally common alternatives, I'm generally in favor of preserving the case of any conventional piece of notation that the distribution may be named for. I personally write "Gamma distribution", and "Beta distribution", but "chi-square distribution" and "zeta distribution", because the first two involve upper-case Greek letters and the last two lower-case Greek letters. For other distributions it's pretty clear: "F-distribution", "t-distribution"; "Cauchy distribution", "Wishart distribution" (based on proper names); "binomial distribution", "exponential distribution" (not based on proper names). I tend to hesitate when it comes to "normal distribution" vs. "Normal distribution". --MarkSweep 08:55, 19 August 2005 (UTC)
Inline math
A question: how do we add inline math so it's elegant? I've tried adding it with 'math' tags but it looks goofy: e.g. . Acuster 08:01, 19 August 2005 (UTC)
- That's a complex issue. I'd say what you did was perfectly fine for inline math. If you prefer TeX's Computer Modern typeface for all math formulas, you can switch on mandatory PNG rendering in your user preferences. --MarkSweep 08:57, 19 August 2005 (UTC)
Prototypes
I'd suggest using the Normal distribution as our prototype for the continuous distributions, because it's currently the most complete and elegant page and possibly the most important distribution overall; certainly it has the most text, both here and on MathWorld. Acuster 07:57, 19 August 2005 (UTC)
I'd additionally recommend the article on the exponential distribution, since it includes a discussion of Bayesian estimation. A discussion of (semi-)conjugate priors for the normal mean and variance/precision is currently missing from the article on the normal distribution. --MarkSweep 08:45, 19 August 2005 (UTC)
Specification of the distribution: Notation for discrete PMFs
As I see it, there are basically three choices of notation for discrete probability mass functions. Consider a one-parameter family like the Zeta distribution, whose parameter is called s. We try to use k as the main argument:
- Vector notation (advocated by PAR) would write the probability as $\Pr(X=k) = f_k(s)$ with $f_k(s) = k^{-s}/\zeta(s)$. I call this "vector notation" because $f(s)$ can be thought of as a column vector (or even a stochastic matrix), and k indexes the kth component of that vector.
- Unary function notation (advocated, apparently, by Michael Hardy) would write the probability as $\Pr(X=k) = f_s(k)$ with $f_s(k) = k^{-s}/\zeta(s)$. There are plenty of precedents for writing function parameters as subscripts, but the disadvantage is that this may become hard to read with several parameters.
- General function notation (advocated by yours truly) would write the probability as $\Pr(X=k) = f(k; s)$ with $f(k; s) = k^{-s}/\zeta(s)$. This is exactly the same as the conventions currently used for continuous distributions, and it has the advantage that several parameters are easily accommodated.
The problem with options 1 and 2 is that they are easy to confuse. Option 3 is unambiguous. --MarkSweep 17:59, 18 August 2005 (UTC)
- Ok, after thinking about it, I like the third choice. Its similarity to the continuous notation is a plus, and rational number subscripts in the first choice worry me even though they are countable. The second is worse because it is hard to specify a particular instance using a real number as a subscript and they are not countable. PAR 18:32, 18 August 2005 (UTC)
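For illustration only: the third choice (general function notation) maps directly onto an ordinary two-argument function, with the main argument k first and the parameter s second. Below is a minimal Python sketch, assuming the standard zeta probability mass function f(k; s) = k^(-s)/ζ(s) for k = 1, 2, 3, ... and real s > 1. The names `riemann_zeta` and `zeta_pmf` are hypothetical helpers made up for this example, and ζ(s) is only approximated (a partial sum plus the leading Euler-Maclaurin tail terms), so this is not a reference implementation.

```python
import math


def riemann_zeta(s, terms=1000):
    """Approximate zeta(s) for real s > 1 (hypothetical helper).

    Sums the first terms-1 terms directly, then adds the leading
    Euler-Maclaurin correction for the remaining tail.
    """
    if s <= 1:
        raise ValueError("zeta(s) converges only for s > 1")
    head = sum(n ** -s for n in range(1, terms))
    tail = terms ** (1 - s) / (s - 1) + 0.5 * terms ** -s
    return head + tail


def zeta_pmf(k, s):
    """General function notation: f(k; s) = k**(-s) / zeta(s).

    k is the main argument (a positive integer); s is the parameter.
    Returns 0.0 for any k outside the support {1, 2, 3, ...}.
    """
    if k < 1 or k != int(k):
        return 0.0
    return k ** -s / riemann_zeta(s)


if __name__ == "__main__":
    # Probability of k = 1 under s = 2, i.e. 1 / zeta(2).
    print(zeta_pmf(1, 2))
```

A family with several parameters would simply take more arguments after k, which is the advantage claimed for this notation above.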