Talk:Probability generating function
| This article is rated Start-class on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | |||||||||||||||||||||
| |||||||||||||||||||||
The listed p.g.f for negative binomial is wrong. There should be no on the numerator, i.e.
- It all depends on whether you are after the number of trials required to achieve n successes (which is at least n) or the number of failures before the nth success (which may be 0). From a quick check of the article geometric distribution, what you suggest above is correct for the latter definition, while the formula in the article is consistent with the text, which refers to the former. Ben Cairns 03:11, 2 Mar 2005 (UTC).
- Actually, it is correct that there should be no in the numerator. Having it in the numerator is appropriate in the case where the distribution is on , but the text here has it on for which there is clearly a nonzero probability of Since the probability of is found by plugging in it's easy to see that we cannot have in the numerator. Joelmiller (talk) 03:40, 23 August 2017 (UTC)
What on earth does the notation G(1-) mean? It's used all over this article, and I've never seen it anywhere before. Is it supposed to be G(-1)? Or G(1)? Or is it just a typo? Could someone clarify/correct it please.
- I hadn't seen it before but it is defined in the article. It is the limit as z->1 of G(z). Since this is used so much it is useful to have a shorthand though I have not seen this one before. --Richard Clegg 18:15, 4 June 2006 (UTC)
- I'm not sure, but doesn't it mean the limit of G(x) as x approaches 1 from below? PAR 18:25, 4 June 2006 (UTC)
- I think it is just a "typo". BM
The relation to "moment generating function"
[edit]Some Wikipedia articles about discrete random variables such as "Geometric distribution" use the term "moment generating function" instead of "probability generating function". So it would be useful to state that these are equivalent ideas in such cases.
edit; I don't mean "equivalent ideas". I should say "related ideas". My thought is that it would be useful to tell a reader if he can get the formula for the probability generating function by looking at the formula for the moment generating function since the articles favor giving the moment generating function.
- Can't this be merged into moment generating function? I can't see why this needs a separate article. —3mta3 (talk) 13:48, 3 May 2009 (UTC)
Examples
[edit]We need to take a close look at the didactical merits of the examples, deleting some if necessary. -Zahlentheorie (talk) 13:32, 8 July 2010 (UTC)
Clarifications needed
[edit]- What is "to make available the well-developed theory of power series with non-negative coefficients" in the lead referring to? ... the article seems to have nothing relevant (and a link to another article would be good if there is something relevant). The content at Series (mathematics)##Non-negative terms does not indicate anything particularly useful. And there seems nothing in Abel's theorem that makes specific use of "positive coefficients".
- The paragraph including "the probability-generating function of the difference of two independent random variables" must be extending the types of random variables concerned to include ones which can take negative values, but these are excluded from the present definition.
- The treatment of the G(1-) stuff needs to be rethought, particularly for the derivatives. Can a general reader be expected to interpret the content here as meaning that the mean does not exist if the the limit for the derivative does not exist?
Melcombe (talk) 12:08, 20 September 2010 (UTC)
- Replying to a 16-year-old post doesn't make a lot of sense, but I've just landed on that page for the first time because of Quantling's comment below. In case someone is wondering: there is indeed a well-developed theory of power series with non-negative coefficients. A central result in that context is the Vivanti–Pringsheim theorem. For more on this, see e.g. Flajolet's "Analytic combinatorics". Malparti (talk) 19:36, 13 March 2026 (UTC)
Why limit as z approaches 1 from below?
[edit]If we plug in z = 1 without taking any limits, the power series evaluates to the sum of the probability masses. That is simply 1 without needing any limits. As far as I can tell the talk of limits can be removed from the article. (If I have this wrong; please provide an example where G(z=1) does not converge absolutely. Thank you —Quantling (talk | contribs) 17:51, 13 March 2026 (UTC)
- You're correct. Note however that it is necessary to take the limit for derivatives; and that the notation that is currently (namely ) has to be introduced somewhere (so fixing the article will likely take a bit more work than just removing that limit). Malparti (talk) 19:32, 13 March 2026 (UTC)
- Thank you for your quick response. I'm not so sure that the limit helps with derivatives. Sure, with a probability distribution such as for and , the value will diverge unless . When , we get , both finite. When , I am thinking that both and will diverge "to ". That is, for this family of probability distributions, we can replace all instances of with . Is there a probability distribution that you know of where the probability generating function differs for and ? —Quantling (talk | contribs) 20:22, 13 March 2026 (UTC)
- There is no such example, in the following sence:
- since is nonnegative, always exists in ;
- since is differentiable on and is nondecreasing (because the coefficients of the power series are nonnegative), always exists in ;
- by monotone convergence, in .
- The issue is simpler than that: it's just that isn't well-defined in 1, in the usual sense (i.e. as a real or complex derivative) : for this, you'd need to be defined on a neighborhood of 1.
- Now, I agree that if you interpret as a real derivative, then exists and will give you the same result as what you get by "plugging in" in the formal derivative of the power series... So you could very well make up your own convention and say that . However:
- That is a non-standard convention, so non-ideal for Wikipedia;
- I'd advice against it, because it tends to hide the fact that something special could happen in 1; for instance, if the radius of convergence of the power series is 1, then 1 is guaranteed to be a singularity. Also, this "natural" convention is likely more arbitrary than it seems, because I assume — I haven't thought this through so I might very well be wrong here — that it should be possible to find probability generating functions with a singularity in 1 where you can get different limits of as from within the disc of convergence. If that is indeed the case, then the convention wouldn't make sense.
- Malparti (talk) 23:39, 13 March 2026 (UTC)
- Ah, I see, I think. I am interpreting as a formal power series created by differentiating the formal power series , term by term, a total of times. With that it is okay to simply evaluate the resulting formal power series with . However if we interpret as a (non-formal) function then differentiating at the point even once requires that be well defined in a neighborhood of that point.
- When we define we say that it is a generating function. I think if we're clear that by "differentiation" we mean "as a formal power series, term by term" then we can replace occurrences of with and simplify the discussion accordingly. However, without such a clarifying sentence, readers might easily assume that we mean differentiation in the normal sense for functions; keeping the and related discussion would cover that interpretation.
- Perhaps it boils down to which is more accessible to the reader: (a) formal differentiation of a formal power series term by term vs. (b) computing the desired values as limits. —Quantling (talk | contribs) 18:15, 14 March 2026 (UTC)
- The probability generating function is typically introduced as for or for — depending how familiar the audience is complex analysis.
- Now, is not a formal power series: it is defined entirely differently, and then proved to be numerically equal to the series , inside its disk of convergence. Of course, given this equality, it makes perfect sense if you want to define the probability generating function as a formal power series, rather than as a function — and in fact, this standard in analytic combinatorics. But it's not in probability theory (at least at the undergrad level).
- As a result, in probability courses the standard approach is to say that the expected value (in ) of is obtained as "the limit of as on the real line" (if is viewed as a real function, then it's also quite common to phrase things in terms of its left-derivative); not as "what you get by evaluating the corresponding power series at ". So I think this article should to stick to that.
- Cheers,
- Malparti (talk) 23:28, 14 March 2026 (UTC)
- I think of as a formal power series equal to ... I guess my background is analytic combinatorics. But because you indicate that this expectation is instead an ordinary function in undergraduate probability texts, that's good enough for me. Thanks! —Quantling (talk | contribs) 19:35, 15 March 2026 (UTC)
- @Malparti Maybe I gave in too easily. After all, a probability generating function is a generating function, right? And according to that Wikipedia article, a generating function is a formal series, not an ordinary function, right? I conclude that we are formally differentiating a formal series, not ordinary differentiation of a function that is defined by an ordinary series. So sure, we should be sure to explain what we mean by formal differentiation of a formal power series (in a brief sentence, with a wikilink to an appropriate article), but once we've done it, I am still thinking it is more appropriate to use than . —Quantling (talk | contribs) 02:59, 16 March 2026 (UTC)
- @Quantling "a probability generating function is a generating function, right? And according to that Wikipedia article, a generating function is a formal series, not an ordinary function, right?" → A <blabla> is whatever you define it to be in whatever field you're working in. In combinatorics, it is natural to view generating functions as formal power series, because they can have a zero radius of convergence. That's not the case for PGFs: they carry a lot of extra structure so it makes sense to view them as more constrained objects than formal power series.
- My point is not that defining as "the derivative of the formal power series, evaluated in 1" wouldn't make sense — it does make sense, in the sense that it doesn't create any obvious contradiction (although, as I said, I personally think it is a bad idea, because any notation that hides potential pitfalls under the rug is a recipe for errors, in practice). My point is: that's not the way things are done in probability theory, at least not in the textbooks I've personally read. So, if you want to keep arguing on this, you need to take the time to look at sources.
- Here is a (short) list of some of the most influential textbooks in probability theory, combining some old classics with more modern texts: Feller, Billingsley, Shiryaev, Chung, Williams, Durrett. I don't have all of them on my computer, and I do have a few other ones:
- Feller (1968) views as a real function, defined where the series converges; after discussing the convention , sets ; see Chapter XI.
- Kallenberg (1997) only mentions the PGF in passing, and defines it as a real function on , and doesn't really say anything about; see page 61.
- Durrett (2019) also doesn't spend a lot of time on the PGF, and also defines in as a real function on and is careful to say "We may have φ(s) = ∞ when s > 1 so we have to work carefully" and then goes on to use .
- I don't think Chung and Billingsley discuss PGFs.
- I don't think the burden should be on me to look at other sources.
- As a Wikipedia contributor, your contributions should not be to give your opinion on which convention makes more sense when working with PGFs, but rather to see what convention sources use, and summarize them here. Malparti (talk) 10:06, 16 March 2026 (UTC)
- Thank you for As a Wikipedia contributor, your contributions should not be to give your opinion on which convention makes more sense when working with PGFs, but rather to see what convention sources use, and summarize them here. I agree and I revert back to yielding to using the limit formulations. —Quantling (talk | contribs) 14:35, 16 March 2026 (UTC)
- @Malparti Maybe I gave in too easily. After all, a probability generating function is a generating function, right? And according to that Wikipedia article, a generating function is a formal series, not an ordinary function, right? I conclude that we are formally differentiating a formal series, not ordinary differentiation of a function that is defined by an ordinary series. So sure, we should be sure to explain what we mean by formal differentiation of a formal power series (in a brief sentence, with a wikilink to an appropriate article), but once we've done it, I am still thinking it is more appropriate to use than . —Quantling (talk | contribs) 02:59, 16 March 2026 (UTC)
- I think of as a formal power series equal to ... I guess my background is analytic combinatorics. But because you indicate that this expectation is instead an ordinary function in undergraduate probability texts, that's good enough for me. Thanks! —Quantling (talk | contribs) 19:35, 15 March 2026 (UTC)
- There is no such example, in the following sence:
- Thank you for your quick response. I'm not so sure that the limit helps with derivatives. Sure, with a probability distribution such as for and , the value will diverge unless . When , we get , both finite. When , I am thinking that both and will diverge "to ". That is, for this family of probability distributions, we can replace all instances of with . Is there a probability distribution that you know of where the probability generating function differs for and ? —Quantling (talk | contribs) 20:22, 13 March 2026 (UTC)