A known relationship that is usually given axiomatically:

Upon rearrangement gives the multiplication rule of probability:

Now observe a cool set up that is handy to keep in mind for proving the law of total probability and Bayes’ theorem.

Imagine that happens with one and only one of mutually exclusive events , i.e.:

By addition rule:

.

Now by multiplication rule:

.

This is the law of total probability

From the same set up imagine that we want to find the probability of even if is known to have happened. By the multiplication rule:

By neglecting and dividing the rest through we get:

And applying the law of total probability to the bottom we have the Bayes’ equation

Bunch of examples:

Problem: is a known probability of receiving phone calls during time interval . Also . Assuming that a number of received calls during two adjeicent time periods are independent find the probability of receiving calls for the time interval that equal .

Solution: Let be an event consisted of call in the interval till . Then clearly

which means that the event can be seen as sum of mutually exclusive events, such that in the first interval of duration number of calls received is and in the second interval of the same duration number of received calls is (). By rule of addition

.

By the rule of multiplication

If we change the notation so that

then

.

It is known that under quite general conditions

(Recall that the Poisson distribution is an appropriate model if the following assumptions are true. (a) is the number of times an event occurs in an interval and can take values . (b) The occurrence of one event does not affect the probability that a second event will occur. That is, events occur independently. (c) The rate at which events occur is constant. The rate cannot be higher in some intervals and lower in other intervals (that kinda a lot to take on faith really). (d) Two events cannot occur at exactly the same instant; instead, at each very small sub-interval exactly one event either occurs or does not occur. (e) The probability of an event in a small sub-interval is proportional to the length of the sub-interval. Or instead of those assumptions, the actual probability distribution is given by a binomial distribution and the number of trials is sufficiently bigger than the number of successes one is asking about (binomial distribution approaches Poisson).)

Parametrisation then gives

Note that

Then

The key point is that if for time interval we have that parametrized formula for we have the one above. It holds true for any multiples of as well.