lecture 4
TRANSCRIPT
Lecture 4Discrete Probability Distributions
Random variables
A random variable is a numerical description of the outcome of an experiment. A random variable is also known as a chance variable or stochastic variable.
Examples 4.1:
Experiment Random variable (x) Possible Values for therandom variable
1. Make 100 sales call Total number of sales 0,1,2,…,100
2. Inspect a shipmentof 70 radios
Number of defective radios 0,1,2,…,70
3. Operate a restaurant Number of customersentering in one day
0,1,2,…
4. Build a new library Percentage of projectcompleted after 6 months
Discrete random variables
If a variable X can assume a discrete set of values , , ,…, with respective probabilities , , ,…, , where + + +…+ =1, we say that a discrete probability distribution for X has been defined.
The function , which has the respective values , , ,…, for X = , , ,…, , is called the probability function, or frequency function, of X.
Because X can assume certain values with given probabilities, it is often called a discrete random variable. Example 4.2. Let a pair of fair dice be tossed and let X denote the sum of the points obtained. Then the probability distribution is as shown in Table4. 1. For example, the probability of
getting sum 5 is . Thus in 900 tosses of the dice we would expect 100 tosses to give the
sum 5.Table4. 1
X 2 3 4 5 6 7 8 9 10 11 121/36 2/36 3/36 4/36 5/36 6/36 5/36 4/36 3/36 2/36 1/36
If X denotes a discrete random variable that can assume the values , , ,…, with respective probabilities , , ,…, , where + + +…+ =1, the expected value of X (or the mathematical expectation of X), denoted by (or ) is defined as
1
For a discrete random variable X with possible values , , ,…, and respective probabilities , , ,…, ( + + +…+ =1) the measure
of the dispersion, or variability of the possible values of X is called the variance and can be calculated by the following way:
Standard deviation: Binomial probability distribution
Binomial experiment is an experiment that satisfies the following conditions: 1. The experiment consists of a sequence of n identical trials.2. Two outcomes are possible on each trial. We refer to one outcome as a success and the
other as a failure.3. The probabilities of the two outcomes do not change from one trial to the next.4. The trials are independent (i.e., the outcomes of one trial does not affect the outcome of
any other trial).
An important discrete random variable associated with the binomial experiment isx= the number of success in the n trials.
x can have a value of 0, 1, 2, 3,….., n, depending on the number of successes observed in the n trials.
The probability distribution associated with this random variable is called the binomial probability distribution.
Mathematical formula for computing the probability of any value for the binomial random variable is
k= 0, 1,…, n (4.1)
Where , n = number of trials, p = probability of success on one trial,
k = number of successes in n trials, f(x=k) = probability of k successes in the n trials
The term n! in the preceding expression is referred to as n factorial and is defined as
And by definition
There are tables (for example, the table in the book of McClave and Benson) that give cumulative binomial probabilities:
2
.
If you know cumulative probabilities, you can find
and.
Example 4.3. The probability of getting exactly 2 heads in 6 tosses of a fair coin is
using formula (4.1) with n = 6, k = 2, and .
Example 4.4. The probability of getting at least 4 heads in 6 tosses of a fair coin is
The discrete probability distribution (4.1) is called the binomial distribution since for k = 0, 1, 2 ,…,n it corresponds to successive terms of the binomial formula, or binomial expansion,
(4.2)
where , are called the binomial coefficients.
Example 4.5. From (4.2) we get:
Number is the number of ways of selecting k objects out of n. For example, from four objects A, B, C, and D, there are six ways of selecting two:AB,AC,AD,BC,BD,CD.
The numbers have the following properties:
1.
2. This property can be seen displayed in Pascal’s triangle:
1 1
1 2 1
1 3 3 1
1 4 6 4 1
1 5 10 10 5 1
1 6 15 20 15 6 1
3.
3
Distribution (4.1) is also called the Bernoulli distribution after James Bernoulli, who discovered it at the end of the seventeenth century. Some properties of the binomial
distribution are listed in Table 4.2
Table 4.2
Mean
Variance
Standard deviation
Example 4.6. In 100 tosses of a fair coin the mean number of heads is
this is the expected number of heads in 100 tosses of the coin. The standard
deviation is .
Example 4.7. To illustrate the binomial probability distribution, let us consider the experiment of
customers entering the Nastke Clothing Store. To keep the problem relatively small, we restrict the experiment to the next three customers. If, based on experience, the store manager estimates that the probability of a customer making a purchase is 0.3, what is the probability that exactly two of the next three customers make a purchase?
We first want to demonstrate that three customers entering the clothing store and deciding whether to make a purchase can be viewed as a binomial experiment. Checking the four requirements for a binomial experiment, we note the following:
1. The experiment can be described as a sequence of three identical trials, one trial for each of the three customers who will enter the store.
2. Two outcomes – the customer makes a purchase (success) or the customer does not make a purchase (failure) – are possible for each trial.
3. The probabilities of the purchase (0.30) and no purchase(0.70) outcomes are assumed to be the same for all customers.
4. The purchase decision of each customer is independent of the purchase decision of the other customers.
Thus, if we define the random variable x as the number of customers making a purchase (i.e., the number of successes in the three trials), we satisfy the requirements of the binomial probability distribution.
With n = 3 trials and the probability of a purchase p = 0.30 for each customer, we use equation (4.1) to compute the probability of two customers making a purchase. This probability, denoted f(2), is
Similarly, the probability of no customers making a purchase, denoted f(0), is
Similarly, equation (4.1) can be used to show that the probabilities of one and three purchases are f(1) = 0.441 and f(3) = 0.027. Table below summarizes the binomial probability distribution for the Nastke Clothing Store problem.
4
k f(x=k)0 0.3431 0.4412 0.1893 0.027
Total 1.000
The Poisson Distribution
The discrete probability distribution
(5.5)
where e = 2.71828… and is a given constant, is called the Poisson distribution after Simeon-Denis Poisson, who discovered it in the early part of the nineteenth century.A Typical examples of random variables for which the Poisson probability distribution provides a good model are as follows:1.The number of industrial accidents per month at a manufacturing plant2.The number of noticeable surface defects (scratches, dents, etc.) found by quality inspectors on a new automobile3.The parts per million of some toxin found in the water or air emission from a manufacturing plant4.The number of customer arrivals per unit of time at a supermarket checkout counter5.The number of death claims received per day by an insurance company6. The number of errors per 100 invoices in the accounting records of a company
Characteristics of a Poisson Random Variable1. The experiment consists of counting the number of times an event occurs during a
given unit of time (or in a given area, or volume, or any other unit of measurement.2. The probability that the event occurs in a given unit of time (or area, or volume, …) is
the same for all the units.3. The number of events that occur in a given unit of time (or area, or volume, …) is
independent of the number of events that occur in other units.
Some properties of the Poisson distribution are listed in Table 4.3
Table 4.3Mean (expected value)
Variance
Standard deviation
Finding Poisson Probabilities
Example4.8 Suppose the number, x, of a company's employees who are absent on Mondays has (approximately) a Poisson probability distribution. Furthermore, assume that the average number of Monday absentees is 2.6.
5
a. Find the mean and standard deviation of x, the number of employees absent on Monday
b. Find the probability that fewer than two employees are absent on a given Monday
c. Find the probability that more than five employees are absent on a given Monday
d. Find the probability that exactly five employees are absent on a given Monday
Solution a. The mean and variance of a Poisson random variable are both equal to . Thus, for this example,
b. To find the probability that fewer than two employees are absent on a given Monday we should use Table4.4.The rows of the Table 4.4 correspond to different values of , and the columns correspond to different values (k) of the Poisson random variable x. The entries in the table give the cumulative
probability
Table4.4 k 0 1 2 3 4 5 6 7 8 9
2.2 .111 .355 .623 .819 .928 .975 .993 .998 1.000
1.000
2.4 .091 .308 .570 .779 .904 .964 .988 .997 .999 1.0002.6 .074 .267 .518 .736 .877 .951 .983 .995 .999 1.0002.8 .061 .231 .469 .692 .848 .935 .976 .992 .998 .9993.0 .050 .199 .423 .647 .815 .916 .966 .988 .996 .9993.2 .041 .171 .380 .603 .781 .895 .955 .983 .994 .9983.4 .033 .147 .340 .558 .744 .871 .942 .977 .992 .9973.6 .027 .126 .303 .515 .706 .844 .927 .969 .988 .9963.8 .022 .107 .269 .473 .668 .816 .909 .960 .984 .9944.0 .018 .092 .238 .433 .629 .785 .889 .949 .979 .9924.2 .015 .078 .210 .395 .590 .753 .867 .936 .972 .9894.4 .012 .066 .185 .359 .551 .720 .844 .921 .964 .9854.6 .010 .056 .163 .326 .513 .686 .818 .905 .955 .9804.8 .008 .048 .143 .294 .476 .651 .791 .887 .944 .9755.0 .007 .040 .125 .265 .440 .616 .762 .867 .932 .9685.2 .006 .034 .109 .238 .406 .581 .732 .845 .918 .9605.4 .005 .029 .095 .213 .373 .546 .702 .822 .903 .9515.6 .004 .024 .082 .191 .342 .512 .670 .797 .886 .9415.8 .003 .021 .072 .170 .313 .478 .638 .771 .867 .9296.0 .002 .017 .062 .151 .285 .446 .606 .744 .847 .916
Probability is a cumulative probability and therefore is the entry in Table4.4in the row corresponding to and the column corresponding to , that is 0.267. c. To find the probability that more than five employees are absent on a given Monday, we consider the complementary event
where 0.951 is the entry in Table4.4 corresponding to and .
6
d. To use Table4.4 to find the probability that exactly five employees are absent on a Monday, we must write the probability as the difference between two cumulative probabilities:
Home Work:
1. A Harris Interactive survey for InterContinental Hotels & Resorts asked respondents, “We traveling internationally, do you generally venture out on your own to experience culture, or stick with your tour group and itineraries?” The survey found that 23% of the respondents stick with their tour group (USA Today, January 21, 2004).
a. In a sample of six international travelers, what is the probability that two will stick with their tour group?
b. In a sample of six international travelers, what is the probability that at least two will stick with their tour group?
c. In a sample of 10 international travelers, what is the probability that none will stick with the tour group?
2. When a new machine is functioning properly, only 3% of the items produced are defective. Assume that we will randomly select two parts produced on the machine and that we are interested in the number of defective parts found.
a. Describe the conditions under which this situation would be a binomial experiment.b. How many experimental outcomes yield 1 defect?c. Compute the probabilities associated with finding no defects, 1 defect, and 2 defects.
3. Military radar and missile detection systems are designed to warn a country of enemy attacks. A reliability question deals with the ability of detection system to identify an attack and issue the warning. Assume that a particular detection system has a 0.90 probability of defecting missile attack. Answer the following questions using the binomial probability distribution.
a. What is the probability that 1 detection system will detect an attack?b. If 2 detection systems are installed in the same area and operate independently, what is
the probability that at least 1 of the systems will detect the attack?c. If 3 systems installed, what is the probability that at least 1 of the systems will detect the
attack?d. Would you recommend that multiple detection systems be operated? Explain.
4. The Federal Deposit Insurance Corporation (FDIC) insures deposits of up to $100,000 in banks that are members of the Federal Reserve System against losses due to bank failure or theft. Over the last 5 years, the average number of bank failures per year among insured banks was 5.8 (FDIC Stats at a Glance, Mar. 2003). Assume that x, the number of bank failures per year among insured banks, can be adequately characterized by a Poisson probability distribution with mean 6. a.Find the expected value and standard deviation of x. b. Find probabilities , ,
7