lecture 4

11
Lecture 4 Discrete Probability Distributions Random variables A random variable is a numerical description of the outcome of an experiment. A random variable is also known as a chance variable or stochastic variable. Examples 4.1: Experiment Random variable (x) Possible Values for the random variable 1. Make 100 sales call Total number of sales 0,1,2, …,100 2. Inspect a shipment of 70 radios Number of defective radios 0,1,2,…,70 3. Operate a restaurant Number of customers entering in one day 0,1,2,… 4. Build a new library Percentage of project completed after 6 months Discrete random variables If a variable X can assume a discrete set of values , , ,…, with respective probabilities , , ,…, , where + + +…+ =1, we say that a discrete probability distribution for X has been defined. The function , which has the respective values , , ,…, for X = , , ,…, , is called the probability function, or frequency function, of X. 1

Upload: madina-suleimenova

Post on 23-Oct-2014

119 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Lecture 4

Lecture 4Discrete Probability Distributions

Random variables

A random variable is a numerical description of the outcome of an experiment. A random variable is also known as a chance variable or stochastic variable.

Examples 4.1:

Experiment Random variable (x) Possible Values for therandom variable

1. Make 100 sales call Total number of sales 0,1,2,…,100

2. Inspect a shipmentof 70 radios

Number of defective radios 0,1,2,…,70

3. Operate a restaurant Number of customersentering in one day

0,1,2,…

4. Build a new library Percentage of projectcompleted after 6 months

Discrete random variables

If a variable X can assume a discrete set of values , , ,…, with respective probabilities , , ,…, , where + + +…+ =1, we say that a discrete probability distribution for X has been defined.

The function , which has the respective values , , ,…, for X = , , ,…, , is called the probability function, or frequency function, of X.

Because X can assume certain values with given probabilities, it is often called a discrete random variable. Example 4.2. Let a pair of fair dice be tossed and let X denote the sum of the points obtained. Then the probability distribution is as shown in Table4. 1. For example, the probability of

getting sum 5 is . Thus in 900 tosses of the dice we would expect 100 tosses to give the

sum 5.Table4. 1

X 2 3 4 5 6 7 8 9 10 11 121/36 2/36 3/36 4/36 5/36 6/36 5/36 4/36 3/36 2/36 1/36

If X denotes a discrete random variable that can assume the values , , ,…, with respective probabilities , , ,…, , where + + +…+ =1, the expected value of X (or the mathematical expectation of X), denoted by (or ) is defined as

1

Page 2: Lecture 4

For a discrete random variable X with possible values , , ,…, and respective probabilities , , ,…, ( + + +…+ =1) the measure

of the dispersion, or variability of the possible values of X is called the variance and can be calculated by the following way:

Standard deviation: Binomial probability distribution

Binomial experiment is an experiment that satisfies the following conditions: 1. The experiment consists of a sequence of n identical trials.2. Two outcomes are possible on each trial. We refer to one outcome as a success and the

other as a failure.3. The probabilities of the two outcomes do not change from one trial to the next.4. The trials are independent (i.e., the outcomes of one trial does not affect the outcome of

any other trial).

An important discrete random variable associated with the binomial experiment isx= the number of success in the n trials.

x can have a value of 0, 1, 2, 3,….., n, depending on the number of successes observed in the n trials.

The probability distribution associated with this random variable is called the binomial probability distribution.

Mathematical formula for computing the probability of any value for the binomial random variable is

k= 0, 1,…, n (4.1)

Where , n = number of trials, p = probability of success on one trial,

k = number of successes in n trials, f(x=k) = probability of k successes in the n trials

The term n! in the preceding expression is referred to as n factorial and is defined as

And by definition

There are tables (for example, the table in the book of McClave and Benson) that give cumulative binomial probabilities:

2

Page 3: Lecture 4

.

If you know cumulative probabilities, you can find

and.

Example 4.3. The probability of getting exactly 2 heads in 6 tosses of a fair coin is

using formula (4.1) with n = 6, k = 2, and .

Example 4.4. The probability of getting at least 4 heads in 6 tosses of a fair coin is

The discrete probability distribution (4.1) is called the binomial distribution since for k = 0, 1, 2 ,…,n it corresponds to successive terms of the binomial formula, or binomial expansion,

(4.2)

where , are called the binomial coefficients.

Example 4.5. From (4.2) we get:

Number is the number of ways of selecting k objects out of n. For example, from four objects A, B, C, and D, there are six ways of selecting two:AB,AC,AD,BC,BD,CD.

The numbers have the following properties:

1.

2. This property can be seen displayed in Pascal’s triangle:

1 1

1 2 1

1 3 3 1

1 4 6 4 1

1 5 10 10 5 1

1 6 15 20 15 6 1

3.

3

Page 4: Lecture 4

Distribution (4.1) is also called the Bernoulli distribution after James Bernoulli, who discovered it at the end of the seventeenth century. Some properties of the binomial

distribution are listed in Table 4.2

Table 4.2

Mean

Variance

Standard deviation

Example 4.6. In 100 tosses of a fair coin the mean number of heads is

this is the expected number of heads in 100 tosses of the coin. The standard

deviation is .

Example 4.7. To illustrate the binomial probability distribution, let us consider the experiment of

customers entering the Nastke Clothing Store. To keep the problem relatively small, we restrict the experiment to the next three customers. If, based on experience, the store manager estimates that the probability of a customer making a purchase is 0.3, what is the probability that exactly two of the next three customers make a purchase?

We first want to demonstrate that three customers entering the clothing store and deciding whether to make a purchase can be viewed as a binomial experiment. Checking the four requirements for a binomial experiment, we note the following:

1. The experiment can be described as a sequence of three identical trials, one trial for each of the three customers who will enter the store.

2. Two outcomes – the customer makes a purchase (success) or the customer does not make a purchase (failure) – are possible for each trial.

3. The probabilities of the purchase (0.30) and no purchase(0.70) outcomes are assumed to be the same for all customers.

4. The purchase decision of each customer is independent of the purchase decision of the other customers.

Thus, if we define the random variable x as the number of customers making a purchase (i.e., the number of successes in the three trials), we satisfy the requirements of the binomial probability distribution.

With n = 3 trials and the probability of a purchase p = 0.30 for each customer, we use equation (4.1) to compute the probability of two customers making a purchase. This probability, denoted f(2), is

Similarly, the probability of no customers making a purchase, denoted f(0), is

Similarly, equation (4.1) can be used to show that the probabilities of one and three purchases are f(1) = 0.441 and f(3) = 0.027. Table below summarizes the binomial probability distribution for the Nastke Clothing Store problem.

4

Page 5: Lecture 4

k f(x=k)0 0.3431 0.4412 0.1893 0.027

Total 1.000

The Poisson Distribution

The discrete probability distribution

(5.5)

where e = 2.71828… and is a given constant, is called the Poisson distribution after Simeon-Denis Poisson, who discovered it in the early part of the nineteenth century.A Typical examples of random variables for which the Poisson probability distribution provides a good model are as follows:1.The number of industrial accidents per month at a manufacturing plant2.The number of noticeable surface defects (scratches, dents, etc.) found by quality inspectors on a new automobile3.The parts per million of some toxin found in the water or air emission from a manufacturing plant4.The number of customer arrivals per unit of time at a supermarket checkout counter5.The number of death claims received per day by an insurance company6. The number of errors per 100 invoices in the accounting records of a company

Characteristics of a Poisson Random Variable1. The experiment consists of counting the number of times an event occurs during a

given unit of time (or in a given area, or volume, or any other unit of measurement.2. The probability that the event occurs in a given unit of time (or area, or volume, …) is

the same for all the units.3. The number of events that occur in a given unit of time (or area, or volume, …) is

independent of the number of events that occur in other units.

Some properties of the Poisson distribution are listed in Table 4.3

Table 4.3Mean (expected value)

Variance

Standard deviation

Finding Poisson Probabilities

Example4.8 Suppose the number, x, of a company's employees who are absent on Mondays has (approximately) a Poisson probability distribution. Furthermore, assume that the average number of Monday absentees is 2.6.

5

Page 6: Lecture 4

a. Find the mean and standard deviation of x, the number of employees absent on Monday

b. Find the probability that fewer than two employees are absent on a given Monday

c. Find the probability that more than five employees are absent on a given Monday

d. Find the probability that exactly five employees are absent on a given Monday

Solution a. The mean and variance of a Poisson random variable are both equal to . Thus, for this example,

b. To find the probability that fewer than two employees are absent on a given Monday we should use Table4.4.The rows of the Table 4.4 correspond to different values of , and the columns correspond to different values (k) of the Poisson random variable x. The entries in the table give the cumulative

probability

Table4.4 k 0 1 2 3 4 5 6 7 8 9

2.2 .111 .355 .623 .819 .928 .975 .993 .998 1.000

1.000

2.4 .091 .308 .570 .779 .904 .964 .988 .997 .999 1.0002.6 .074 .267 .518 .736 .877 .951 .983 .995 .999 1.0002.8 .061 .231 .469 .692 .848 .935 .976 .992 .998 .9993.0 .050 .199 .423 .647 .815 .916 .966 .988 .996 .9993.2 .041 .171 .380 .603 .781 .895 .955 .983 .994 .9983.4 .033 .147 .340 .558 .744 .871 .942 .977 .992 .9973.6 .027 .126 .303 .515 .706 .844 .927 .969 .988 .9963.8 .022 .107 .269 .473 .668 .816 .909 .960 .984 .9944.0 .018 .092 .238 .433 .629 .785 .889 .949 .979 .9924.2 .015 .078 .210 .395 .590 .753 .867 .936 .972 .9894.4 .012 .066 .185 .359 .551 .720 .844 .921 .964 .9854.6 .010 .056 .163 .326 .513 .686 .818 .905 .955 .9804.8 .008 .048 .143 .294 .476 .651 .791 .887 .944 .9755.0 .007 .040 .125 .265 .440 .616 .762 .867 .932 .9685.2 .006 .034 .109 .238 .406 .581 .732 .845 .918 .9605.4 .005 .029 .095 .213 .373 .546 .702 .822 .903 .9515.6 .004 .024 .082 .191 .342 .512 .670 .797 .886 .9415.8 .003 .021 .072 .170 .313 .478 .638 .771 .867 .9296.0 .002 .017 .062 .151 .285 .446 .606 .744 .847 .916

Probability is a cumulative probability and therefore is the entry in Table4.4in the row corresponding to and the column corresponding to , that is 0.267. c. To find the probability that more than five employees are absent on a given Monday, we consider the complementary event

where 0.951 is the entry in Table4.4 corresponding to and .

6

Page 7: Lecture 4

d. To use Table4.4 to find the probability that exactly five employees are absent on a Monday, we must write the probability as the difference between two cumulative probabilities:

Home Work:

1. A Harris Interactive survey for InterContinental Hotels & Resorts asked respondents, “We traveling internationally, do you generally venture out on your own to experience culture, or stick with your tour group and itineraries?” The survey found that 23% of the respondents stick with their tour group (USA Today, January 21, 2004).

a. In a sample of six international travelers, what is the probability that two will stick with their tour group?

b. In a sample of six international travelers, what is the probability that at least two will stick with their tour group?

c. In a sample of 10 international travelers, what is the probability that none will stick with the tour group?

2. When a new machine is functioning properly, only 3% of the items produced are defective. Assume that we will randomly select two parts produced on the machine and that we are interested in the number of defective parts found.

a. Describe the conditions under which this situation would be a binomial experiment.b. How many experimental outcomes yield 1 defect?c. Compute the probabilities associated with finding no defects, 1 defect, and 2 defects.

3. Military radar and missile detection systems are designed to warn a country of enemy attacks. A reliability question deals with the ability of detection system to identify an attack and issue the warning. Assume that a particular detection system has a 0.90 probability of defecting missile attack. Answer the following questions using the binomial probability distribution.

a. What is the probability that 1 detection system will detect an attack?b. If 2 detection systems are installed in the same area and operate independently, what is

the probability that at least 1 of the systems will detect the attack?c. If 3 systems installed, what is the probability that at least 1 of the systems will detect the

attack?d. Would you recommend that multiple detection systems be operated? Explain.

4. The Federal Deposit Insurance Corporation (FDIC) insures deposits of up to $100,000 in banks that are members of the Federal Reserve System against losses due to bank failure or theft. Over the last 5 years, the average number of bank failures per year among insured banks was 5.8 (FDIC Stats at a Glance, Mar. 2003). Assume that x, the number of bank failures per year among insured banks, can be adequately characterized by a Poisson probability distribution with mean 6. a.Find the expected value and standard deviation of x. b. Find probabilities , ,

7