Chapter
6
Not
only is probability of great importance in testing, it is of great importance in
all areas of human thought. I
believe its importance compares favorably with that of reading.
When
we say in English that an event will probably occur, we usually mean something
like:
“the
chances are in its (the event’s ) favor
“it
is more likely to occur than to not occur”
“the
odds are for it”
We
can specify degrees of probability without recourse to numbers by saying that
something is very probable, extremely probable or nearly certain. Such phrases suggest getting closer and closer to something
and that something can be labeled nicely as “certainty.” It has turned out
to be convenient to use numbers for degrees of probability, though.
Since there are as many points as we could ever need between the number 0
and the number 1, and since we seem to be approaching a particular point as
something gets more and more probable, 1 serves as the point of certainty and 0
as the point of impossibility.
When
we say that something is very, very probable, that it is quite likely to occur,
we are trying to convey the idea of “a little looseness.” There doesn’t seem to be any looseness at all in the
connection between my getting married and my becoming a husband. Anybody who said it was probable that I should become a
husband if I marry would be playing with language--that is, kidding--or
misunderstanding what the words mean. The
event of becoming a husband isn’t probable under the hypothesis of my getting
married; it is absolutely certain. Here,
then, there is no looseness at all. But
absolutely certain events are rare and usually the genuinely certain ones
aren’t the events in this world that we are interested in.
We are interested in contingent--chancy--events like winning a game or an
election, amazing a certain person or, as teachers, having our students succeed.
We know there will be a little looseness.
What we are interested in is: how
much? We can learn to express
little bits of looseness in words, common fractions or decimal fractions.
An
event that is very highly probable but not certain is one that isn’t defined
certainly. Suppose I were to have a
wrestling match with the world heavyweight champion.
If everyone agrees to call the match a victory for him no matter what
happens during the match itself, we might as well not bother to wrestle.
The result is a foregone conclusion.
However, by the usual rules, there does seem to be a microscopic
possibility that I--smaller, older, less fit and less knowledgeable--would win.
We sometimes refer to thought and say, “It is conceivable.
I can imagine, under very weird circumstances, a victory for Kirby.”
Maybe my appearance would reduce the champion to laughing helplessness.
Maybe I would be lucky. And
here we come to the common fraction idea--if somehow, I could wrestle the
champion 100 or 1,000,000 times, maybe, just maybe, I could win once.
So we say the probability is 1/100 or 1/1,000,000.
Some people would prefer to say the odds are 1 to 99 or 1 to 999,999 in
my favor. In the same way, the
probability that the champ will win is 99/100 or 999,999/1,000,000.
Or the odds are 99 to 1 against me.
Or there is a 99% chance that I’ll lose.
If
the probability that I will win is 1/100 and something makes it a little less
likely, that something has loosened the connection between my trying and my
succeeding a little bit. The new
probability might be 1/101 or 1/102. That
is, I might have to wrestle him a few more times before I could expect one
measly victory. However, handling
numbers like 1/100 and 1/102 is awkward. In fact, it is a big job to find how much the probability
fell when it went from 1/101 to 1/102. So,
decimals come in. Decimal fractions
aren’t as pictorial as common fractions but they are much easier to calculate
with.
Probability
is an amazing subject in many ways. Why
the ancients didn’t develop it is certainly a mystery.
The first person usually cited as a probabilist is the “gambling
scholar” of Italy, Cardano (1501-1576) but the real grandfathers of
probability are the French mathematicians, Pascal and Fermat.
In the 1600’s, these men discussed, by correspondence, questions
concerning the probable outcome of games. However,
it has only been in this century that the subject has expanded to its present
incredible extent and variety. It
is certainly a mathematical subject and the purely mathematical writings on such
things as stochastic processes are steadily expanding.
Probability is also a philosophical subject and such modern philosophers
as Lucas, Kyburg, Salmon, Ayers and Wisdom find it worthwhile.
These men are probably (there’s that notion again) not familiar to you
but if you have an opportunity to study areas of philosophy dealing with the
ultimate difference between men and machines or the meaning of scientific
investigation, you would meet them. However,
probability would be a limited and rarified subject if it were of interest only
to mathematicians and philosophers. Today,
sciences from the physics of the tiniest bits of matter to the study of mass
psychology find probabilistic concepts and calculations the only language and
tools useful in many circumstances. There
is more. Virtually every branch of
government, from urban planning to crime control to weather prediction to
national defense is steadily expanding its use of probability.
Business methods of planning and decision-making based on probability are
continuously moving from theoretical studies in graduate schools to practical
applications in on-going companies with immediate problems.
In education, the most common explicit use of probability is in graduate
or research investigations. But the
concepts and some methods of manipulating probabilities are spreading.
There has been an interest in making probabilistic statements in
counseling and career planning for some time.
A student might be told that he has only a 10% chance of success as a
forester, instead of being told he won’t make a successful forester.
Of course, testing draws heavily on probabilistic ideas and methods.
We often make use of probability to try to differentiate between skill
and luck, as in the chapter on the lady-testing-tea test.
I
must admit, though, that all uses of probability on real-world, or empirical,
data result in nothing more than probabilistic statements in the end.
There is no way we can start with probabilities and merely calculate our
way to certainly. Still
probabilities can give us very helpful guidance, especially after we have gained
experience in reading and using them. A
famous example of the unusual insight we can sometimes get from probability but
not from our personal judgment is called “the birthday problem.”
The question is: How many
people must be in a room before there is a good chance that at least two of them
will have the same day of the year as their birthday? Answers
to the question are often wildly off the mark.
Many mathematics books, such as the excellent “Introduction to Finite
Mathematics” by Kemeny, Snell and Thompson give the answer and a full
explanation. (There is 70% chance
with 30 people and 99% with 60 people.)
If you grant that the subject may be worthwhile, what constitutes a start in learning it? The most basic things have already been mentioned:
Beyond
these simple principles, we are already beginning to climb into the subject
proper. Most initial instruction
goes from these principles into notation and manipulation. For the use of probability in studying problems of testing
and grading, our notational needs are simple.
We will write “the probability of A”, where A is some event, so often
that it is handy to have the usual symbolism
P (the event A) = P(A) =
for
“the probability of the event A equals”. Occasionally we wish to be able to
state restrictions or special conditions for the event. For instance, the probability of the event “having a girl
baby” could be symbolized P(having a girl baby).
If we were interested in the probability subject to the condition that
the expectant mother had red hair, the typical symbolism is
P
(having a girl baby|a red-headed mother).
The
vertical stroke is read “given.” Events
and their conditions may be symbolized by letters but it is important to
remember that such letters stand for English phrases, not for numbers.
So, using G for “having a girl baby” and R for “red-headed
mother,” we could write more quickly
P (G|R)
Instruction
in the manipulation of basic probabilities can be quite complex and most of the
usual matters are outside the scope of this book. However, one part of the traditional material is useful and
easily mastered. It concerns
successful repetitions of an event. Suppose
there is a 50% chance of having a girl baby and a 50% chance of having a boy.
The question is: What happens to the 50% as we specify longer and longer
strings of girls born to the same mother? Some
students have had enough instruction in probability to remember something about
chances remaining the same. Partial
memories will lead them astray if they believe that the chances of two girls
(and no boys) in a family of 2 children are the same as the chances of one girl
in a family of one child. There is
something that remains the same and it is just that initial 50%.
We get the probability of an event happening “in a row” by using the
base probability, the 50%. But whatever we do, we clearly must arrive at smaller chances
for more difficult or more unlikely events.
It is harder to get two girls in a row, with no intervention from boys,
than to get a single girl at the beginning.
It is harder to get five heads in a row with a coin than to get one head
with one flip.
If
half the babies born are girls, and all those with daughters have a second child
and the chances of a girl are always 50%, then half of those with a girl the
first time will have a girl the second time.
Half had a girl the first time and half of that half will have a girl the
second time.
P (G twice) = P(G) x P(G) =
[P (G)]2 = 1/2 x 1/2 = 1/4 = 25%
How
about three daughters? You can’t
have three daughters if your first two children aren’t girls. You have a 1/4 chance [P(G)]2 of two girls.
If you are in the two girls group, you have that same old 50% chance of
another girl.
P (G three times) =
P(G) x P(G) x P(G)
=
[P(G)]3 = 1/2 x 1/2 x
1/2 =
1/8 =12.5%.
So
the probability of an event occurring N times in a row equals N of the base
probabilities multiplied together.
P (event N times in a row) =
The
next step in probability instruction is usually to learn how to calculate the
chance of specified combinations of successes and failures.
We’ll skip that.
That
about winds up a basic introduction to probability for testing and grading use.
The only important matter omitted so far is the actual obtaining of a
probability. In purely mathematical
work, this step may be omitted as being a problem of nonmathematical practical
interest. However, we will need
probabilities. How the initial base
probability is best obtained is the subject of sharp and continuing debate.
There are four main methods--counting, direct opinion, analysis of bets
and mathematical models.
For
the present, we will restrict ourselves to counting.
How do we obtain the probability of randomly selecting a girl from a
class? We count the number of girls
and the number of nongirls (sometimes called “boys”). Then the
probability is
number of girls
number
of students in all
This
“frequentist” definition of a probability can be applied to many situations.
When I ask students about the probability of randomly selecting a person
from the class who is mentally
healthy, some of them protest. Mental
health is a vague concept, they say, and the subject of disagreement and debate,
even among experts. I agree, but
the probability can still be defined in the same way:
P
(randomly selecting a mentally healthy person) =
number of mentally healthy student
total number
of students
Is
this idea a vacuous one? No,
because various definitions could be applied to a group to see what sort of
probabilities emerged. In such an
investigation, new insights into the meaning of mental health and its
appropriate definition might be uncovered.
Why might a teacher want the probability of selecting students with
various characteristics? Because
probabilities offer a better grasp of the complexities of a class than mere
percentages. Some people say that being told 80% of a class is
right-handed something conveys little to them.
But thinking of the 80% as the chance of selecting a student can help.
In the 80% right-handed class, there is a 64% chance of picking a
right-handed student twice in a row. It
is just about uncertain whether you can choose three such students in a row or
not. (P) = .8 x .8 x .8 = .512)
However, it is definitely unlikely that you will pick four in a row.
(P) = (.8)4 = .41) The
more you know about probability, the more you can play with that 80% figure.
Probability can clearly make percentage figures more meaningful and
useful.
Feller,
W., An Introduction to Probability Theory and Its Applications, 3rd edition,
1968, Wiley.
Kemeny,
J. et al., Introduction to Finite Mathematics, 3rd edition, 1974, Prentice-Hall.