Contents     Contents Index APA Style Guide Dr. P's Place

Frequency Distributions

  1. Ungrouped
    1. Example - [Minitab]
    2. Rules
    3. Review of the Mathematics Involved
  2. Grouped
    1. Example - [Minitab]
    2. Rules

Homework


A frequency distribution is a procedure for describing a set of data. There are two types: ungrouped and grouped.

I. Ungrouped

  1. Example - [Minitab]

    The following data represent the number of hours of TV viewing per week (X) for 20 people.

    7 5 4 7 4 6 6 6 5 4
    6 7 2 7 5 5 6 2 2 7

    What follows is an ungrouped frequency distribution for the above data.

    X f fr=p % Cf Cp C%
    7 5 5/20=.25 25 20=N 1.00=p 100
    6 5 .25 25 15 .75 75
    5 4 .20 20 10 .50 50
    4 3 .15 15 6 .30 30
    3 0 .00 0 3 .15 15
    2 3 .15 15 3 .15 15
        åf=20=N p=1.00 %=100

    How frequently the scores are distributed within the distribution is clearly shown. For example, you can quickly see that half of the people watched 6 or more hours of TV. It is shown in different ways (i.e., f, p, & %). Notice also the Cumulative frequency columns. The concept refers to the frequency of scores falling at or below the upper exact limit. Lastly, note that decimal usage within each column is consistent.

  2. Rules
    1. Locate the extreme values (the lowest score or XL and the highest score or XH) and make a column that lists all the values from XH to XL (this is a convention).
    2. Make an adjacent column labeled "frequency" (f). Count the frequency of each X and put it in the f column. This is called a tally of the scores. Check that the åf = N and put this in a row at the bottom of the column.
    3. Create another column called "relative frequency" or "proportion" (fr = p = f/N). As a general rule, proportions should be expressed in hundredths. Check that the åfr » 1 and show it at the bottom of the column.
    4. Create another column called "percent" (% = fr * 100). As a general rule, percents should be expressed without decimal places (or sometimes with one). Show that å% » 100 at the bottom of the column.
    5. Create another column called "cumulative frequency."
      To do this, start at XL and work your way up by adding all frequencies for the scores at or below the score you are interested in. Thus:
        For XL, the Cf = f
        For XL+1, the Cf = f of XL + f of XL+1, etc.
        The Cf of XH = N.
    6. Create another column called "cumulative proportion" (Cp = Cf/N).
    7. Could create another column called "cumulative percent" (C% = Cp * 100).

  3. Review of the Mathematics Involved
    1. fr = p = f/N = f * 1/N (where 1/N = the unit proportion).
    2. % = fr * 100 = f/n * 100 = f * 100/N (where 100/N = the unit percentage)
    3. åp » 1.00 and å% » 100. "»" rather "=" is used because of rounding error.
      Furthermore, using the unit proportion or percentage in calculations can increase this kind of error.
    4. Cp = Cf/N = Cf * 1/N (and the Cp for XH » 1.00)
    5. C% = Cp * 100 = Cf * 100/N (and the C% for XH » 100).

II. Grouped Frequency Distribution

Useful for large data sets and because they make the form or shape of the distribution more obvious. A disadvantage, though, is that the scores lose their individual identity.

  1. Example - [Minitab]
    The following data are the expected scores on Exam 1 (N=25).

    95 88 81 79 73
    92 88 81 79 72
    92 86 81 77 67
    91 85 80 77 62
    89 84 80 74 61

    A simple tally or ungrouped frequency distribution would not be very helpful in this case. What follows is a grouped frequency distribution for this data.

    Interval Exact Limits Midpoint f p Cf C%
    95-99 94.5-99.5 97   1  .04 25=N 100
    90-94 89.5-94.5 92 3 .12 24 96
    85-89 84.5-89.5 87 5 .20 21 84
    80-84 79.5-84.5 82 6 .24 16 64
    75-79 74.5-79.5 77 4 .16 10 40
    70-74 69.5-74.5 72 3 .12 6 24
    65-69 64.5-69.5 67 1 .04 3 12
    60-64 59.5-64.5 62 2 .08 2 8
    åf=25 p=1.00

    Note how easy it is to see the form or shape of the distribution. For example, it is easy to derive:
    Number
    Grade
    Letter
    Grade
    % of
    Class
    90s
    A
    16
    80s
    B
    44
    70s
    C
    28
    60s
    D
    12
    100  

    However, using just the grouped frequency distribution we wouldn't be able to tell exactly what the highest score is (i.e., the individual scores have lost their identity).

  2. Rules
    1. Compute the Range

      (Consider the scores 4, 5, & 6. R = 6 - 4 + 1 = 3 scores.)
    2. Use about 10 to 20 groups or intervals.
    3. The width of a group or interval (i) should be an odd number.
    4. Make the lowest stated limit divisible by i. This interval should contain XL.
    5. Use the formula:

      to help you determine an appropriate number of groups or intervals to group the data into. Note that the formula is a guide and is not engraved in stone. To work with the formula, start by using the lowest possible value of i (i.e., 3) and divide it into R.

    So, in our example R = XH - XL + 1 = 95 - 61 + 1 = 35.

    Since we have limited space on the screen, an i of 5 was chosen (& thus 7 groups). Note however, that since 61 is not divisible by i, a lowest stated limit of 60 was used which explains why there were actually 8 groups.


Contents Index APA Style Guide Dr. P's Place Copyright © 1997-2006 M. Plonsky, Ph.D.
Comments? mplonsky@uwsp.edu.