|
|
DataMuseum.dkPresents historical artifacts from the history of: DKUUG/EUUG Conference tapes |
This is an automatic "excavation" of a thematic subset of
See our Wiki for more about DKUUG/EUUG Conference tapes Excavated with: AutoArchaeologist - Free & Open Source Software. |
top - metrics - downloadIndex: T c
Length: 4150 (0x1036)
Types: TextFile
Names: »contab.man«
└─⟦a0efdde77⟧ Bits:30001252 EUUGD11 Tape, 1987 Spring Conference Helsinki
└─⟦this⟧ »EUUGD11/stat-5.3/eu/stat/doc/contab.man«
CONTAB(1) |STAT February 3, 1987
NAME
contab - contingency table and chi-square analysis
SYNOPSIS
contab [-bsy] [-i nfactors] [-c entries] [factor names]
OPTIONS
-b Use blank lines to make frequency tables more readable. This
option is recommended when the cells are filled with values from
the cell entry format option.
-c cell-entries
Choose table cell entries from expected values (e), differences
of obtained and expected frequencies (d), all percentages (p),
row percentages (r), column percentages (c), or cell percentages
of the total count (t).
-i nfactors
Maximum number of factors in interactions. By default, all
interactions are analyzed, however, it may be that only those
contingency tables involving one or two factors will be of
interest.
-s Do not print significance tests (only print tables).
-y Do not apply Yates' correction for continuity for chi-square
tests with one degree of freedom.
DESCRIPTION
Hays (1973) warns ``...there is probably no other statistical method
that has been so widely misapplied.'' (p. 735). Contingency tables
and chi-square are used to summarize and test for association between
frequencies of events.
Input Format
The input format is simple. Each cell frequency is preceded by codes
of the levels of factors under which it was obtained. For example, if
rats are categorized by age and are further categorized by the length
of their tails, the contingency data would look like:
young long 12
old long 10
young short 30
old short 2
young short 3
Note that the cell frequencies for young short-tailed rats add up to
33.
The most important assumption for the input data is that all
frequencies are independent. For example, each subject in an
experiment must contribute one and only one count to one cell. Also,
if more than a small percentage of expected cell frequencies are
small, then the chi-square statistic may be invalid. A text should be
consulted.
Output
_▶08◀c_▶08◀o_▶08◀n_▶08◀t_▶08◀a_▶08◀b prints a summary of the names and number of levels of factors.
For main effects, a frequency table and significance test of deviation
from equal cell frequencies is printed. For each two-way interaction,
a contingency table and test for independence is printed. Yates'
correction for continuity is applied whenever there is one degree of
freedom. For total cell frequencies in 2x2 tables up to 100, the
Fisher exact test is computed for both one and two tailed cases. A
text should be used to interpret the output statistics.
ALGORITHM
The calculation of chi-square is standard for designs with more than
one degree of freedom and adequate expected cell frequencies.
Different texts have different methods for corrections for small
designs. The methods used here reflect several sources.
When there is one degree of freedom, small cell frequencies can bias
the chi-square test. Yates' correction for continuity reduces the
absolute differences of obtained and expected frequencies by up to
0.5, following the discussion by Fisher (1970) in ``Statistical
Methods for Research Workers.'' The Fisher Exact test for 2x2 tables
is drawn from Bradley (1968) ``Distribution-Free Statistical Tests.''
The two-tailed calculation has only been tested against Bradley's one
example.
FILES
UNIX /tmp/contab.????
MSDOS contab.tmp
STATUS
This is the second version of the program. Later versions will have
tests of association for contingency tables with greater than 2
dimensions. The ability to supply expected frequencies may be added
if there is demand. Suggestions are welcome.
LIMITS
Use the -L option to determine the program limits.