Binomial test

 

1 Bi-level Categorical Dependent Variable

0 Independent Variables

 

A one sample binomial test measures whether the proportion of two categorical dependent variables significantly differs from a hypothesized proportion. [13]

 

A 1989 study performed by William Cole uses the binomial test to measure the results of an information visualization experiment [1]. Cole began by demonstrating humans have difficulty performing Bayesian reasoning. To illustrate the problem, Cole cites a classic case requiring Bayesian reasoning. The case is as follows: (1) physicians correctly diagnose a given disease 95% of the time, (2) 90% of the time, physicians correctly rule out the given disease when it is not present, and (3) the disease is present in one person out of a thousand. Given this example, what is the probability a person, chosen at random, who tests positive for the disease, actually has the disease? Most people, including Harvard physicians, answer incorrectly, estimating the probability at 90% or higher, when the correct probability is actually closer to 1%. Cole developed visual displays he believed would help humans to better comprehend Bayesian reasoning problems such as this one.

 

In the experiment, test subjects, all of who demonstrate difficulty performing a Bayesian reasoning task in a pre-test, learn to use Cole’s Bayesian reasoning visual display. After using the visual display, subjects take a second Bayesian reasoning test. Cole uses the binomial test to measure whether the subject’s score on the post-test improves after using the visual display, and finds it does.

 

The post-test had just one question, and there were only 2 possible answers to the question, one right and one wrong. Cole tested whether subjects who learned to use his graphical visualization display chose the correct answer more than 33% of the time, where 33% was the binomial test hypothesized value.

 

Cole’s study demonstrates a valid use of the binomial test. First, the dependent variable, the post-test score, was expressed in categorical terms; either the subject got the answer right or wrong. Further, the dependent variable was recorded as a proportion (percent right versus wrong) and compared to a hypothetical proportion. One piece of information Cole does not report, but should have, was the reason he chose 33% as his hypothesized proportion. The lack of explanation does not invalidate the test, but without it, the reader is unsure what claim can be made with respect to the test results. Cole’s experiment is a good example of why the binomial test is not used very often in HCI research; there aren’t very many situations where researchers need to compare a dichotomous dependent variable against a hypothesized value.

Values to report:

·         the hypothesized value, and why it was chosen

·         percent of values satisfying the condition (e.g. actual percentage of correct answers)

·         p value