Increase in power by obtaining 10 or more controls per case when type-1 error is small in large-scale association studies.

Katki, HA; Berndt, SI; Machiela, MJ; Stewart, DR; Garcia-Closas, M; Kim, J; Shi, J; Yu, K; Rothman, N

View/Open

s12874-023-01973-x.pdf (1.768Mb)

Date

2023-06-29

ICR Author

Garcia-Closas, Montserrat

Author

Katki, HA

Berndt, SI

Machiela, MJ

Stewart, DR

Garcia-Closas, M

Show all

Type

Journal Article

Metadata

Show full item record

Abstract

BACKGROUND: The rule of thumb that there is little gain in statistical power by obtaining more than 4 controls per case, is based on type-1 error α = 0.05. However, association studies that evaluate thousands or millions of associations use smaller α and may have access to plentiful controls. We investigate power gains, and reductions in p-values, when increasing well beyond 4 controls per case, for small α. METHODS: We calculate the power, the median expected p-value, and the minimum detectable odds-ratio (OR), as a function of the number of controls/case, as α decreases. RESULTS: As α decreases, at each ratio of controls per case, the increase in power is larger than for α = 0.05. For α between 10-6 and 10-9 (typical for thousands or millions of associations), increasing from 4 controls per case to 10-50 controls per case increases power. For example, a study with power = 0.2 (α = 5 × 10-8) with 1 control/case has power = 0.65 with 4 controls/case, but with 10 controls/case has power = 0.78, and with 50 controls/case has power = 0.84. For situations where obtaining more than 4 controls per case provides small increases in power beyond 0.9 (at small α), the expected p-value can decrease by orders-of-magnitude below α. Increasing from 1 to 4 controls/case reduces the minimum detectable OR toward the null by 20.9%, and from 4 to 50 controls/case reduces by an additional 9.7%, a result which applies regardless of α and hence also applies to "regular" α = 0.05 epidemiology. CONCLUSIONS: At small α, versus 4 controls/case, recruiting 10 or more controls/cases can increase power, reduce the expected p-value by 1-2 orders of magnitude, and meaningfully reduce the minimum detectable OR. These benefits of increasing the controls/case ratio increase as the number of cases increases, although the amount of benefit depends on exposure frequencies and true OR. Provided that controls are comparable to cases, our findings suggest greater sharing of comparable controls in large-scale association studies.

Subject

Science & Technology

Life Sciences & Biomedicine

Health Care Sciences & Services

Control selection

Multiple comparisons

Study design

Molecular epidemiology

GENOME-WIDE ASSOCIATION

MULTIPLE CONTROLS

SUSCEPTIBILITY

METAANALYSIS

INDIVIDUALS

REPLICATION

EFFICIENCY

JOINT

RISK

LOCI

Research team

Integrative Cancer Epidem

Language

eng

Date accepted

2023-06-10

License start date

2023-06-29

Citation

BMC Medical Research Methodology, 2023, 23 (1),

Publisher

BMC

Except where otherwise noted, this item's license is described as https://creativecommons.org/publicdomain/zero/1.0/

Publications Repository

Publications Repository