GAVA¶

GAVA is a variant association algorithm, based on the VAAST algorithm. http://www.ncbi.nlm.nih.gov/pubmed/21700766

Computes a p-value for a genomic feature (typically a single gene) using a randomization test, where the affection status of the subjects is permuted and the test statistic computed using a composite likelihood ratio test as described in the VAAST paper.

Note: This is not an exact implementation and for example we have a bailout parameter to avoid spending too much computing time on uninteresting features.

Usage¶

We need to first set the case and control lists, the disease model (regular, dominant, or recessive), and other parameters. Then the procedure is as follows: * Before processing a feature, call initializeGroup() to clear the counts. * Iterate through the data for the feature and for each variant found for some subject, add the call counts using addFeature(). * Calculate the p-value (and a few other values) using calculateValues(). * Repeat the process for the next feature.

gor ... | GAVA maxiterations -casefile <filename> -ctrlfile <filename> [ attributes ]

or:

gor ... | GAVA maxiterations -caselist <subjects> -ctrllist <subjects> [ attributes ]

Options¶

`-recessive`	Use recessive model, i.e. only pick 2 variants per subject.
`-dominant`	Use dominant model, i.e. only pick 1 variant per subject.
`-casepene number`	The case penetrance (locus heterogeneity).
`-ctrlpene number`	The ctrl penetrance.
`-noMaxAlleleCounts`	No max allele count for \| each case in dominant/recessive models.
`-protective`	Include protective alleles.
`-grouping number`	Collapsing threshold for rare variants (default is 5).
`-bailout number`	Bail out of the randomization with this number of hits (default is 10).
`-usePhase`	Use the phase (in recessive modeling).
`-maxAf number`	Upper threshold for the estimated control allele frequency.

Warning

casepene and ctrlpene specify the number of cases and ctrls which DO NOT have to comply with the inheritance model.