-
Notifications
You must be signed in to change notification settings - Fork 596
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Why we use MIN_DP for DP and then AD of the ref allele at Hom-Ref block derived genotypes. #7185
Comments
A user brought this up recently, and I think it's a good point. I wouldn't want to change the default behavior, but I would be open to adding a new argument to GenotypeGVCFs to use median instead of min. What do you think @droazen ? Or maybe this is a question for @vdauwera and @eitanbanks as GATK product owners. |
@ldgauthier defer to you on this, but agree that it seems confusing/misleading, especially in the case of large ref blocks with highly variable depth |
Agree with adding an option to use median -- we can change the default behavior in GATK 5 ;) |
@vruano can you take on adding the arg and alternate behavior? Should be pretty quick. |
Ok. |
Adds an argument to GenotypeGVCFs to give the option to preserve the original DP, the median across the block, instead of substituing it by the minimum (MIN_DP format) if present for genotypes that come from HomRef Blocks: --keep-median-dp-in-homref-blocks
Question
Why we use MIN_DP over DP for synthetic Ref allele depth for genotypes derived from hom-ref blocks?
Would it make more sense to keep and use the average or median?
Tool(s) or class(es) involved
GenotypeGVCFs
The text was updated successfully, but these errors were encountered: