-
Notifications
You must be signed in to change notification settings - Fork 3
Flanders fails to lift over X chromosome #113
Description
Hello.
I would like to follow up on the issue of including the X chromosome #86.
I am running Flanders with two datasets that both include the X chromosome, one that requires liftover as it is on build 37 and one that does not as it is on build 38. The dataset that does not require liftover works well, and X chromosome variants are in the resulting loci file. However, in the dataset that does require liftover the X chromosome variants do not appear in the loci or _dataset_aligned.tsv.gz files even though X chromosome variants pass the significance threshold in this dataset. It appears this is caused by the difference in how X chromosome is coded in Flanders (23, as in --chromosomes 1-23 in the input) vs the chain file (http://hgdownload.cse.ucsc.edu/goldenpath/hg19/liftOver/hg19ToHg38.over.chain.gz) where the chr is coded as X.
I think this should be easily fixed by adding a line after line 27 to s01_alpha_sort_alleles_snpid.R where the chain is imported ch <- import.chain(default_chain_file) to grep and replace "X" with "23" in the chain file. Does this sound appropriate? It would also likely be best to update chr Y to 24.
Thanks,
Sophie