# This file contains information on how to process reference data sets. # dataset - name of data set, this label will be printed. # type - True Positives (TP) and False Positives (FP). # overlap percentages labeled as (Precision, Sensitivity) and (False Discovery Rate, Type I Error) respectively. # - annotation. # file is used for GENCODE annotation of frame shift and non frame shift Indels. # filter - filter applied to variants for this particular data set. # path - path of indexed BCF file. # #dataset type filter path 1000g TP N_ALLELE==2&&VTYPE==INDEL /net/fantasia/home/atks/ref/vt/grch38/1000G.v5.snps.indels.complex.svs.sites.bcf mills TP N_ALLELE==2&&VTYPE==INDEL /net/fantasia/home/atks/ref/vt/grch38/mills.indels.sites.bcf dbsnp TP N_ALLELE==2&&VTYPE==INDEL /net/fantasia/home/atks/ref/vt/grch38/dbSNP138.snps.indels.complex.sites.bcf gencode_v27 cds_annotation . /net/fantasia/home/atks/ref/vt/grch38/gencode.v27.cds.bed.gz dust cplx_annotation . /net/fantasia/home/atks/ref/vt/grch38/mdust.bed.gz rmsk repeat_annotation . /net/fantasia/home/atks/ref/vt/grch38/rmsk.bed.gz