Entering edit mode
                    5.2 years ago
        MAPK
        
    
        ★
    
    2.1k
    I am trying to run genomicsdbImport in GATK.
/gatk --java-options "-Xms4G -Xmx32G -DGATK_STACKTRACE_ON_USER_EXCEPTION=true" GenomicsDBImport \
--genomicsdb-workspace-path ${WORKDIR} \
       --batch-size 50 \
       -L chrX \
       --sample-name-map ${mylist.list} \
       --tmp-dir=${tmpPATH} \
       --max-num-intervals-to-import-in-parallel 10 \
       --reader-threads 16
I am supplying a list of gvcfs in the command above. I have gvcf file names with sample_name^Barcode^project_name (mylist.list). With this input, gatk is giving me illegal character error for input gvcf files because of ^. Is there a way to deal with this without renaming the input gvcf files?
This is my mylist.list:
CAP_100       /dir/CAP_100^8036056040^201909_SEY.aln.srt.isec-paddedexome.markd.recal.raw.snps.indels.g.vcf.gz       
CAP_28       /dir/CAP_28^8036056033^201909_SEY.aln.srt.isec-paddedexome.markd.recal.raw.snps.indels.g.vcf.gz       
CAP_2       /dir/CAP_2^8036056474^201909_SEY.aln.srt.isec-paddedexome.markd.recal.raw.snps.indels.g.vcf.gz
This is the error message I am getting:
A USER ERROR has occurred: Malformed URI java.net.URISyntaxException: Illegal character in path at index 49: /dir/CAP_100^8036056040^201909_SEY.aln.srt.isec-paddedexome.markd.recal.raw.snps.indels.g.vcf.gz
You can try escaping the
^characters, like so:Using a caret in a filename was a bad idea. You're better off either renaming the files or creating softlinks with clean names to these files.
Escaping character did not work. I will try with soft links.