Question: Human reference files in HG38 GATK resource bundle
0
gravatar for win
3.0 years ago by
win860
India
win860 wrote:

hi all, maybe i missed something obvious in the GATK resource bundle (cloud bucket) at the following site.

https://console.cloud.google.com/storage/browser/genomics-public-data/resources/broad/hg38/v0?pli=1

they have this

Homo_sapiens_assembly38.fasta

file but it's

.sa

index file is missing. Whereas there is another file

Homo_sapiens_assembly38.fasta.64.sa

Is this the sa file for the alt reference assembly? If so why is the sa file for the main assembly not show?

which assembly is to be used for exome analysis?

thanks in advance.

vcf • 3.5k views
ADD COMMENTlink written 3.0 years ago by win860

Perhaps you just need to symlink Homo_sapiens_assembly38.fasta to Homo_sapiens_assembly38.fasta.64.

ADD REPLYlink written 3.0 years ago by GenoMax94k

Hum... .what kind of 'index' do you expect ? samtools ? bwa ?

ADD REPLYlink written 3.0 years ago by Pierre Lindenbaum133k

i was expecting bwa indices

ADD REPLYlink written 3.0 years ago by win860

The files with .64.sa/.ann/.bwt/.pac are the bwa index files.

ADD REPLYlink modified 3.0 years ago • written 3.0 years ago by GenoMax94k

So what is the significance of the .64 in the fasta file name?

ADD REPLYlink written 3.0 years ago by win860

I had the same confusion. Why add .86? How is it different?

ADD REPLYlink written 8 weeks ago by field65410
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1464 users visited in the last hour
_