Human reference files in HG38 GATK resource bundle
0
0
Entering edit mode
6.2 years ago
win ▴ 970

hi all, maybe i missed something obvious in the GATK resource bundle (cloud bucket) at the following site.

https://console.cloud.google.com/storage/browser/genomics-public-data/resources/broad/hg38/v0?pli=1

they have this

Homo_sapiens_assembly38.fasta

file but it's

.sa

index file is missing. Whereas there is another file

Homo_sapiens_assembly38.fasta.64.sa

Is this the sa file for the alt reference assembly? If so why is the sa file for the main assembly not show?

which assembly is to be used for exome analysis?

thanks in advance.

VCF • 6.3k views
ADD COMMENT
2
Entering edit mode

Run samtools faidx Homo_sapiens_assembly38.fasta

ADD REPLY
0
Entering edit mode

Perhaps you just need to symlink Homo_sapiens_assembly38.fasta to Homo_sapiens_assembly38.fasta.64.

ADD REPLY
0
Entering edit mode

Hum... .what kind of 'index' do you expect ? samtools ? bwa ?

ADD REPLY
0
Entering edit mode

i was expecting bwa indices

ADD REPLY
0
Entering edit mode

The files with .64.sa/.ann/.bwt/.pac are the bwa index files.

ADD REPLY
1
Entering edit mode

So what is the significance of the .64 in the fasta file name?

ADD REPLY
0
Entering edit mode

I had the same confusion. Why add .86? How is it different?

ADD REPLY

Login before adding your answer.

Traffic: 2420 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6