Which hg38 file?
10 months ago
amy__ ▴ 50

Hi,

I need the hg38 reference FASTA file. Does anyone know which download link on this page is the right one? https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/001/405/GCA_000001405.15_GRCh38/seqs_for_alignment_pipelines.ucsc_ids/

Or are these even the correct files?

Thanks, Amy

reference hg38 NCBI • 1.7k views

Hello,

Choose the 5th file from the top of the listing.

Read the README file for your reference.


Thanks @sunnykev97, I did think it was that one! Thanks, Amy


Someone has told me to use GCA_000001405.15_GRCh38_no_alt_analysis_set.fna.gz instead.

So I’m still unsure! I have read the README, but I’m still not sure which file to use for WES germline analysis.


Oh wait, they may be correct: "The no_alt_analysis_set contains the sequences, in FASTA format, of the chromosomes, mitochondrial genome, unlocalized scaffolds, and unplaced scaffolds. The alternate locus scaffolds are omitted because many Next Generation Sequence read alignment pipelines are incompatible with the full assembly model."
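To make those categories concrete, here is a rough sketch of how the sequence names in a UCSC-style GRCh38 FASTA could be sorted into them. It assumes the usual UCSC naming patterns (`_alt` suffix for alternate locus scaffolds, `_random` for unlocalized scaffolds, `chrUn_` prefix for unplaced scaffolds, `_decoy` for decoys in sets that include them); `classify_contig` is a made-up helper name, not part of any tool, so check the patterns against the README for the file you actually download.

```python
# Hypothetical helper: classify a sequence name by UCSC-style naming patterns.
# The patterns are assumptions about the naming scheme, not read from the README.
PRIMARY_CHROMOSOMES = {f"chr{i}" for i in range(1, 23)} | {"chrX", "chrY"}


def classify_contig(name: str) -> str:
    """Return a coarse category for a GRCh38 sequence name."""
    if name.endswith("_alt"):
        return "alt"            # alternate locus scaffold, e.g. chr1_KI270762v1_alt
    if name.endswith("_decoy"):
        return "decoy"          # only present in sets that bundle decoy sequences
    if name.endswith("_random"):
        return "unlocalized"    # chromosome known, position unknown
    if name.startswith("chrUn_"):
        return "unplaced"       # chromosome unknown
    if name == "chrM":
        return "mitochondrial"
    if name in PRIMARY_CHROMOSOMES:
        return "chromosome"
    return "other"              # anything else, e.g. chrEBV


def has_alt_contigs(names) -> bool:
    """True if any name in the iterable looks like an alternate locus scaffold."""
    return any(classify_contig(n) == "alt" for n in names)
```

A file matching the no_alt description should give `has_alt_contigs(...) == False` for its full list of sequence names.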


Well, there are two types of genome assembly:

  1. Primary assembly - the assembly at the chromosome level only (23 + 1 mitogenome in humans).
  2. Secondary assembly - alternate loci information and some unplaced scaffolds. It's good to choose the alternate assembly for more information.

If you like the post, upvote.


No, it’s not "good", as this information requires special alignment procedures that are not trivial and are not implemented in most aligners. It even leads to false alignment results with standard aligners, because reads from these loci come out as multimappers. For most applications, use the one without ALT.
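If you are unsure whether a reference you already downloaded contains ALT contigs, a quick check of the FASTA headers will tell you. This is only a sketch: it assumes UCSC-style names where alternate locus scaffolds end in `_alt`, and `alt_contigs_in_fasta` is a made-up function name.

```python
import gzip


def alt_contigs_in_fasta(path):
    """Return the names of sequences whose header name ends in '_alt'.

    Assumes UCSC-style naming; works on plain or gzip-compressed FASTA.
    """
    opener = gzip.open if str(path).endswith(".gz") else open
    alts = []
    with opener(path, "rt") as handle:
        for line in handle:
            if not line.startswith(">"):
                continue  # skip sequence lines, only headers matter here
            fields = line[1:].split()
            # The sequence name is the first whitespace-delimited token.
            if fields and fields[0].endswith("_alt"):
                alts.append(fields[0])
    return alts
```

An empty list means no `_alt` sequences were found under that naming assumption, which is what you want before running a standard aligner.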


As ATpoint said, ALT information is tricky to deal with. This blog post elaborates nicely on this issue of choosing a good reference genome.


Thank you all, I appreciate the help!


So would you not recommend this tutorial, since it uses GRCh38 with alternate contigs to map reads?
