Question: Creating genome alignment
gravatar for dec986
12 months ago by
United States
dec986130 wrote:


I am confused about the difference between GRCh37.p13.genome.fa (767 MB, and GRCh37.primary_assembly.genome.fa (830 MB which is lifted from Gencode release 27. I want to use the most recent version of GRCh37, with the most corrections/updates/etc. I can't use GRCh38 because of alt loci making accurate quantification difficult.

I can see pluses and minuses for each choice. which genome should I be using for STAR genome alignments?

Perhaps there is a version of GRCh37.p13.genome.fa which is lifted or related to release 27?

rna-seq star • 387 views
ADD COMMENTlink modified 12 months ago by lshepard140 • written 12 months ago by dec986130

Heng Li has a blog post on which human genome to use.

ADD REPLYlink written 12 months ago by genomax62k
gravatar for lshepard
12 months ago by
United States
lshepard140 wrote:

From the Gencode website:

"Primary assembly: Nucleotide sequence of the GRCh38 (or GRCh37 if you want that) primary genome assembly (chromosomes and scaffolds) The sequence region names are the same as in the GTF/GFF3 files"

The larger file contains all regions including assembly patches and haplotypes. Normally, most people choose primary assembly. But to note, using the latest version (GRCh38) shouldn't really give you any issues with STAR and downstream analysis.

ADD COMMENTlink modified 12 months ago • written 12 months ago by lshepard140
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 766 users visited in the last hour