Question: Creating genome alignment
gravatar for dec986
6 months ago by
United States
dec986110 wrote:


I am confused about the difference between GRCh37.p13.genome.fa (767 MB, and GRCh37.primary_assembly.genome.fa (830 MB which is lifted from Gencode release 27. I want to use the most recent version of GRCh37, with the most corrections/updates/etc. I can't use GRCh38 because of alt loci making accurate quantification difficult.

I can see pluses and minuses for each choice. which genome should I be using for STAR genome alignments?

Perhaps there is a version of GRCh37.p13.genome.fa which is lifted or related to release 27?

rna-seq star • 288 views
ADD COMMENTlink modified 6 months ago by lshepard130 • written 6 months ago by dec986110

Heng Li has a blog post on which human genome to use.

ADD REPLYlink written 6 months ago by genomax54k
gravatar for lshepard
6 months ago by
United States
lshepard130 wrote:

From the Gencode website:

"Primary assembly: Nucleotide sequence of the GRCh38 (or GRCh37 if you want that) primary genome assembly (chromosomes and scaffolds) The sequence region names are the same as in the GTF/GFF3 files"

The larger file contains all regions including assembly patches and haplotypes. Normally, most people choose primary assembly. But to note, using the latest version (GRCh38) shouldn't really give you any issues with STAR and downstream analysis.

ADD COMMENTlink modified 6 months ago • written 6 months ago by lshepard130
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 977 users visited in the last hour