Question: Shorter Sequences In The Human Genomic DNA Download From The UCSC Genome Browser Website
4.2 years ago by
nadarajanvelu10 wrote:


I downloaded the human genomic DNA from the UCSC Genome Browser website using the hg38.chromFa.tar.gz link. After extracting the archive, I got all the chromosomal sequences. Alongside chromosome 1 as chr1.fa, I also see several shorter chr1 sequences with names such as chr1_GL383518v1_alt.fa, chr1_KI270706v1_random.fa, and chr1_KI270759v1_alt.fa, to name a few. There are twenty-one such short sequences for chr1 in total. I couldn't find any documentation on the website about these sequences. What are they?


genomic ucsc • 2.1k views
4.2 years ago by
London, UK
Giovanni M Dall'Olio26k wrote:

Alternate loci are a new feature in the latest releases (see ). Random chromosomes were also present in previous releases (see "What do chromosome codes such as 'chr_random' represent?").

4.2 years ago by
Sydney, Australia
Neilfws47k wrote:

The short answer is that these are sequences which, for various reasons, are not included in the current assembly of the chromosome. Some of them cannot be mapped or ordered reliably; others can be mapped but are unfinished. On other chromosomes, e.g. chr6, there are alternative versions arising from different haplotypes (large blocks of the chromosome).

See the data/download FAQ, in particular the section chrN_random tables. You can also view assembly issues at the GRC website.
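If it helps to see how many of each kind you have, here is a minimal Python sketch that groups the extracted file names by chromosome and suffix. The pattern is only inferred from the names quoted in the question (chr1.fa, chr1_GL383518v1_alt.fa, chr1_KI270706v1_random.fa), so treat it as an assumption rather than an official UCSC naming rule; `classify` and `group_files` are hypothetical helper names.

```python
import re
from collections import defaultdict

# Assumed pattern, inferred from the file names quoted in the question:
#   chr1.fa, chr1_GL383518v1_alt.fa, chr1_KI270706v1_random.fa
NAME_RE = re.compile(r"^(chr[^_.]+)(?:_([A-Za-z0-9]+))?(?:_(alt|random))?\.fa$")


def classify(filename):
    """Return (chromosome, category, accession), or None if the name does not match."""
    m = NAME_RE.match(filename)
    if m is None:
        return None
    chrom, accession, suffix = m.groups()
    if accession is None:
        # No accession part: the main assembled chromosome, e.g. chr1.fa
        return chrom, "primary", None
    # Accession present: use the suffix if there is one, else a catch-all label
    return chrom, suffix or "other", accession


def group_files(filenames):
    """Group file names by (chromosome, category); unmatched names are skipped."""
    groups = defaultdict(list)
    for name in filenames:
        result = classify(name)
        if result is not None:
            chrom, category, _ = result
            groups[(chrom, category)].append(name)
    return dict(groups)


if __name__ == "__main__":
    names = [
        "chr1.fa",
        "chr1_GL383518v1_alt.fa",
        "chr1_KI270706v1_random.fa",
        "chr1_KI270759v1_alt.fa",
    ]
    for key, files in sorted(group_files(names).items()):
        print(key, len(files), files)
```

In practice you would pass it the directory listing of the extracted archive (for example from `os.listdir`) instead of the hard-coded list, and then keep only the "primary" files if you do not want the extra sequences.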
