Soft Masked Version Of Hs37D5 Reference
1
0
Entering edit mode
8.9 years ago

Hi,

I am working with the hs37d5 reference assembly from the 1000 human genome project. For some downstream analysis I would need a FASTA version of the file with repeats soft masked by Repeat Masker as lower case letters. Can I produce such a file or download it already?

Cheers, Daniel

fasta 1000genomes • 3.3k views
ADD COMMENT
0
Entering edit mode

Hi, I am looking for hard-masked hs37d5 reference. Does anybody know where I can get it? Any help is appreciated.

ADD REPLY
0
Entering edit mode
8.9 years ago

you could use the chromosomes of the UCSC :

http://hgdownload.cse.ucsc.edu/goldenpath/hg19/chromosomes/

This directory contains the Feb. 2009 assembly of the human genome (hg19, GRCh37 Genome Reference Consortium Human Reference 37 (GCA_000001405.1))

  • chr*.fa.gz: compressed FASTA sequence of each chromosome.

(...)

Repeats from RepeatMasker and Tandem Repeats Finder (with period of 12 or less) are shown in lower case; non-repeating sequence is shown in upper case.

ADD COMMENT

Login before adding your answer.

Traffic: 1386 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6