How to apply bam file into assembly sequence

0

Entering edit mode

3.9 years ago

nonaddldy ▴ 10

http://p2.so.qhimgs1.com/bdr/300_115_/t02d3afc4c46284a492.jpg

I used the bam file into UCSC Genome Browser, then I want to assembly these reads into the sequence at chromosome level, what should I do?

I mean how to extract the reads according to the nucleotide order of GRCh38 reference sequence

sequence assembly 1000 human genomes • 1.6k views

ADD COMMENT • link updated 3.7 years ago by Biostar 20 • written 3.9 years ago by nonaddldy ▴ 10

0

Entering edit mode

I want to assembly these reads into the sequence at chromosome level,

it's not clear to me.

ADD REPLY • link 3.9 years ago by Pierre Lindenbaum 163k

0

Entering edit mode

A BAM file typically contains the genome location where individual (short) reads have aligned to. This seems to have been successful in your case as the snapshot shows.

What exactly is it that you're after? Do you simply want a text file with the ACGTs that make up, say, chromosome 1? That would be the reference sequence, i.e. the FASTA file that you used to generate the BAM file in the first place.

ADD REPLY • link 3.9 years ago by Friederike 9.0k

0

Entering edit mode

I want to assembly an individual human chromosome Y rather than reference sequence, so I should sort the reads and output the sequence according the order of reference human Y, while the reads in the bam file are in disorder, thus, my question is how sort the reads aligned to the chromosome Y by the nucleotide order.

ADD REPLY • link 3.9 years ago by nonaddldy ▴ 10

0

Entering edit mode

You can use samtools sort -o sorted.bam input.bam on your aligned BAM. Samtools does co-ordinate sort by default.