Question: Align short reads to multiple reference sequence
1
gravatar for Sandeep
5.2 years ago by
Sandeep250
Manipal, India
Sandeep250 wrote:

I am trying to align miRNA sequencing reads (very short reads) to multiple fasta sequence reference(~150). I am expecting my reads to align separately against individual fasta sequences. To achieve this I am using SHRiMP aligner with the following command 

SHRiMP_2_2_3/bin/gmapper-ls -N 2 -o 1 -E input.fasta ../reference.fasta > output.sam

The output consists of all reads that are mapped to the all the reference sequences, but each read is mapped only once. So assuming read1 maps against my 1st fasta reference sequence, then it will not report any hit against the other reference sequence.
Is there a way to achieve what I am trying to do.
I have also tried creating index file for my reference sequence and tried aligning using bowtie

bowtie index input.fastq > output

But, even this result in the read aligning only once with the reference.

Is there a parameter I can add to SHRiMP to obtain hits for individual fasta reference?

ADD COMMENTlink modified 5.2 years ago by cts1.6k • written 5.2 years ago by Sandeep250
1

You mean you have a big reference fasta file consisting of 150 sequences and you are aligning your reads against it. As you are using "-o1" parameter , the aligner is reporting the top aligment (maximum alignment score) for reads. I guess if a read is aligned against different fasta sequences with equal alignment score, then one of the alignment is getting reported due to "-o 1" parameter. The best thing to do would be align these reads to every fasta sequence individually. You can create 150 reference indices and align the reads to them. You can then sort all the reads by queryname and then write a script that pick up a read in all sam files and check if it has been mapped to all the reference sequences with equal scores.  

ADD REPLYlink written 5.2 years ago by Ashutosh Pandey11k

I do have 150 reference sequences. I wanted to avoid creating 150 reference sequences and aligning them. I have tried bowtie with -a option.

Will try to align separately also. Thank you.

ADD REPLYlink written 5.2 years ago by Sandeep250
1
gravatar for cts
5.2 years ago by
cts1.6k
Pasadena
cts1.6k wrote:

These programs are designed to give only a single alignment. If you want alignments from all of your query sequences onto your database you could just use blast. Alternatively bowtie can also give you all alignments with the -a option; check out the manual http://bowtie-bio.sourceforge.net/manual.shtml#reporting-modes 

ADD COMMENTlink written 5.2 years ago by cts1.6k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2159 users visited in the last hour