Question: Is there a method or HMM set to quickly determine if a sequence has a tRNA sequence for a large number of sequences/reads?
gravatar for O.rka
9 months ago by
O.rka210 wrote:

I am looking at over 10 million reads in the form of fasta. I want to determine if each sequence has a portion of a tRNA sequence. I've been using tRNAscan-SE but this takes a very long time for a large number of reads since it was designed for contigs.

Scanning for HMMs is very quick. Is there a tool that does this or a HMM set that I can use to say "read_X has or does not have a tRNA hit"

I looked in the source code of tRNAscan-SE and didn't notice any HMMs.

trna sequence • 175 views
ADD COMMENTlink modified 9 months ago by Mensur Dlakic6.4k • written 9 months ago by O.rka210
gravatar for Mensur Dlakic
9 months ago by
Mensur Dlakic6.4k
Mensur Dlakic6.4k wrote:

tRNAscan-SE uses covariance models, which one can think of as HMMs that simultaneously model and score both primary sequence and secondary structure. This is why tRNAscan-SE is slow, but that's the price of extra calculation to get maximum sensitivity.

The fact that tRNA requires base-pairing between relatively distant portions of the sequence means that any tool scanning short sequences will miss secondary structure interactions. Still, there seems to be one:

It includes a covariance model for tRNA, but be warned that it will not be speedy. I suspect that any increase in speed will be at the expense of specificity.

ADD COMMENTlink written 9 months ago by Mensur Dlakic6.4k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1638 users visited in the last hour