Is there a way to separate highly similar strains from metatranscriptomic data using HISAT2?
1
1
Entering edit mode
3.7 years ago
sheinsch ▴ 10

I have metatranscriptomic data for several communities. When I use HISAT2 to align the reads from some of the pairings to a single genome I typically get near 50% of the reads aligning. There are a couple cases where near 100% of the reads align to a single genome. In these cases I am assuming that the genomes are very similar. Unfortunately I only have the whole genome sequence for one of the strains.

Is there a way to limit the amount of reads aligning to the wrong genome in HISAT2? My instinct is to increase the stringency of the alignment using the --score-min option. However I would like to hear from the community in case there is a more commonly used solution.

RNA-Seq HISAT2 metatranscriptomics • 877 views
ADD COMMENT
0
Entering edit mode

highly similar strains

You have noted the main problem yourself. There are no magic bullets/programs here especially since one is looking at relatively small snippets of sequence that could be very similar.

There is BBSplit from BBMap suite which can allow fine grained control over how reads are binned/classified that you can try but at the end of the day limitations imposed by technology are going to prevail.

ADD REPLY
0
Entering edit mode
3.7 years ago
colindaven ★ 3.1k

First, why use HISAT2 ? Are these prokaryotes or pico-eukaryotes ? If prokaryotes I would not use a spliced aligner.

Always align to all genomes at once, including contaminants (human?). Maybe you can find a draft assembly of the new genome as well.

I once created a file like this, which was useful to some users (and still is to us). http://genomics1.mh-hannover.de/genometa/

https://docs.google.com/open?id=0B-ZVOKUcgOHRakRrb0hqSWlvT3M

ADD COMMENT

Login before adding your answer.

Traffic: 1607 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6