Multiple Sequence Alignment of Smaller Groups Within a Larger File
7.7 years ago
newsome6 • 0

Hello,

I'm trying to align small groups of RNA-Seq reads within a larger FASTQ file. I need a separate alignment for each group. I can mark the read groups with an ID number or something similar, but splitting the groups into their own files isn't feasible, since there are around 10^6 groups. I have only one reference sequence to align against, and there isn't any alternative splicing.

Is there something that I can use for this? I've looked at the manuals for EMBOSS and Clustal O, but they don't seem to have anything appropriate. BWA has the option "-R" for setting the read group id, but I think that's just for output.

Thanks in advance for any help.

alignment RNA-Seq • 1.2k views

You could use filterbyname.sh from BBMap (search for that name on this site to see usage) or the faSomeRecords utility from Jim Kent. Both will let you pull subsets of records out of your large file on demand.
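If pulling subsets out one at a time turns out to be too slow for ~10^6 groups, another option is to walk the file once and hand each group to the aligner as it comes up. Below is a minimal sketch of that idea in plain Python, not a tested pipeline. It assumes plain 4-line FASTQ records, that reads of the same group sit next to each other in the file, and that the group ID is the part of the read name before the first underscore; that naming scheme and the function names `read_fastq`, `group_id` and `iter_groups` are made up for illustration.

```python
import io
from itertools import groupby, islice


def read_fastq(handle):
    """Yield (name, sequence, quality) from a 4-line-per-record FASTQ stream."""
    while True:
        block = [line.rstrip("\n") for line in islice(handle, 4)]
        if len(block) < 4:
            return  # end of file (or a truncated final record)
        header, seq, _plus, qual = block
        yield header[1:], seq, qual  # drop the leading "@"


def group_id(record):
    # Assumed (hypothetical) naming convention: "<group>_<read>", e.g. "g1_r7".
    return record[0].split("_", 1)[0]


def iter_groups(handle):
    """Yield (group_id, [records]) for each run of consecutive same-group reads."""
    for gid, records in groupby(read_fastq(handle), key=group_id):
        yield gid, list(records)


# Tiny in-memory example standing in for the real FASTQ file.
demo = io.StringIO(
    "@g1_r1\nACGT\n+\nIIII\n"
    "@g1_r2\nACGA\n+\nIIII\n"
    "@g2_r1\nTTGC\n+\nIIII\n"
)
groups = list(iter_groups(demo))
for gid, records in groups:
    print(gid, len(records))
```

Each yielded group could then be written to a temporary file or piped to whatever aligner you choose. If the reads are not already grouped together, `groupby` will split a group into several runs, so the file would need to be sorted by group ID first.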


I will try that, thank you so much!
