Question: Multiple Sequence Alignment of Smaller Groups Within a Larger File
2.7 years ago, newsome6 wrote:


I'm trying to align small groups of RNA-Seq reads within a larger FASTQ file. I need a separate alignment for each group. I can mark the read groups with an ID number or something similar, but it's not feasible to split the groups into their own files, since there are around 10^6 of them. I only have one reference sequence to align to, and there isn't any alternative splicing.

Is there a tool I can use for this? I've looked at the manuals for EMBOSS and Clustal O, but they don't seem to have anything appropriate. BWA has the "-R" option for setting the read group ID, but I think that only affects the output.

Thanks in advance for any help.

rna-seq alignment • 664 views

You could use one of the BBMap tools (search this page for BBMap to see usage) or the faSomeRecords utility from Jim Kent. Either will let you pull subsets of records out of your large file on demand.
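
For illustration only, here is a minimal Python sketch of the same idea: streaming one large FASTQ file and handing back one group of reads at a time, without writing a file per group. It is not one of the tools named above. It assumes a hypothetical header convention (`group=<ID>` somewhere in the read header), that each record is exactly four lines, and that reads from the same group are adjacent in the file; all function names are made up for this example.

```python
import io
import re
from itertools import groupby

# ASSUMPTION: the group ID is stored in the header as "group=<ID>".
# This convention is hypothetical; adapt the pattern to your own headers.
GROUP_RE = re.compile(r"group=(\S+)")


def iter_fastq(handle):
    """Yield (header, sequence, quality) from a 4-line-per-record FASTQ stream."""
    while True:
        header = handle.readline().rstrip("\n")
        if not header:
            return  # end of file (or a blank line)
        seq = handle.readline().rstrip("\n")
        handle.readline()  # '+' separator line, ignored
        qual = handle.readline().rstrip("\n")
        yield header, seq, qual


def group_id(record):
    """Extract the group ID from a record's header, or None if absent."""
    match = GROUP_RE.search(record[0])
    return match.group(1) if match else None


def iter_groups(handle):
    """Yield (group_id, records) for each run of consecutive records sharing an ID.

    ASSUMPTION: the file is ordered so that each group's reads are adjacent.
    """
    for gid, records in groupby(iter_fastq(handle), key=group_id):
        yield gid, list(records)


def to_fastq(records):
    """Serialize records back to FASTQ text, e.g. to pipe into an aligner."""
    return "".join(f"{h}\n{s}\n+\n{q}\n" for h, s, q in records)


if __name__ == "__main__":
    demo = "@r1 group=A\nACGT\n+\nIIII\n@r2 group=B\nTTTT\n+\nJJJJ\n"
    for gid, recs in iter_groups(io.StringIO(demo)):
        print(gid, len(recs))
```

Only one group is held in memory at a time, so each group's FASTQ text from `to_fastq` could be passed to an aligner that reads from standard input, avoiding the roughly 10^6 intermediate files. If the groups are not contiguous in the file, the file would need to be sorted by group ID first.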

written 2.7 years ago by genomax

I will try that, thank you so much!

written 2.7 years ago by newsome6