Separate Broken Pairs
4
0
Entering edit mode
12.0 years ago
Tom • 0

I am currently preprocessing my paired-end reads and was wondering if there is an efficient way to remove broken pairs from my sequence file.

Thanks for the help.

illumina paired • 3.0k views
ADD COMMENT
0
Entering edit mode

Some more details would be good. What format is your RNA-seq data in and how has it been processed so far? And, how would you like to define "broken pairs"?

ADD REPLY
0
Entering edit mode
12.0 years ago

This post might help: Filtering paired end reads

ADD COMMENT
0
Entering edit mode
12.0 years ago
Tom • 0

It is in fastq format and following QC filtering with fastx I end up single reads that no longer have a pair. It seems that these single reads are messing up my assemblies in velvet. Recently, I tried using sickle to filter and it seems to have resolved the issue.

Thanks for the help.

ADD COMMENT
0
Entering edit mode
12.0 years ago
Ian 6.0k

Picard tools has a handy webpage for working out the meaning of SAM flags (and combinations of flags):

http://picard.sourceforge.net/explain-flags.html

These can be used with 'samtools view -f N', where N is the flag you want to filter your reads with.

E.g. to extract all the properly paired reads:

samtool view -S -f 2 -o proper-paired-reads.sam input.sam
ADD COMMENT
0
Entering edit mode
12.0 years ago
Tom • 0

That is a great tool!

Cheers

ADD COMMENT

Login before adding your answer.

Traffic: 1540 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6