Question: Filter single exon reads
0
gravatar for sloux
2.2 years ago by
sloux0
University of Kentucky
sloux0 wrote:

I used Trinity to create a fasta file for my transcriptome, which I then mapped to the genome. Unfortunately, there are a lot of single exon reads in the BAM file which appear to just be junk. Does anyone know of a good way to filter these out? I can do it manually, but I really don't want to do that for a full BAM file.

bam single exon reads • 559 views
ADD COMMENTlink written 2.2 years ago by sloux0
1

there are a lot of single exon reads

What exactly does that mean?

ADD REPLYlink written 2.2 years ago by genomax72k

There are single exon reads I want to remove from my BAM file, i.e. they don't have splice sites. Many of them appear to be pre-mRNA, or intronic reads which I don't want in my final file.

ADD REPLYlink written 2.2 years ago by sloux0
1

Take a look at this: A: Filtering out the spliced reads from bam file. You want the opposite of what is being asked there.

See if reformat.sh in=mapped.bam out=filtered.bam maxdellen=0 does what you need.

ADD REPLYlink modified 2.2 years ago • written 2.2 years ago by genomax72k

My guess would that those reads are DNA contamination in your cDNA library.

ADD REPLYlink written 2.2 years ago by WouterDeCoster41k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2157 users visited in the last hour