Question: Reads aligning in unstranded RNA-Seq library
2.2 years ago by
CY470
United States
CY470 wrote:

An unstranded kit does not preserve the strand of origin of the reads in an RNA-Seq library. If a particular read can be aligned to one transcript on one strand and to another transcript on the other strand, how does an aligner such as STAR handle this? Does the aligner align the read to both transcripts? Thanks.

strand rna-seq alignment • 1.2k views
modified 2.2 years ago by h.mon29k • written 2.2 years ago by CY470

Are you aligning to the genome or to the transcriptome? Reads that align equally well to multiple locations will typically be reported at each of those locations (multi-mapping reads). Most commonly, the read counting step afterwards will ignore these reads.
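To illustrate the "ignore multi-mapping reads" step, here is a minimal sketch that keeps only uniquely aligned records from SAM-formatted lines. It assumes the aligner writes the standard `NH:i` tag (number of reported alignments for the read). The records below are synthetic, and `num_hits` and `unique_only` are hypothetical helper names, not part of any counting tool.

```python
def num_hits(sam_line):
    """Return the NH:i value (number of reported alignments), or None if the tag is absent."""
    fields = sam_line.rstrip("\n").split("\t")
    # Optional tags start at the 12th column of a SAM record.
    for tag in fields[11:]:
        if tag.startswith("NH:i:"):
            return int(tag[5:])
    return None


def unique_only(sam_lines):
    """Keep alignment records whose read was reported at exactly one location."""
    return [
        line for line in sam_lines
        if not line.startswith("@") and num_hits(line) == 1
    ]


# Synthetic records: read1 aligns once, read2 aligns to two locations.
records = [
    "read1\t0\tchr1\t100\t255\t6M\t*\t0\t0\tTGTGTT\tIIIIII\tNH:i:1\tHI:i:1",
    "read2\t0\tchr1\t500\t3\t6M\t*\t0\t0\tAAATTT\tIIIIII\tNH:i:2\tHI:i:1",
    "read2\t256\tchr2\t900\t3\t6M\t*\t0\t0\tAAATTT\tIIIIII\tNH:i:2\tHI:i:2",
]

kept = unique_only(records)
print([line.split("\t")[0] for line in kept])
```

Real counting programs apply their own rules on top of this, so treat it only as a picture of the idea.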

But I'm not sure if that scenario applies to your question. It seems you have a single genomic location in mind, with two transcripts in the opposite direction. Right?

written 2.2 years ago by WouterDeCoster43k

I am aligning to the genome, say using STAR. For a specific genomic location, if it is an unstranded library, I may get both 'AACACA' and 'TGTGTT'. The sequence at that location on the reference genome is 'TGTGTT'. In this case, the aligner still aligns both reads to the reference, just on different strands, right? Like what igor said below.

written 2.2 years ago by CY470
2.2 years ago by
igor9.8k
United States
igor9.8k wrote:

STAR aligns to the genome. The genome FASTA technically contains only one strand. The strand information is based on whether the read matches the FASTA sequence as written or its reverse complement.

For example, two reads AAAT and ATTT (the reverse complement of AAAT) would align to the same place, but in different orientations. Thus, they are on different strands.
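A small sketch of that idea, assuming exact matching against a made-up reference string (a real aligner such as STAR uses an index and tolerates mismatches and splicing, so this only illustrates how the strand is decided). The `locate` function and the reference sequence are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Complement each base, then reverse the sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def locate(read, reference):
    """Return (0-based position, strand) of the first exact match, or None."""
    pos = reference.find(read)
    if pos != -1:
        return pos, "+"  # read matches the FASTA sequence as written
    pos = reference.find(reverse_complement(read))
    if pos != -1:
        return pos, "-"  # read matches the reverse complement
    return None


reference = "GGCCAAATGGCC"  # made-up single-strand reference

print(locate("AAAT", reference))  # (4, '+')
print(locate("ATTT", reference))  # (4, '-'): ATTT is the reverse complement of AAAT
```

Both reads land on the same coordinates; only the reported orientation differs.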

2.2 years ago by
h.mon29k
Brazil
h.mon29k wrote:

STAR will map the reads to the genome; the strandedness of the library has no influence whatsoever on the alignment. The difference arises at the quantification step: when a read overlaps more than one feature, most programs (e.g. HTSeq) will either ignore the read or count it multiple times, depending on the settings you choose.
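To make the quantification difference concrete, here is a toy sketch of overlap-based counting in the "ignore ambiguous reads" style. It is not HTSeq's implementation. The two genes, their coordinates (half-open intervals), and the reads are all made up, and the stranded case assumes the read strand equals the gene strand, which depends on the library protocol.

```python
from collections import Counter

# Hypothetical annotation: two genes on opposite strands, overlapping at 150-200.
GENES = [
    ("geneA", 100, 200, "+"),
    ("geneB", 150, 250, "-"),
]


def assign(read_start, read_end, read_strand, stranded):
    """Assign a read to a gene, or to a special category if that is not possible."""
    hits = [
        name
        for name, start, end, strand in GENES
        if read_start < end and start < read_end          # intervals overlap
        and (not stranded or strand == read_strand)        # strand check only if stranded
    ]
    if len(hits) == 1:
        return hits[0]
    return "__ambiguous" if hits else "__no_feature"


# Made-up alignments: (start, end, strand)
reads = [(160, 190, "+"), (160, 190, "-"), (110, 140, "-")]

for stranded in (False, True):
    counts = Counter(assign(s, e, st, stranded) for s, e, st in reads)
    print("stranded" if stranded else "unstranded", dict(counts))
```

In the unstranded run, the two reads inside the shared region cannot be assigned to either gene and are set aside as ambiguous; in the stranded run, they are split between geneA and geneB by orientation.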

If you are aligning to / quantifying against the transcriptome (with Salmon or kallisto), the read counts will be apportioned by an EM algorithm among the isoforms / overlapping features each read is compatible with.
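As a rough picture of what "apportioned by an EM algorithm" means, here is a heavily simplified sketch. It assumes all transcripts have the same effective length and ignores the bias and fragment-length models that real tools use, so it is not how Salmon or kallisto are actually implemented. The function name and the input data are hypothetical.

```python
def em_abundances(compat, n_transcripts, n_iter=50):
    """Toy EM: compat is a list with, for each read, the transcript indices it is compatible with."""
    theta = [1.0 / n_transcripts] * n_transcripts  # start from uniform abundances
    expected = [0.0] * n_transcripts
    for _ in range(n_iter):
        # E-step: split each read among its compatible transcripts
        # in proportion to the current abundance estimates.
        expected = [0.0] * n_transcripts
        for transcripts in compat:
            total = sum(theta[t] for t in transcripts)
            for t in transcripts:
                expected[t] += theta[t] / total
        # M-step: new abundance = share of all reads assigned to the transcript.
        theta = [count / len(compat) for count in expected]
    return theta, expected


# Made-up data: 3 reads unique to transcript 0, 1 unique to transcript 1,
# and 4 reads compatible with both.
compat = [[0]] * 3 + [[1]] + [[0, 1]] * 4

theta, expected = em_abundances(compat, n_transcripts=2)
print([round(x, 3) for x in theta])     # approximately [0.75, 0.25]
print([round(x, 3) for x in expected])  # approximately [6.0, 2.0]
```

The shared reads end up divided 3:1, following the evidence from the uniquely assigned reads, rather than being discarded or counted twice.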

If you want a more specific answer, please ask a more specific question, or edit your question to provide more details.
