I have gotten some RNAseq data and I'm starting to process it. It was carried 3’mRNA library preparation and thus this is a single-end sequencing experiment. It was then sequenced with Illumina Nextseq5000. I'm now trying to understand what I need to do and have a couple of basic questions:
- As this is a 3' sequencing, does this mean that the strandedness of my fastQs is always reverse?
- Despite this, the PCR step still includes both forward and reverse primers. Why is this so considering this is a single-stranded ?
- For each file I have the following, but am confused as to exactly what these mean:
90_L001_R1_001.fastq.gz
90_L002_R1_001.fastq.gz
90_L003_R1_001.fastq.gz
90_L004_R1_001.fastq.gz
- If this had been a paired-end sequencing experiment, I would've gotten two fastQ files per sample. How do you associate them with forward, reverse, or unstranded sequencing?
Thanks a lot in advance
These seem to be only the forward reads of a read pair (R1).
You also need the files containg the mates (R2).Also, does the90
signify your sample? Because then you may have technical replicates here, as theL(...)
may signify, that this sample has been repeatedly sequenced on different lanes?Edit: single-end reads of course don't have mates.
@ponganta, I have re-added my initial point 3. Yes, I believe those are 4 different lanes, as this seems to be the case for flowcells in Nextseq 5000.
90
is indeed the sample, but I'm still trying to puzzle it exactly what it means to my own sample metadata. Should we haveR2
if this is single-end sequencing?