Hi, I have difficulty in calculation for average insert size and standard deviation of RNA-seq NGS data to submit GEO database. I generated two fastq files of paired end read(Read1, Read2) using illumina NGS sequencer. And, I used bacteria genome as reference which has 3 chromosomes. For analysis, I used bowtie2 pipeline to align paired end data to each chromosomes. To get average insert size, such as 'samtools stats' is good using alignment. But I think using generated each sam file ,which is divided from fastq data, for calculation is not accurate. Is there any way to get the average insert size and standard deviation from total fastq? thank you in advance.
Question: How to determine the total average insert size from differently divided from one fastq data pool to each reference?
0
bioinfo • 10 wrote:
0
GenoMax ♦ 95k wrote:
Use instructions in this post using BBMap suite: C: Target fragment size versus final insert size
There are two methods to do this. Either by alignment to a reference or by merging (if R1/R2 reads show sequence overlap).
Please log in to add an answer.
Use of this site constitutes acceptance of our User
Agreement
and Privacy
Policy.
Powered by Biostar
version 2.3.0
Traffic: 1807 users visited in the last hour