Question: SNP calling with replicates
eyonesi wrote, 7 weeks ago:

Hello everybody, I'm running SNP calling on RNA-seq data. My data consist of 6 samples from two conditions, control and treatment, with 3 replicates each. I want to use the STAR aligner, but I am confused at the alignment step. Do I need to align the three replicates of each condition to the reference genome in one run and create one BAM file per condition? Then I would assign the reads of each BAM file (3 replicates of control or treatment) to a specific read group with AddOrReplaceReadGroups (Picard tools). That way I would get one VCF file per condition, and then I would have to compare them.

OR

Do I need to align each replicate (consisting of 2 FASTQ files) to my reference genome and create a BAM file for each replicate? Then, should I assign the reads of each BAM file (each replicate) to a specific read group with AddOrReplaceReadGroups? As the next step, should I merge the VCF files of the replicates of each condition? Eventually I would get two VCF files, each the result of merging the VCFs of the replicates.

Thank you


"with 3 replicates for each"

Are those technical replicates, originating from the same individual or cell line?

Reply written 6 weeks ago by WouterDeCoster
hafiz.talhamalik wrote, 6 weeks ago:

You should map each sample individually and then make a combined VCF file. It's always better to make the merged VCF file directly from the BAM files rather than making individual VCF files and then merging them.
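A minimal sketch of what calling one combined VCF from several BAM files could look like, assuming bcftools is the variant caller (the thread does not name one). The function only builds the shell pipeline as a string and does not run it; the file names `genome.fa`, `*_rep*.bam` and `all_samples.vcf.gz` are hypothetical.

```python
import shlex


def joint_call_command(reference, bams, out_vcf):
    """Build a bcftools pipeline that calls variants jointly from several BAMs."""
    if not bams:
        raise ValueError("need at least one BAM file")
    # mpileup computes genotype likelihoods for all BAMs together (-Ou: uncompressed BCF)
    mpileup = ["bcftools", "mpileup", "-Ou", "-f", reference, *bams]
    # call -m: multiallelic caller, -v: variant sites only, -Oz: bgzipped VCF
    call = ["bcftools", "call", "-mv", "-Oz", "-o", out_vcf]
    return " | ".join(shlex.join(cmd) for cmd in (mpileup, call))


# Hypothetical file names, one BAM per replicate.
bams = [f"{cond}_rep{i}.bam" for cond in ("control", "treatment") for i in (1, 2, 3)]
cmd = joint_call_command("genome.fa", bams, "all_samples.vcf.gz")
print(cmd)
```

For the replicates to show up as separate sample columns in the VCF, each BAM would need its own SM tag in its read group, which is what the AddOrReplaceReadGroups step in the question is for.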


I'm not sure this is good advice. If the replicates are from the same individual, combining them will increase the coverage and thus help variant calling. In RNA-seq especially, the coverage distribution is very uneven, and for lowly expressed genes having 3x the coverage can make a big difference.
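To illustrate the coverage argument, here is a toy sketch with made-up numbers. The depth threshold `MIN_DEPTH` and the per-replicate depths are assumptions chosen for illustration, not values from any real data set or caller default.

```python
MIN_DEPTH = 10  # assumed minimum read depth needed to call a variant at a site

# Hypothetical read depths at one site per gene, for three replicates
# of the same individual.
depths = {
    "highly_expressed_gene": [120, 95, 130],
    "lowly_expressed_gene": [4, 5, 3],
    "barely_expressed_gene": [1, 0, 2],
}


def passes_individually(reps, min_depth=MIN_DEPTH):
    """True if at least one replicate alone reaches the depth threshold."""
    return any(d >= min_depth for d in reps)


def passes_pooled(reps, min_depth=MIN_DEPTH):
    """True if the summed depth of all replicates reaches the threshold."""
    return sum(reps) >= min_depth


for gene, reps in depths.items():
    print(gene, sum(reps), passes_individually(reps), passes_pooled(reps))
```

In this toy case only the lowly expressed gene changes status: no single replicate reaches the threshold, but the pooled depth does. Pooling only makes sense when the replicates really share one genotype.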

Reply written 6 weeks ago by WouterDeCoster

Thank you for your answer.

They are biological replicates. Do you mean that I should create one BAM file from my three biological replicates? Do I define each of them as a read group?

Reply modified 6 weeks ago, written 6 weeks ago by eyonesi

So these samples do not come from the same individual, and you expect them to have different variants? In that case just align everything separately.
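Aligning everything separately could be sketched as below: one STAR run per replicate, with a read group set at alignment time so that each BAM carries its own sample name. The script only builds and prints the commands. The sample names, FASTQ file names, index directory `star_index`, thread count and `PL:ILLUMINA` platform tag are all assumptions for illustration.

```python
import shlex


def star_command(sample, fastq_r1, fastq_r2, genome_dir, threads=8):
    """Build a STAR command line for one paired-end replicate."""
    return [
        "STAR",
        "--runThreadN", str(threads),
        "--genomeDir", genome_dir,
        "--readFilesIn", fastq_r1, fastq_r2,
        "--readFilesCommand", "zcat",  # assumes gzipped FASTQ input
        "--outSAMtype", "BAM", "SortedByCoordinate",
        # Read group with a distinct ID and SM (sample) tag per replicate
        "--outSAMattrRGline", f"ID:{sample}", f"SM:{sample}", "PL:ILLUMINA",
        "--outFileNamePrefix", f"{sample}_",
    ]


# Two conditions with three biological replicates each (hypothetical names).
samples = [f"{cond}_rep{i}" for cond in ("control", "treatment") for i in (1, 2, 3)]
commands = {
    s: star_command(s, f"{s}_R1.fastq.gz", f"{s}_R2.fastq.gz", "star_index")
    for s in samples
}

for cmd in commands.values():
    print(shlex.join(cmd))
```

With the read group written by STAR itself, a separate AddOrReplaceReadGroups pass may not be needed; if the BAMs were produced without one, the Picard step from the question would add the same tags afterwards.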

Reply written 6 weeks ago by WouterDeCoster