Question

Snpcalling with replicates

0

Entering edit mode

3.7 years ago

eyonesi ▴ 60

Hello everybody I'm running snpcalling on RNASeq data. My data are related to 6 sample including two conditions, control and treatment with 3 replicates for each. I want to use the star aligner. I am confused at alignment step. Do I need to align three replicate of each condition with genome reference in one run and create one BAM file for each condition? then, I assign reads of each BAM file (3 replicate of control or treatment) to a specific read group with AddOrReplaceReadGroups (picardtools). So, I will achieve one vcf file for each condition and then I have to compare them.

OR

DO i need to align each replicate (Including 2 fastq file) with my reference genome and create a BAM file for each replicate. then, should I assign reads of each BAM files (each replicate) to a specific reads group with AddOrReplaceReadGroups. next step, should I merge vcf files related to the replicate of each condition. Eventually I get two vcf files, each of which is the result of merging the vcfs of replicates.

Thank you

SNP • 858 views

ADD COMMENT • link updated 3.7 years ago by hafiz.talhamalik ▴ 350 • written 3.7 years ago by eyonesi ▴ 60

0

Entering edit mode

with 3 replicates for each

Are those technical replicates, originating from the same individual or cell line?

ADD REPLY • link 3.7 years ago by WouterDeCoster 47k

score 0 · Answer 1 · 2020-08-07

0

Entering edit mode

3.7 years ago

hafiz.talhamalik ▴ 350

You should map individual sample and then make a combined vcf file. Its always good to make merged vcf file using from bam files rather than making individual vcf files and then merging it.

ADD COMMENT • link 3.7 years ago by hafiz.talhamalik ▴ 350

0

Entering edit mode

I'm not sure if this is good advice. If the replicates are from the same individual, combining them will increase the coverage and as such be helpful for calling variants. Especially in RNA-seq the coverage distribution is very uneven, and for lowly abundant genes this can make a big difference to have 3x the coverage.

ADD REPLY • link 3.7 years ago by WouterDeCoster 47k

0

Entering edit mode

thank you for your answer

they are biological replicates, do you mean that I create a bam file from my three biological replicates? Do I define them as a reads group?

ADD REPLY • link 3.7 years ago by eyonesi ▴ 60

0

Entering edit mode

These samples do not come from the same individual? So you expect them to have different variants? In that case just align everything separately.

ADD REPLY • link 3.7 years ago by WouterDeCoster 47k