Entering edit mode
2.8 years ago
sami ▴ 30
POS VARIANTS REF Allele Variations 99162 DEL/MNP TCGGTGTGCGCGG TCGG 99166 SNP T C
I have on my VCF: DELETION in position 99162 and SNP in position 99166.
It is impossible to have deletion in 99162 and snp in 99166 in the same time. How can i explain that ?? Should i keep snp or deletion in my vcf? Please help me
Thank you for your answer.
Not really. One strand can have a deletion while the other has an SNV.
99162 DEL TCGGTGTGCGCGG ----------TCGG*** (1)
99166 SNP --------T------------------------------------- C** (2)
T became C , but in (1) T is absent (deletion of TGTGCGCGG) : It is possible ???!!
I can't understand how One strand can have a deletion while the other has an SNV in the same position !!!! really ?
Please don't use bold text, it is bad etiquette. My guess is, this is a repeat region where either this sort of deletion/SNV is a possibility, or the caller gets confused enough that it makes ambiguous calls.
I would check the position in IGV - it is usually much more clear to check them visually.
Can't check 1000 VCF.i need solution to filter this situation => Keep del or keep snp !!
check one BAM file, you don't need to check 1000 of them to understand the situation
I have 1000 strain , so 1000 bam different . The situation is that : we have snp and deletion in the same position . So i have to delete one and keep one. I need the solution!
my recommendation remains the same: check 1 variant in 1 sample in IGV and then you'll understand the situation. After that - extrapolate the conclusions for all 1000 samples.
POS------- VARIANTS -----REF ---------------Allele Variations
99162-------DEL/MNP -TCGGTGTGCGCGG ---TCGG
99166--------- SNP ------------- T--------------------- C
WHAT DO YOU THINK ABOUT THIS SITUATION ?? THERE IS CONFLICT OR NO ?
Check it in IGV :) as @RamRS explained it is totally fine for diploid organisms
I work on bacteria, so not diploid
It can be a repetitive region in your bacteria or segmental duplication as well. It may be an assembly problem, not a variant caller problem
assembly! no i do not assembly , just mapping and variant calling . repetitive region mm i don't think because TCGGTGTGCGCGG became TCGG in the variant 1 and 5th T of TCGGTGTGCGCGG became C in the second variant. So we can see that there is no repetitive region in TCGGTGTGCGCGG
can't find 99162-99166 on IGV i must put the name of the chromosome! what is that ! can you explain me ! please
Yes. You need to put chromosome name. e.g.
chr4:99162-99166. That interval is too small so IGV will likely show you a larger interval in window. Make sure the chromosome name is the same as in your IGV genome.
i think there is only one chromosome in bacteria. (i have reference genome .fasta and file.bam and file.bai) so what should i do ? , thank you!
Genome File Error The following files were not loaded ... reference.fasta ERROR: index file must also be selected
Done , i have now my genome and the region 99162-99166 , but i can't see mapped reads,
Hi, we don't need live updates. Please make as much progress as you can and let us know when you really need help.
I have 0 mapped reads in IGV. I can't see reads
Are you looking at the BAM for the sample that had this set of DEL/SNP variants?
Yes , the BAM is good , but IGV show empty BAM (0,0)
You need to get in touch with a bioinformatics expert close to you so they can look at your computer with you. I don't think we can be of any more assistance to you here.
Is your BAM sorted by co-ordinates and indexed?
Yes, I think there is something wrong during variant calling. I whould like to know if i should index my vcf and when ? thank you
BAM and VCF are two different kinds of files. You can visualize both in IGV. If you load your sorted and indexed BAM alignment you can go to that location and inspect all reads there to see the two changes being called in your VCF.
Keep in mind that you are sequencing multiple copies of the genome so different copies may have different mutations in them.
should i index vcf ???
I can't add more comment here :/ the limit of the conversation
Between 2 and 3 you need to sort and index the alignment file before calling SNP.
Final warning: CAPS and bold are bad etiquette. Keep this up and your account will be suspended.