Structural variants identified from same reads / assembly
0
0
Entering edit mode
14 months ago
pablo ▴ 300

Hello,

I did an assembly with Pacbio Hifi reads. The metrics are very well.

I decided to do a SV calling with pbsv tool : I aligned the initial reads on this assembly with minimap2 , and I detected many SVs (insertions, deletions, translocations...). It is a pretty small diploïd fungi genome, easy to deal with.

How can I detect these kind of SVs with reads who served to build the assembly. Should it be "linear" , I mean with no variant? The alignment of the reads should give a perfect alignment, without any variation?

For example : if I detect a deletion, and if the proportion of initial reads at this position is 50/50 (50% of the reads with the deletion, 50% without because of the diploïdy) , is the assembly collapse here without the deletion? And so, the deletion is detected with the 50% of the remaining reads during the SV calling?

Best

variants insertion deletion structural assembly • 622 views
ADD COMMENT
0
Entering edit mode

Given your genome is diploid, and genome assemblies tend to collapse all information into a single linear molecule to represent an assembly, finding SVs could mean a few things:

  1. The SVs are located only on one molecule which is not represented in your final assembly. i.e., if loci X is on chr 1 in one of the two sets of chromosomes, but is on chr 2 in the other set, it will be called as a a deletion, translocation, or duplication depending on where loci X is assembled.
  2. You give no indication of whether your genome assembly is considered complete. If not, then any breaks in contiguity may be a result of SVs which are accurately being called.
  3. I don't know how pbsv calls breakpoints, but if tandem repeats were collapsed in your assembly, then duplicates may be called at that loci if coverage information is included in the SV call.
  4. The genome assembly is incorrect in places.
ADD REPLY
0
Entering edit mode

Thanks for you feedback :

  1. I did a phased assembly with hifiasm. Then, I ran svim-asm to call the variations : when I align the two phases on the haploid version, I noticed one of the phases has an insertion at a speficic loci, the other has not (confirmed with IGV). I also aligned the initial reads, and there is a 50/50 proportion of them which have this deletion. So, the deletion is collapsed during the assembly step.
  2. My assembly is kind of perfect : 1 contig ~ 1 chromosome.
ADD REPLY

Login before adding your answer.

Traffic: 1769 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6