Question: Quicker way to know if a SNV is involved in a mismatch alignment
gravatar for MarVi
8 days ago by
MarVi20 wrote:

Dear all,

I have an advice to ask. I have a collection of alignments, of which I want to know if the mismatches found in the reads when comparing with the reference sequence (genome) are due to an SNV. I have all the SNV's noted down and stored per chromosome in python dictionaries. However, the process of loading the dictionary (cPickle) for the current chromosome dictionary being researched takes a long time. Do you have any suggestions on how to make this process faster in the python, to look up a position on the chromosome if there is an SNV involved in that position?

Thanks in advance! Hope everyone is fine!

ADD COMMENTlink modified 8 days ago by JC11k • written 8 days ago by MarVi20
gravatar for JC
8 days ago by
JC11k wrote:

There are multiple options:

  • Convert your SNV in a VCFs, sort and index with Tabix, you can read/call from python with pyvcf or similar packages
  • Save your data in a real database (postgres, mysql, mongodb, ...) and use a db connector in python
  • Reorganize your data, one table per chromosome is not optimal as many chromosomes are large
ADD COMMENTlink written 8 days ago by JC11k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1569 users visited in the last hour