Quicker way to know if a SNV is involved in a mismatch alignment
1
0
Entering edit mode
3.5 years ago
MarVi ▴ 30

Dear all,

I have an advice to ask. I have a collection of alignments, of which I want to know if the mismatches found in the reads when comparing with the reference sequence (genome) are due to an SNV. I have all the SNV's noted down and stored per chromosome in python dictionaries. However, the process of loading the dictionary (cPickle) for the current chromosome dictionary being researched takes a long time. Do you have any suggestions on how to make this process faster in the python, to look up a position on the chromosome if there is an SNV involved in that position?

Thanks in advance! Hope everyone is fine!

python cPickle Dictionary mismatch alignment • 694 views
ADD COMMENT
0
Entering edit mode
3.5 years ago
JC 13k

There are multiple options:

  • Convert your SNV in a VCFs, sort and index with Tabix, you can read/call from python with pyvcf or similar packages
  • Save your data in a real database (postgres, mysql, mongodb, ...) and use a db connector in python
  • Reorganize your data, one table per chromosome is not optimal as many chromosomes are large
ADD COMMENT

Login before adding your answer.

Traffic: 2691 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6