connsensus file too big to read
0
0
Entering edit mode
6.1 years ago

Hi,

I am trying to use soapsnp.Ion chromosme 22 oh human38.I did all the steps for generating the consensus filr using soap snp to find if there are any snps in the consensus file.The problem is my consensus file is 2.7GB in size which is not opening in computer i.e. my laptop hangs and also it is not possible for me to search for snp sites. Is there any graphical viewer or something of similar sort to view the snp sites in my consensus file?

SNP soap consensus variants • 1.3k views
0
Entering edit mode

You should explain what you are doing specifically. Are you trying to open your sequence in a text editor such as Notepad? If so you'll run out of memory. If you're on Windows there's probably a better solution but if you're on Mac or Linux try using less in a terminal. Google is your friend here.

0
Entering edit mode

How can I locate the snp in such a big consensus file.I even don't know if there exists a snp?

0
Entering edit mode

I think you're supplying some parameters incorrectly because the output format section of the manual suggests a 17-column format where each line is a SNP. The manual is ambiguous and this could be one line for each position on chr22, which would explain your huge file size. If this is the case you could find mismatches between columns 3 and 4 using something like awk '{if ($3 !=$4) print}' < consensus.txt. However, SOAP is a very old tool, and you might consider using something more recent for resequencing and calling SNPS, like GATK or Freebayes.