Question: OrthoMCL results (groups.txt) contain fewer sequences than the input FASTA file
4.9 years ago, Ginsea Chen (Chinese Academy of Tropical Agricultural Sciences, Danzhou, China) wrote:

Hello everyone!

I tried to cluster 10000 protein sequences with OrthoMCL, but my results (groups.txt, the output file of orthomclMclToGroups) contain only 6853 sequences. Is this expected? I thought the mismatch might be caused by using an unsuitable BLAST tool, so I would like to know which BLAST tool is commonly used in OrthoMCL analyses: blast2 or blast+?

Thanks

 

4.8 years ago, DG wrote:

I'm not 100% sure, but I believe OrthoMCL doesn't output singletons at the pairs and groups stages of the analysis. I suspect they make up the bulk of your missing sequences. You could check this by looking at the BLAST results for the sequences that didn't make it into the output and seeing what their BLAST scores and hits look like.
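
A rough Python sketch of that check, in case it helps. It assumes each groups.txt line looks like `group_id: member member ...`, that the FASTA headers start with the same IDs that appear in groups.txt, and that the BLAST results are in tabular format with the query ID in column 1 and the subject ID in column 2. The function names and the file name `all_vs_all.tsv` are placeholders I made up, so adjust them to your own files.

```python
def fasta_ids(lines):
    """Collect sequence IDs (first word of each '>' header line)."""
    ids = set()
    for line in lines:
        if line.startswith(">"):
            fields = line[1:].split()
            if fields:
                ids.add(fields[0])
    return ids


def grouped_ids(lines):
    """Collect member IDs from lines shaped like 'group_id: member member ...'."""
    ids = set()
    for line in lines:
        # Split at the first colon only; everything after it is the member list.
        _, sep, members = line.partition(":")
        if sep:
            ids.update(members.split())
    return ids


def hit_counts(blast_lines, ids):
    """Count non-self hits per query for the given IDs in tabular BLAST output."""
    counts = {seq_id: 0 for seq_id in ids}
    for line in blast_lines:
        cols = line.rstrip("\n").split("\t")
        if len(cols) >= 2 and cols[0] in counts and cols[0] != cols[1]:
            counts[cols[0]] += 1
    return counts


if __name__ == "__main__":
    # File names are assumptions; replace them with your own paths.
    with open("goodProteins.fasta") as fa, open("groups.txt") as gr:
        missing = fasta_ids(fa) - grouped_ids(gr)
    print(len(missing), "sequences are not in any group")

    with open("all_vs_all.tsv") as bl:
        for seq_id, n in sorted(hit_counts(bl, missing).items()):
            print(seq_id, n, sep="\t")
```

If most of the missing sequences have zero hits other than themselves, that would be consistent with them being singletons that were never placed in a group.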
