select a predefined number of gene sequences from a given dataset
0
0
Entering edit mode
6 months ago
Gumindu • 0

I have 150 sequences of a particular gene in a dataset. The gene is highly polymorphic, and the sequences are from different studies with different techniques. I have to select 50 out of 150 to analyze polymorphism and selection. What should be the criteria for selection? Should I choose the most diverged 50, the longest 50, just random 50 samples, or any other fair statistical method?

What if I have more than 50 sequences under one criterion? lets say 70 sequences out of 150 are the longest and the same in length, how to select 50 out of those 70?

polymorphism to sequences how • 289 views
ADD COMMENT

Login before adding your answer.

Traffic: 4783 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6