We developed a tool years ago to calculate Fst, using the ideas given by this example and migrating those formulas into Perl. This is the Perl code we use to calculate Fst, taking into account the different population and population-group sizes:

```
# GENETIC DIFFERENTIATION (FOR POPULATION GROUPS)
# HT eq groupHexp
# HS = sum( popHexp x popN ) / groupN
# Fst = ( HT - HS ) / HT
$HT = $Hexp{$group};
$HS = 0;
$Fst = 0;
if ($HT != 0) {
    # weight each population's Hexp by its size, then divide by the group size
    foreach $pop (@pops) { $HS += $Hexp{$pop} * $N{$pop}; }
    $HS /= $N{$group};
    $Fst = ( $HT - $HS ) / $HT;
}
```

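where Hexp is the local expected heterozygosity of each subpopulation, HS is the average of those Hexp values weighted by population size (the sum of each Hexp multiplied by its population size, divided by the size of the whole group), and HT is the expected heterozygosity for the entire group of populations considered. All of these values, together with the population sizes in N, have to be calculated beforehand.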
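To make the size weighting concrete, here is a small worked example with made-up numbers. It assumes a single biallelic locus, so Hexp = 2pq, and it assumes HT is computed from the size-weighted pooled allele frequency. That last point is my assumption for illustration only; compute HT however your own pipeline does.

```latex
% Assumed toy data: one biallelic locus, two populations
% pop 1: p_1 = 0.9, N_1 = 30;  pop 2: p_2 = 0.5, N_2 = 10;  N_group = 40
\begin{align*}
H_{exp,1} &= 2(0.9)(0.1) = 0.18, \qquad H_{exp,2} = 2(0.5)(0.5) = 0.50 \\
\bar{p} &= \frac{0.9 \cdot 30 + 0.5 \cdot 10}{40} = 0.8, \qquad H_T = 2(0.8)(0.2) = 0.32 \\
H_S &= \frac{0.18 \cdot 30 + 0.50 \cdot 10}{40} = \frac{10.4}{40} = 0.26 \\
F_{ST} &= \frac{H_T - H_S}{H_T} = \frac{0.32 - 0.26}{0.32} = 0.1875
\end{align*}
```

Note that an unweighted mean of the two Hexp values would give HS = 0.34 instead of 0.26, so the weighting by N does matter when the population sizes differ this much.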

In my opinion, different sample sizes shouldn't be a big issue (unless one population is REALLY small). There are several estimators of Fst. You might start with a very basic approach, such as the one usually taught to undergraduate students. For that, I always find this page by David McDonald at the University of Wyoming useful. There is also a useful discussion here on Biostars: Wright's Fst and Weir & Cockerham's Fst estimator - simple explanation of the difference

