Entering edit mode
10.0 years ago
The most recent versions of the human genome assembly (e.g. hg38 and GRch37) contain a new feature, called alternate reference loci.
How can I see which genes are included in these regions? I would guess that these correspond mostly to HLA and olfactory regions.
Thanks! I could not find the table. You should also add "cut -f2 | sort| uniq | wc -l" because some genes are repeated (they appear in more than one alternate locus). The answer I get now is 1039.
sort -u
is slightly faster and less keystroke thansort | uniq
Sort & uniq in Linux shell