Variation in Ensembl Chromosome ID
1
0
Entering edit mode
4.2 years ago
Mel ▴ 60

Hi, I've been parsing through some Ensembl CDS files (using BioPython), and I'm getting some variation in how the chromosome ID is specified. Typically it is specified like this: "chromosome:GRCh38:7:142786213:142786224:1". However, other times it is specified like this: 'chromosome:GRCh38:CHR_HSCHR7_2_CTG6:142847306:142847317:1'. Why is this variation present? What does 'CHR_HSCHR7_2_CTG6' specify exactly? Thanks for any clarification or help.

genome • 755 views
ADD COMMENT
1
Entering edit mode
4.2 years ago
JC 13k

That is a haplotype sequence, check https://www.ensembl.org/Help/Faq?id=291 for more information

ADD COMMENT
0
Entering edit mode

Excellent! Thank you JC!

ADD REPLY

Login before adding your answer.

Traffic: 1901 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6