2.1 years ago
screadore ▴ 20
I'm looking for a whole-genome-sequenced control group aligned to the human reference genome GRCh38, with 30X coverage. Does anyone know of any publicly available databases that we could gain access to for this? Thank you.
1000 Genomes?
Sequence data for the 1000 Genomes Project can be found here: https://www.internationalgenome.org/faq/where-are-your-sequence-files-located/
GTEx might be another option: https://www.gtexportal.org/home/datasets
Not sure about the coverage though.
For raw sequencing data you will have to apply for dbGaP access via your PI / institution.
As mentioned, raw human sequencing data is privacy protected by law because it is identifiable. You need to either use a high-level summary of the data (for example, population variant allele frequencies), or apply through your institution for access (and you will likely need a very good justification for doing this).
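To illustrate what a "high-level summary" such as a population allele frequency looks like, here is a minimal sketch that computes the alternate-allele frequency at a single biallelic site from diploid genotype counts. The function name and the cohort counts are hypothetical and chosen only for illustration; they do not come from any of the resources mentioned above.

```python
def allele_frequency(n_hom_ref, n_het, n_hom_alt):
    """Alternate-allele frequency at a biallelic site, from diploid genotype counts."""
    # Each diploid individual contributes two alleles.
    total_alleles = 2 * (n_hom_ref + n_het + n_hom_alt)
    if total_alleles == 0:
        raise ValueError("no genotyped individuals")
    # Heterozygotes carry one alternate allele, homozygous-alt carry two.
    return (n_het + 2 * n_hom_alt) / total_alleles


# Hypothetical control cohort: 90 hom-ref, 9 het, 1 hom-alt individuals.
print(allele_frequency(90, 9, 1))
```

A summary like this does not expose any individual's genotype, which is why aggregate frequencies are generally easier to obtain than the raw reads.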