Where do I get a WES dataset of size <1GB
1
0
Entering edit mode
10 weeks ago
pragnapcu • 0

Can someone please tell me from where can I get the WES or WGS dataset of size <1GB

WGS WES genomics • 380 views
ADD COMMENT
0
Entering edit mode

what do you mean WES dataset?

ADD REPLY
0
Entering edit mode

whole-exome sequencing data or whole-genome sequencing data. Please correct me if I am wrong. I have started learning NGS analysis just now. I sincerely appreciate ur suggestions.

ADD REPLY
1
Entering edit mode

You can always downsample a larger dataset. seqtk, reformat.sh from BBMap are a couple of possible tools.

ADD REPLY
0
Entering edit mode

yeap but it can be that you need FASTQ files < 1GB, or BAM files < 1GB (which is pretty unrealistic), or VCF files < 1GB. All such files are possibly to be generated from WES

ADD REPLY
2
Entering edit mode
10 weeks ago
ATpoint 54k

Just browse sra-explorer.info for datasets. I doubt you can meaningfully query for file size as this depends on coverage and read length. Just download any from the species you need and then downsample. Just use head on the fastq files to get like the first 4000000 lines which equals 1mio reads. Reads in fastq files are random and not ordered other than its position on the flow cell, hence you do not strictly need a dedicated sampling approach, especially if this is just for some practicing task.

ADD COMMENT

Login before adding your answer.

Traffic: 2950 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6