Question: Submitting short reads to SRA
gravatar for kristjan
5.7 years ago by
kristjan160 wrote:

Because many journals request to have all the sequences in a public database, I wanted to submit my data to ENA or GenBank, but now I have some problems. The sequencing was done in 2011 with Illumina HiSeq200 of 16S V6 region. After barcode and primer removal the average read length was about 80 bp. My first problem is that GenBank accepts over 200 and ENA over 100 bp long reads, so do I have to find another database that accepts <100 bp reads?

And my second question is do you have to submit raw reads or after quality check? Because ENA requests fastq files, but after denoising I only have fasta files and unable to merge it with quality file. Although it makes more sense to me to upload denoised data and not raw reads as there is not much useful information in the low quality reads.

sra database submission • 2.5k views
ADD COMMENTlink modified 5.7 years ago by Ido Tamir5.1k • written 5.7 years ago by kristjan160
gravatar for Ido Tamir
5.7 years ago by
Ido Tamir5.1k
Ido Tamir5.1k wrote:

The data you submit should be demultiplexed but unaltered. So you can submit to ENA (although I am quite sure that they also accept reads of length 36 e.g.). The idea of the archives is that other scientists can reproduce your analysis or do their own from the raw data.

ADD COMMENTlink written 5.7 years ago by Ido Tamir5.1k

I totally agree here.  Demultiplex.  Do not clean.  Submit to SRA.  GenBank is an incorrect database to submit Illumina reads to.

ADD REPLYlink written 5.7 years ago by Lee Katz3.0k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1696 users visited in the last hour