EGA (ENA) database wont accept fastq files
1
0
Entering edit mode
23 months ago
heskett ▴ 110

hi folks,

I keep getting this error despite clearly uploading two files per library for paired end fastq

In run, alias:"ena-RUN-TAB-24-05-2022-00:14:25:828-37227", accession:"". In run(ena-RUN-TAB-24-05-2022-00:14:25:828-37227) Paired reads must contain two Fastq files

this is an example of the spreadsheet. i name the library the same and then have two lines, one for each file _1 and _2 for paired end. I am sure that they expect a different format but i cant figure it out. theres literally no examples in the documetation. thanks so much

GM12878_clone4  PRJEB52794  Illumina NovaSeq 6000   gm12878_clone4_early_rt GENOMIC RANDOM PCR  WGS PAIRED  gm12878_clone4_early_rt_1.fastq.gz
GM12878_clone4  PRJEB52794  Illumina NovaSeq 6000   gm12878_clone4_early_rt GENOMIC RANDOM PCR  WGS PAIRED  gm12878_clone4_early_rt_2.fastq.gz
ENA EGA fastq • 1.3k views
ADD COMMENT
1
Entering edit mode

I usually work via the webin interface, and not with uploaded spreadsheets but ok.

Can you check/confirm a few things? :

  • you are sure the files are in correct fastq format? (check for instance head of both of them)
  • they are 'in sync' meaning the first read from file one is the counterpart of the first read of file 2 etc
  • It can be that you will need to provide the two files of a paired end read file on the same line but in different columns (see my initial sentence here, that is at least how it sort of looks in the webin interface)

and rest assured, they definitely accept fastq files :)

ADD REPLY
0
Entering edit mode

thanks much! yes i found the correct template that allows for two files in one line

ADD REPLY
0
Entering edit mode

You may be best off contacting help desk at EGA for this.

ADD REPLY
0
Entering edit mode

Thanks. I have tried but it looks like their response time is very long. I have a journal editor asking me to upload ASAP, so I was hoping someone might know the format better. I have spent time answering other peoples questions and just hope someone may be able to identify the issue easily and give a tip or two.

ADD REPLY
1
Entering edit mode
23 months ago
Michael 54k

What happens if you use only one line per pair (Just a wild guess)? Like so:

GM12878_clone4  PRJEB52794  Illumina NovaSeq 6000   gm12878_clone4_early_rt GENOMIC RANDOM PCR  WGS PAIRED  gm12878_clone4_early_rt_1.fastq.gz gm12878_clone4_early_rt_2.fastq.gz
ADD COMMENT
1
Entering edit mode

indeed, I'm also thinking in that direction

ADD REPLY
0
Entering edit mode

this ended up being correct. I accidentally downloaded the template that didn't have the extra columns to add multiple fastq to one line. thanks everyone

ADD REPLY

Login before adding your answer.

Traffic: 2564 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6