Entering edit mode
7.0 years ago
kk.mahsa
▴
150
hi everyone
how can i combine two fastq file resulted paired end sequencing into one fastq file for submitting in NCBI?
i used cat command and i got that this command is not useful for me.
i want one fastq file same below pattern
@HWUSI-EAS529:41:FC62YHFAAXX:8:1:7969:1330 1:N:0:GCCAAT
CTACGAAAGGGCACTTGGAATTCTCGGGTGCCAAGGAACTCCAGTCACGCCAATATCTCGTATGCCGTCTTCTGCT
+
IIIIIIIHIIHIIIIIIHHIIIHGIIIIEIIIIIIEIIHIIIIIIIIIIIHIIIIIBHIHIIHGIGIEGHHEGEEH
@HWUSI-EAS529:41:FC62YHFAAXX:8:1:7969:1330 2:N:0:GCCAAT
AGTGCCCTTTCGTAGGATCGTCGGACTGTAGAACTCTGAACGTGTAGATCTCGGTGGTCGCCGTATCATTAAAAAA
+
IIIIIIIIIIIIIIIIIIIIIIIDHIGIIIHIIIGHGIIIIIIIHHIHIIIIIIIIIHIIIIIIIIHIIGIIIIHI
errors like these killed runing:
how can i fixed fastq files?
What have you done to these files before getting to this point? It appears that they have been edited/tampered with in some way. You could try to run fastqValidate to verify your fastq files.
i renamed it by adding .fastq
my files are in fastq format (four line per read) but in their name there is not .fq or .fastq so i renamed it and added .fastq then run reformat.sh on them and got above problem.
i gathered my files via cat command successfully, can i submit output of cat command into SRA in NCBI?
is it possible that file be harmed due to renaming?
Just renaming the files should not have caused this problem. They may have been corrupted when you downloaded them from your sequence provider.
cat
'ing the files together does not achieve interleaving of the reads (which is what you showed in the original post). What you will have isentire contents of R1
followed byentire contents of R2
, if you use plaincat R1.fq R2.fq > new.fq
. If that is acceptable to SRA then you could do it that way.thanks genomax for your kindly advises