Entering edit mode
6.9 years ago
Ric
▴
440
I have two folders and each of them contain the same file names.
ls -1
10_S0_L001_R1_001.fastq.gz
10_S0_L001_R2_001.fastq.gz
11_S0_L001_R1_001.fastq.gz
11_S0_L001_R2_001.fastq.gz
Is there a way to extract from each dataset the flowcell name and use it as unique filename?
Thank you in advance.
Thank you, is there a script for it?
You can use standard unix tools (such as
cut,awk,tr,grep, etc) or non-standard ones (e.g.,bioawk) to extract metadata from your fastqs. Like genomax said, you'd need to extract some other identifier(s) in order to make your filenames unique.