Question: How to identify library name in fastq filename
10 weeks ago by
zmk0 wrote:

Dear NGS experts,

I have just started analysing NGS data. I recently received some files that were generated at EMBL. I want to start analysing them, but I am having a hard time figuring out the read groups, which I need before I can begin GATK data preprocessing (from FASTQ to BAM).
The following article explains the different read group fields I need to build the BAM file: here
My FASTQ file name is structured exactly like the one in this image
Can anyone identify the read group fields for me, especially the library identifier?
The image source is here.

Thank you so much.



C43FWACXX is likely the ID of the flowcell this sample ran on. The rest of the mesoseq string is likely the sample ID. The only other distinguishing details are that the sample is male and that it appears to have run in lane 5. The sample ID would also be the library name, since a sample is converted into a library (unless multiple libraries are made from one sample).
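
As a rough illustration of how those pieces could be turned into a read group, here is a minimal Python sketch. It assumes the common convention of building the read group ID and platform unit from flowcell and lane, and it reuses the sample ID as the library name as suggested above. The function name `build_read_group`, the `SAMPLE_ID` placeholder, and the `ILLUMINA` platform value are assumptions for illustration, not something taken from the actual file name.

```python
from typing import Optional


def build_read_group(flowcell: str, lane: int, sample: str,
                     library: Optional[str] = None,
                     platform: str = "ILLUMINA") -> str:
    """Return a tab-separated SAM @RG header line (hypothetical helper)."""
    # Assumption: one library per sample, so the sample ID doubles as LB.
    library = library or sample
    # Assumption: flowcell plus lane is unique enough for both ID and PU.
    unit = f"{flowcell}.{lane}"
    fields = [
        ("ID", unit),
        ("PU", unit),
        ("SM", sample),
        ("LB", library),
        ("PL", platform),
    ]
    return "\t".join(["@RG"] + [f"{tag}:{value}" for tag, value in fields])


# Flowcell and lane come from the answer above; SAMPLE_ID is a placeholder.
print(build_read_group("C43FWACXX", 5, "SAMPLE_ID"))
```

If more than one library was prepared from the same sample, pass a distinct `library` value for each one instead of relying on the default.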

modified 10 weeks ago • written 10 weeks ago by genomax46k
