Fixing MGI Sequence Headers - FASTQ Format
8 weeks ago
joe_genome ▴ 70

I am trying to fix the headers of FASTQ data from MGI. They come in a different format from the Illumina headers that most tools expect, which causes problems downstream with tools such as samtools and hisat2. A typical header looks like this:

   @E250087456L2C221R00100000555#GGGTTTA/1

The UMIs also come in a separate file, so we need to parse that as well, and I am not sure how to approach it.

Possible approach? Append the UMI to the header, just before the /1 suffix:

   @E250087456L2C221R00100000555#GGGTTTA:AATTGGCCTTAA/1
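
A minimal Python sketch of that approach might look like the following. It assumes the UMI file is itself a four-line-per-record FASTQ, with records in the same order as the reads and the UMI stored on the sequence line; the post does not say how the UMI file is laid out, so this may need adjusting. The function names (read_fastq, append_umi, tag_reads) are made up for illustration.

```python
from typing import Iterator, TextIO, Tuple


def read_fastq(handle: TextIO) -> Iterator[Tuple[str, str, str]]:
    """Yield (header, sequence, quality) from a 4-line-per-record FASTQ."""
    while True:
        header = handle.readline().rstrip("\n")
        if not header:  # end of file (or a trailing blank line)
            return
        seq = handle.readline().rstrip("\n")
        handle.readline()  # '+' separator line, ignored
        qual = handle.readline().rstrip("\n")
        yield header, seq, qual


def append_umi(header: str, umi: str) -> str:
    """Insert ':UMI' before a trailing '/1' or '/2' mate suffix, if present."""
    name, sep, mate = header.rpartition("/")
    if sep and mate in ("1", "2"):
        return f"{name}:{umi}/{mate}"
    # No mate suffix found: append the UMI at the end of the header.
    return f"{header}:{umi}"


def tag_reads(reads: TextIO, umis: TextIO, out: TextIO) -> None:
    """Write each read to `out` with its UMI appended to the header.

    Assumption: both inputs hold the same reads in the same order.
    zip() stops at the shorter input, so mismatched lengths are not detected.
    """
    for (hdr, seq, qual), (_, umi, _) in zip(read_fastq(reads), read_fastq(umis)):
        out.write(f"{append_umi(hdr, umi)}\n{seq}\n+\n{qual}\n")
```

For gzipped input, the handles could be opened with gzip.open(path, "rt") from the standard library before being passed to tag_reads.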
fastq mgi • 456 views

This worked quite well, thank you.
