What if the number of line in fastq.gz pair files are too different?
1
0
Entering edit mode
6 weeks ago
br0104 • 0

Please somebody help me,

I have fastq.gz files of three different groups of experiment from NextSeq 550 through BD rhapsody WTA multiplex pipeline.

I tried to analyze those results on SevenBridges using BD WTA pipeline but only one of three groups was successful and other groups have failed.

I asked for support and was told that pair fastq files (R1 and R2) have different lines so I counted line using 'wc -l' function and divided by four.

The number of line was similar in successful group however the other failed group showed big differences.

Does anyone know what to do with fastq.gz files to run the analysis pipeline successfully?

fastq.gz R SevenBridges count fastq • 181 views
ADD COMMENT
0
Entering edit mode

pair fastq files (R1 and R2) have different lines

You need match up paired-end reads from two FASTQ files.

ADD REPLY
1
Entering edit mode
6 weeks ago

You have to be sure that each mate in forward file has a respective in the reverse file as well. In other words, the number of reads/lines should be the same in both forward and reverse files. Try using fastq-pair prior to run any analysis.

This will remove un-matched reads from your data. In simple words, such reads which do not have a mate in either forward or reverse file will be discarded.

ADD COMMENT

Login before adding your answer.

Traffic: 2521 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6