Question: (Closed) Mark Duplicates - MAPQ = 0 and Value was put into PairInfoMap more than once
0
gravatar for Christian.Wood
4.7 years ago by
United Kingdom
Christian.Wood0 wrote:

Dear Biostars team,

I'm relatively new to using the Galaxy online platform and have been using it to run RNAseq with some paired end RNA data from an Illumina run with the rat rn5 genome. After completing our RNAseq analysis we're trying to look for SNP/Variants within the reads but I am having issue getting the files pre-processed with Picard tools with the Mark Duplicate Reads tool throwing up a couple of errors halting progress. These are the steps that I have taken so far:

  • Raw Illumina Fastq files ftp'd to usegalaxy public instance (FOR and REV for 2 lanes)
  • FASTQ Groomer - convert to fastqsanger
  • Trim by FASTQ quality score >=20
  • Map with BWA for Illumina using rn5 and paired end reads
  • Convert SAM to BAM for both mapped lane files
  • Reorder BAM for both files
  • Add read groups for both files
  • Mark Duplicate reads - removing duplicates from output -> here is where we get the issue.

The two bugs that are thrown up are - "MAPQ should be 0 for unmapped read." and "Value was put into PairInfoMap more than once" which halt this pre-processing step before moving the BAM files onto GATK Variant analysis.

In addition, for running the GATK analysis, is the best practice for using a custom genome just ftping the USCS rn5.fa file into a history and using that or should there be an additional index file for this?

Any help with regards to these issues would be greatly appreciated and please let me know if I need to clarify anything for a solution to be found!

Many thanks,

Christian

 

 

ADD COMMENTlink modified 4.7 years ago • written 4.7 years ago by Christian.Wood0

Hello Christian.Wood!

We believe that this post does not fit the main topic of this site.

This belongs on https://biostar.usegalaxy.org

For this reason we have closed your question. This allows us to keep the site focused on the topics that the community can help with.

If you disagree please tell us why in a reply below, we'll be happy to talk about it.

Cheers!

ADD REPLYlink written 4.7 years ago by RamRS24k
Please log in to add an answer.
The thread is closed. No new answers may be added.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1608 users visited in the last hour