Filter duplicates with picard
1
0
Entering edit mode
7 months ago
María José ▴ 10

Hi, I have tried to use Picard and filter duplicates in Illumina data after alignment, but it encounters memory issues. I've tried to increase memory with --mem 50 when submitting the job to the queue manager. Would it be advisable to decrease the max_records_in_ram? if so, would this change affect the subsequent results? Any guidance on this topic would be greatly appreciated.

java -jar picard.jar MarkDuplicates I="./U2OS_sorted.bam" O="./U2OS_sorted_duplicates.bam" M="./U2OS_sorted_metrics_dup.txt" ASSUME_SORT_ORDER=coordinate
human illumina libraries picard • 803 views
ADD COMMENT
0
Entering edit mode

Don't forget to follow up on your threads. If an answer was helpful, you should upvote it; if the answer resolved your question, you should mark it as accepted. You can accept more than one answer if they all work. If an answer was not really helpful or did not work, provide detailed feedback so others know not to use that answer.

Analyze read lengths in Illumina data Process Illumina Sequencing Data. Determine type of Pacbio read Extract aligned positions from reads how to see Pacbio reads in IGV

ADD REPLY
2
Entering edit mode
7 months ago

increase the java memory

java -Xmx10G -jar picard.jar MarkDuplicates ....
ADD COMMENT
0
Entering edit mode

Hi Pierre Lindenbaum, thank for your assistance. Now, Picard functions without memmory issues.However, I encounter a new error when I tried to run MarkDuplicates. I've used validateSamFile and some reads have problems:

ERROR: MISSING_READ_GROUP: 1
WARNING: RECORD_MISSING_READ_GROUP: 95,545,542

I think this issue could be resolved with AddOrReplaceReadGroups but I don't know what to indicate in the parameters: RGID=id(1 number ?), RGSM=name(HCT117 (cell line?), RGPU=unit, and RGPL=ILLUMINA. The ID of the reads is as follows: LH00587:112:22LM2WLT4:1:1101:4868:1028.

ADD REPLY
0
Entering edit mode

that is another question.

ADD REPLY
0
Entering edit mode

Please accept the answer so the question is marked solved on the website. To do that, click on the green check mark on the left side of the answer.

ADD REPLY

Login before adding your answer.

Traffic: 4308 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6