Hello All:
I am trying to remove duplicates using Picard. I have two separate files, one from control and one from treatment, and I would like to remove duplicates from both files using a single command line. Can anyone suggest how to specify multiple input files in Picard MarkDuplicates, and also how to specify paired-end reads? I am really confused.
Thanks in advance.
Sorry, maybe my question was not clear. I have two SAM/BAM files, one from control and one from treatment. Since I want to mark and remove duplicates from these SAM/BAM files using Picard, I would like to know how to specify multiple single-end SAM/BAM files, and also paired-end files, on one command line.
You can only remove duplicates from one sample at a time. If you want to do it on a few files, set up a loop in BASH (or some other shell with which you are familiar).
Also take a look here, where the same question was asked. The solution is effectively what I have just mentioned in my recent response: http://seqanswers.com/forums/showthread.php?t=66969
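A minimal sketch of such a loop is below. The file names (control.bam, treatment.bam), the picard.jar location, the output naming, and the DRY_RUN switch are all assumptions for illustration, so adjust them to your setup. It also assumes each BAM is already coordinate-sorted. As far as I know, MarkDuplicates works out whether reads are paired-end or single-end from the flags in each record, so no separate paired-end option should be needed.

```shell
#!/bin/sh
# Sketch: run Picard MarkDuplicates once per BAM file, one sample at a time.
# Assumptions (not from this thread): picard.jar path, input file names,
# output naming, and that inputs are already coordinate-sorted.
set -eu

PICARD_JAR="${PICARD_JAR:-picard.jar}"   # hypothetical path to the Picard jar
DRY_RUN="${DRY_RUN:-1}"                  # 1 = only print commands, 0 = run them

dedup_one() {
    in_bam="$1"
    base="${in_bam%.bam}"                # strip the .bam suffix for output names
    # Build the command as positional parameters so file names stay quoted.
    set -- java -jar "$PICARD_JAR" MarkDuplicates \
        "I=$in_bam" \
        "O=${base}.dedup.bam" \
        "M=${base}.dup_metrics.txt" \
        REMOVE_DUPLICATES=true
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"                        # show what would be run
    else
        "$@"                             # actually run Picard
    fi
}

# Process each sample separately, so control and treatment are never mixed.
for bam in control.bam treatment.bam; do
    dedup_one "$bam"
done
```

By default this only prints the two commands, so you can check them first; run it with DRY_RUN=0 to execute Picard for real. Leaving out REMOVE_DUPLICATES=true should mark the duplicates in the output rather than drop them.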
Trust this helps.
Thank you so much Kevin!!! I will look into these links.
Great - no problem! :)