Question: Should I remove duplicates?
1
gravatar for A
17 months ago by
A3.8k
A3.8k wrote:

Sorry,

I have a list of bam files for instance like this

HUMAN_1000Genomes_hs37d5_RNA_seq_WTSI-COLIVM_005_1pre.***dupmarked***.bam

By dupmarked likely duplicates have been marked; I want to extract raw read counts from these files by featurecounts; Do you just me to remove duplicates? How I know basically I should do that or not for extracting raw read counts?

Thank you for any help

bam next-gen duplicates • 548 views
ADD COMMENTlink modified 17 months ago by ATpoint36k • written 17 months ago by A3.8k
2

RNA-seq: Should remove duplicates in all samples of same experiment although some do not have technical duplicates? and the links there in.

ADD REPLYlink modified 17 months ago • written 17 months ago by genomax87k

Sorry, owner of data says that duplicates have not been removed rather just marked in bam files; So should I remove them now?

I should mention I also have a bam.bai for each sample like

HUMAN_1000Genomes_hs37d5_RNA_seq_WTSI-COLIVM_005_1pre.dupmarked.bam.bai
ADD REPLYlink written 17 months ago by A3.8k

Is this RNAseq data or something else? File name seems to indicate it is RNAseq but just to be absolutely certain.

If it is RNAseq then the post linked above has all the info you need. I will also explicitly link this: mRNA-seq quality report (fastQC): Does it mean samples have adapters and should remove duplicates?

ADD REPLYlink modified 17 months ago • written 17 months ago by genomax87k

Thank you, yes this is RNA-seq definitely

ADD REPLYlink written 17 months ago by A3.8k

Sorry, is it possible that these files are not normal RNA-seq files rather I should do Calling variants in RNAseq because in this paper they done that

Organoid cultures recapitulate esophageal adenocarcinoma heterogeneity providing a model for clonality studies and precision therapeutics
ADD REPLYlink modified 17 months ago • written 17 months ago by A3.8k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1682 users visited in the last hour