Question: Copy number variation using HTSeq/DESeq2
0
gravatar for ThePresident
3.0 years ago by
ThePresident150
ThePresident150 wrote:

I was wondering if the combination of HTSeq / DESeq2 (normally used for differential expression analysis in RNA-seq) could be used for copy number variation in case of DNA sequencing? I can't see why not, but again I didn't look at the math behind DESeq package.

Thanks, TP

deseq copy number htseq • 1.8k views
ADD COMMENTlink modified 3.0 years ago by WouterDeCoster44k • written 3.0 years ago by ThePresident150

Hi ThePresident,

Make sure to chose a more descriptive title for future threads, in this case "Copy number variation using HTSeq/DESeq2" would have been more informative. I've adapted this thread now, but please keep this in mind.

Cheers,
Wouter

ADD REPLYlink written 3.0 years ago by WouterDeCoster44k

I will, thank you for the advice.

ADD REPLYlink written 3.0 years ago by ThePresident150
5
gravatar for Kevin Blighe
3.0 years ago by
Kevin Blighe67k
Republic of Ireland
Kevin Blighe67k wrote:

If you don't see copy number variation analysis mentioned in the DESeq / DESeq2 manual, then don't use it for that purpose. The data distribution of your CNV data will not match that expected by DESeq (expects a negative binomial distribution). CNV data is measured as discrete intervals, so, something like a Hidden Markov Model (HMM) is more commonly employed (although it can be measured on a continuous scale too).

Note that the same question was asked in relation to edgeR and CNV on the Bioconductor forum: Question: edgeR for CNV detection

Also, take a look at this other Biostars question: Copy Number Variation from paired end RNA-Seq data Note, in particular Devon's reply, where he alludes to the "fundamental limitation" of trying to detect CNV from RNA-seq. This limitation relates to the fact that a copy number event does not necessarily alter gene expression levels. A gene could easily be duplicated, for example, but, without the promoter sequence and/or transcription start site (TSS), it will not be expressed (or just expressed at negligible levels).

If you can't afford to whole genome sequence, then the Affymetrix SNP 6.0 array can determine genome-wide CNV profiles, along with genotyping SNPs. I used this in my PhD years ago.

Best of luck, Kevin

ADD COMMENTlink modified 3.0 years ago • written 3.0 years ago by Kevin Blighe67k
2

Just to add, CNV calling with a DE tool having the assumption that data is normally distributed does not in any way accord for finding CNV which works on discrete data. One needs to find the right tool and the right distribution for finding CNVs and there are plenty of technology to produce the data and tools to generate copy profiles from those data. One important this is properly accounting for allelic frequencies while scanning through the genome and then using segmentation for finding copy ratios. This cannot be done with DESeq2. Try to read about which technology is specific to which kind of data generation first to better understand the power and utility.

ADD REPLYlink written 3.0 years ago by ivivek_ngs5.0k
1

To add to this, for exome sequencing there is also exomeCopy from the same author as DESeq2. But tens of tools for WES CNV analysis are available.

ADD REPLYlink written 3.0 years ago by WouterDeCoster44k

I totally agree with all of you. I was simply curious but Kevin brought a good argument about different expected distributions in DE vs CNV. Thank you all.

ADD REPLYlink written 3.0 years ago by ThePresident150
1

Great - good luck. Hope that you win the next election, Mr. ThePresident.

PS - note that DESeq2 does say that it can be used for ChIP-seq data, although it is definitely more renowned for RNA-seq.

ADD REPLYlink written 3.0 years ago by Kevin Blighe67k

EdgeR and DESeq2 can be used for ChIPSeq mostly for differential peak calling which is different from CNV. Again data is counts and distribution is in accordance with RNAseq.

ADD REPLYlink written 3.0 years ago by ivivek_ngs5.0k
1

If an answer was helpful you should upvote it, if the answer resolved your question you should mark it as accepted. Upvote|Bookmark|Accept

ADD REPLYlink written 3.0 years ago by WouterDeCoster44k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1303 users visited in the last hour