Question

Comparing proteomes in which most protein abundances differ

2

Entering edit mode

6.8 years ago

alexander.thompson ▴ 20

How can I compare proteomes that differ massively?

I am interested in comparing two fluids that I have proteomic data for two conditions (currently 3 replicates but I could increase this) in which the abundance of most proteins differs (i.e. that violates the assumptions of in silico normalization approaches (that I am aware of)). Is there a statistically robust way that I can compare relative abundance of the proteins between conditions?

Approaches I have considered:

Using standard normalisation e.g. median centering and scaling
Subtracting the (log) abundance of a common protein or set of common proteins?
Ranking proteins and using non-parametric tests e.g. Wilcoxon-Mann-Whitney

Many thanks

Proteomics normalization Statistics • 1.5k views

ADD COMMENT • link updated 6.8 years ago by Jean-Karim Heriche 27k • written 6.8 years ago by alexander.thompson ▴ 20

score 2 · Answer 1 · 2017-08-09

2

Entering edit mode

6.8 years ago

Jean-Karim Heriche 27k

You can express everything in terms of proportion or fractional abundance, i.e. the sum of each sample equals 1 (or 100%). You then have compositional data requiring particular treatment.

ADD COMMENT • link 6.8 years ago by Jean-Karim Heriche 27k

0

Entering edit mode

Thanks - I will look into this. I'm not sure that the summed abundance is necessarily proportionate to the amount injected for the two samples, but I don't think that there is an appropriate way around this.

ADD REPLY • link 6.8 years ago by alexander.thompson ▴ 20

0

Entering edit mode

The summed abundance is not related to the absolute total amount if you express it as proportion of the total. The sum of all proteins will be 100%, irrespective of what amount you started with. In this way, you only get relative information. If preserving absolute information on abundance is necessary, don't go down this road. You mentioned relative abundance so my understanding was that you don't care about the absolute values.

ADD REPLY • link 6.8 years ago by Jean-Karim Heriche 27k

0

Entering edit mode

Yes, that is true. I think my concern is less of a statistical/bioinformatic one and I'm sure that there is no way around it - given the gross differences is proteomes, some of the observed differences in abundance may be due to factors e.g. competitive ionisation that might influence the abundance of a protein in one condition but not the other.

ADD REPLY • link 6.8 years ago by alexander.thompson ▴ 20