I want to predict the pan-proteome on 25 strains of the same species (each strain contains more than 9000 proteins). I want to know what are the values (%) of coverage and identity to use for a good prediction of Pan (Core and accessory) ??
For information I used proteinortho.
Looking forward to your answers