I have a list of about 1000 cancer cell lines with associated RNA-seq expression profiles per gene. I would like to confirm that these cell lines have been verified, or meet a standard that makes them usable in research, since some cell lines may have been contaminated or mislabelled at some point. Is there a list of "verified" cancer cell lines somewhere? Or any advice on how to go about this?
There is a Register of Misidentified Cell Lines:
ICLAC curates a Register of cell lines that are known to be misidentified through cross-contamination or other mechanisms (e.g., mislabelling).
It seems to be actively maintained as well (the last update was less than a month ago).
You can also look up your cell lines in the Cellosaurus. All those that are "problematic" are flagged as such, and this includes every cell line in the ICLAC register as well as others. The following FAQ explains this:
Note that you can retrieve the list of cell lines in the ICLAC register by using:
Here is an example in XML of the information on a problematic cell line and its ICLAC register number:
<comment category="Registration"> International Cell Line Authentication Committee, Register of Misidentified Cell Lines; ICLAC-00298 </comment>
<comment category="Problematic cell line"> Contaminated. Shown to be a HeLa derivative (PubMed=566722; PubMed=20143388). </comment>
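If you want to screen your ~1000 cell lines programmatically, one option is to scan the XML for these two comment categories. The following is only a minimal sketch using Python's standard library. The `<cell-line>` wrapper and the `<name>` element are hypothetical, added just to make the snippet parseable, so check the real Cellosaurus XML layout before relying on it. Only the two `<comment>` elements are taken from the example above.

```python
import re
import xml.etree.ElementTree as ET

# ASSUMPTION: the <cell-line> wrapper and <name> element are hypothetical,
# used only for illustration. The two <comment> elements are copied from
# the example above.
SAMPLE = """
<cell-line>
  <name>EXAMPLE-LINE</name>
  <comment category="Registration"> International Cell Line Authentication Committee, Register of Misidentified Cell Lines; ICLAC-00298 </comment>
  <comment category="Problematic cell line"> Contaminated. Shown to be a HeLa derivative (PubMed=566722; PubMed=20143388). </comment>
</cell-line>
"""


def comments_by_category(xml_text, category):
    """Return the whitespace-normalised text of every <comment> with the given category."""
    root = ET.fromstring(xml_text)
    return [
        " ".join((c.text or "").split())
        for c in root.iter("comment")
        if c.get("category") == category
    ]


def problematic_notes(xml_text):
    """Text of all 'Problematic cell line' comments (empty list if none)."""
    return comments_by_category(xml_text, "Problematic cell line")


def iclac_ids(xml_text):
    """ICLAC register numbers (e.g. 'ICLAC-00298') found in 'Registration' comments."""
    ids = []
    for text in comments_by_category(xml_text, "Registration"):
        ids.extend(re.findall(r"ICLAC-\d+", text))
    return ids


if __name__ == "__main__":
    print(problematic_notes(SAMPLE))
    print(iclac_ids(SAMPLE))
```

A cell line for which `problematic_notes` returns a non-empty list would then be a candidate for exclusion or manual review. Note that an empty list only means the entry carries no such flag; it does not by itself prove the line has been authenticated.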