I have a list of UniProt gene IDs with their associated Gene Ontology (biological process) annotations, which I obtained from Uniprot.org. I am showing only one gene ID with its associated biological processes, because the annotations for the other genes are lengthy.
O95831 activation of cysteine-type endopeptidase activity involved in apoptotic process; apoptotic DNA fragmentation; apoptotic process; cell redox homeostasis; chromosome condensation; DNA catabolic process; intrinsic apoptotic signaling pathway in response to endoplasmic reticulum stress; mitochondrial respiratory chain complex I assembly; NAD(P)H oxidase activity; neuron apoptotic process; neuron differentiation; oxidoreductase activity, acting on NAD(P)H; positive regulation of apoptotic process; regulation of apoptotic DNA fragmentation.
Problem: I want to text-mine these annotations and extract only the biological processes related to mitochondria (i.e., the terms in which mitochondria are mentioned). Would a regex be useful for this, or are there other approaches that might work better?
Expected result: the output I want is the following:
O95831 mitochondrial respiratory chain complex I assembly
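A minimal sketch of the regex approach in Python, assuming each line starts with the ID, followed by whitespace and then the terms separated by semicolons, as in the example above. The function name `mito_terms` is hypothetical, and the sample line is a shortened copy of the O95831 entry, used only for illustration:

```python
import re

# Matches "mitochondria", "mitochondrial", "mitochondrion", etc.,
# regardless of case.
MITO = re.compile(r"mitochondri", re.IGNORECASE)

def mito_terms(line):
    """Return (uniprot_id, [terms mentioning mitochondria]) for one line.

    Assumes a non-empty line of the form "<ID> <term>; <term>; ... ."
    """
    parts = line.strip().split(None, 1)  # split on the first space or tab
    uniprot_id = parts[0]
    annotation = parts[1] if len(parts) > 1 else ""
    # Split into individual terms and drop the trailing period, if any.
    terms = [t.strip().rstrip(".") for t in annotation.split(";")]
    return uniprot_id, [t for t in terms if MITO.search(t)]

# Shortened sample line (a subset of the terms listed for O95831).
line = ("O95831 apoptotic process; cell redox homeostasis; "
        "mitochondrial respiratory chain complex I assembly; "
        "neuron differentiation.")

uniprot_id, hits = mito_terms(line)
print(uniprot_id, "; ".join(hits))
```

Splitting on `;` before matching keeps each term intact, so the regex only has to decide whether a term mentions mitochondria rather than find where the term begins and ends. A keyword match like this finds only terms that name mitochondria explicitly. It would miss processes that are mitochondria-related but do not contain the word.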
Your help is appreciated.