I am currently in internship, and honestly, "the more I learn, the more I realize how much I don't know". There is so many websites and documentations that I am getting confused.
I have a refFlat file, looks like this:
TMEM11 NM_003876 chr17 - 21041854 21058500 21042228 21058057 2 21041854,21057995, 21042745,21058500,
I would like to know if there is a function in R or file which I could sort the proteins in the refFlat by categories.
So I want to sort all the protein in the RefFlat file and gather by function. (like COG, each COG have a key letter which represents the function of the protein).
So I am not interested by the protein itself but the function/category to which it belongs.
And in the end, the main idea is to have my results like this:
category A (maybe translation)--> 45 proteins
category B (lipid metabolism)--> 34 proteins
and so on...
Thank you very much.