Question

What to do when updating a gene annotation results in several possible results (or when the annotation of two genes converges)

5.7 years ago

rocha.dario.bio ▴ 10

Hello everybody,

I am working with annotated gene expression databases and software from different sources that I often need to analyze together, for example microarray, RNA-seq, and iTRAQ data. Since gene annotation can change over time and between sources, I figured I need to update the annotation of all the data I work with, and I eventually found an R package that seems to suit my needs: mygene. However, I haven't found an appropriate way to deal with two particular situations:

1- Sometimes one gene symbol gets updated to more than one symbol.
2- Sometimes two gene symbols get updated to the same symbol.

My current approach is to check for duplicated gene symbols in the updated annotations and remove all duplicates except the one with the lowest gene ID. However, I have no solid basis for that decision. How do other people deal with similar situations?
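To make the two cases and the workaround concrete, here is a minimal Python sketch. It assumes the update has already been reduced to rows of (old symbol, updated symbol, gene ID); the symbols and IDs are made up, the function names are hypothetical, and nothing here calls mygene itself.

```python
from collections import defaultdict

# Hypothetical (old_symbol, new_symbol, gene_id) rows standing in for the
# result of an annotation update; all symbols and IDs are invented.
rows = [
    ("OLD1", "NEWA", 101),
    ("OLD1", "NEWB", 102),  # one old symbol updated to two symbols
    ("OLD2", "NEWC", 205),
    ("OLD3", "NEWC", 203),  # two old symbols updated to the same symbol
    ("OLD4", "NEWD", 300),  # clean one-to-one update
]


def find_ambiguous(rows):
    """Return (one_to_many, many_to_one) dicts describing ambiguous updates."""
    old_to_new = defaultdict(set)
    new_to_old = defaultdict(set)
    for old, new, _gene_id in rows:
        old_to_new[old].add(new)
        new_to_old[new].add(old)
    one_to_many = {o: sorted(n) for o, n in old_to_new.items() if len(n) > 1}
    many_to_one = {n: sorted(o) for n, o in new_to_old.items() if len(o) > 1}
    return one_to_many, many_to_one


def keep_lowest_gene_id(rows):
    """For each updated symbol, keep only the row with the lowest gene ID."""
    best = {}
    for row in rows:
        new, gene_id = row[1], row[2]
        if new not in best or gene_id < best[new][2]:
            best[new] = row
    return sorted(best.values(), key=lambda r: r[1])


one_to_many, many_to_one = find_ambiguous(rows)
deduplicated = keep_lowest_gene_id(rows)
```

On this toy input the lowest-ID rule only settles the second case (NEWC keeps the OLD3 row); OLD1 still ends up under both NEWA and NEWB, so the first case would need a separate decision.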

gene symbol annotation update • 929 views

updated 5.7 years ago by Biostar 20 • written 5.7 years ago by rocha.dario.bio ▴ 10

If you have genome build and coordinate information available, why not use a trusted external annotation source such as NCBI, GENCODE (for human/mouse/rat only), or Ensembl instead, and avoid these types of issues?

5.7 years ago by GenoMax 141k