Hello,
I'm really new if the field and maybe my question is a little bit naive.
I'm trying to annotate my SNPs by using VEP. The reason to do this is to find the nonsense SNPs on each genome, the synonymous and non-Synonymous. The organism that I'm working on is sheep. The thing that is a little bit confusing to me is the tag system that Ensembl uses. For example, the "synonymous_variant" is for the synonymous SNPs, but I'm not so sure about the non-synonymous and the nonsense. I'm taking the "coding_sequence_variant" and "stop_gained", respectively. Am I right? Also, I cannot identify the CNVs. Is there any particular tag for this?
A second issue that I faced is that for some gene IDs there is no information about the name of the gene (symbol tag). Is there any way to use somehow a list with these IDs and find the names of these genes?
Thank you very much in advance and I'm really sorry for the questions "bombing".
Thank you very much for your help :)
I'm sending you some examples of tags that when I used the --symbol flag didn't give me the gene name.
ENSOARG00000000134 ENSOARG00000000154 ENSOARG00000000161
ENSOARG00000000134, ENSOARG00000000154 and ENSOARG00000000161 are all uncharacterised proteins, with no gene name, different from ENSOARG00000019179, the latter named as CD96.
Thank you very much for your help! So, I will leave them as unknown genes in my code.