Good morning/evening/night
I was using the ORFFInder (NCBI). Then i blastp to biggest selected the ORFs to check the domains. This case the whole protein domains are splitted in the two ORFs (and in different frames +1, +3). I wonder if someone has read something in literature. The strange (or not) that the sequences are from RNA-seq. It should have without introns (hypothetically). I am thinking it is about problems in the assembly or a natural processes.
Have someone saw this in animal proteins? is it normal? I know in viruses is so normal in different frames, different proteins. But it is the same protein. Other option is that the first codon stop is not used, then the translation continues. But it is in different frames (+1,+3).
Regards.
Thanks lieven.sterck. I will follow your advise.
I have discovered new topics, such as there are 'pseudogenes' are not pseudo.
Olfactory receptor pseudo-pseudogenes https://www.nature.com/articles/nature19824
The small ORFs haave a huge potential. Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5703645/
Regards
yes indeed. "True" pseudogenes are a possibility as well. Are you using genomics as input or are we talking transcriptome here (assembly artefact will be more likely than pseudogene when it's transcriptome data).
And yes again, there are indeed 'pseudogenes' that seem to be functional but then the question is of course whether they are 'true' pseudogene in the classical sense of the term.
Sorry, Yes it is transcriptome sequence. Probably the artifact.
Regards.