Question

How to find orthologus sequences between the different species of non-model organisms?

0

Entering edit mode

3.6 years ago

sunnykevin97 ▴ 990

Hi

I work with non-model organisms trying to understand the orthologs transcripts shared among the different species.

I de novo assembled the transcripts using TRINITY

Removed redundant transcripts using CDHIT

How to find the orthologs common among the species , is their any tools of interest that can do the job ?

Suggestions please.

Thanks Kevin

RNA-Seq Transcript Assembly ORF • 1.7k views

ADD COMMENT • link 3.6 years ago by sunnykevin97 ▴ 990

score 1 · Answer 1 · 2021-03-24

1

Entering edit mode

3.6 years ago

Michael 55k

There is OrthoFinder. It works on predicted proteomes, so you need to get the likely coding sequences and the predicted amino-acids sequences from the transcriptome assembly. This can be done with Transdecoder.

ADD COMMENT • link 3.6 years ago by Michael 55k

0

Entering edit mode

Predicted ORF's using Transdecoder. Subjected the aminoacid sequences of each species into Orthofinder. I generated comparative stats about the shared orthologs among all species. How do I extract Orthology sequences ? further doing MSA alignment -> positive selection analysis (dN/dS).

Suggestions please!

ADD REPLY • link 3.6 years ago by sunnykevin97 ▴ 990

0

Entering edit mode

What do you mean by "orthology sequences", consensus sequences?

Answers please! :D

ADD REPLY • link 3.6 years ago by Michael 55k

0

Entering edit mode

The sequences shared among the all the nine species in my case.

I generated the alignment by performing multiple sequence alignment.

ADD REPLY • link 3.6 years ago by sunnykevin97 ▴ 990

score 1 · Answer 2 · 2021-03-24

Phylogeny has been raised here already. That one is certainly capable of doing this and will even be quite robust but will also entail a lot of work as well as some knowledge on the basics of it and the data processing.

Much more straightforward (and more than enough in most cases) is to do a blast based analysis. You can either run and process the blast yourself or even run some tool that does it all for you. examples of tools: InParanoid, Orthomcl, OrthoFinder, RBH, ....

I personally suggest you first give those a try (or one/some of them) and only if that does not work out go to phylogeny.

score 0 · Answer 3 · 2021-03-24

If you have sequence files (preferably FASTA, even FASTQ files can be converted to FASTA) for all the organisms of your interest, you can use phylogeny tools like MEGA. MEGA X is a popular tool with several similarity functions and visualization tools that you can use for the construction of de novo phylogeny trees.

Starting from scratch can be helpful in your case if there's not a lot of prior information known about the organisms of your choice. MEGA is properly documented and has text and video tutorials too. For other tools, see this comprehensive answer.