Question: retro-engineered annotation on genome assembly
guillaume.rbt (France) wrote, 14 months ago:

Hi everyone,

I'm working on a fungal species for which I have two genome assemblies, built from two different strains, along with one annotation per assembly.

By comparing the annotated peptide sequences of the two strains with a BDBH (bidirectional best hit) analysis, I get a set of shared proteins and some that appear specific to each strain.

I know, from BLASTing them against the genome assemblies, that most of the "strain-specific" genes are nevertheless also present in the other strain's genome (most likely they were missed because different annotation software was used for each strain).

What I would like to do is retrieve the sequences of one strain's "specific" genes from the other strain's genome, so that I can complete its annotation.

Would anybody have a clue how to do such a thing?

Thanks

Bill Pearson wrote, 14 months ago:

A possible strategy:

(1) blastp all of fungus1 vs fungus2 and vice versa. Find the proteins in fungus1 that do not have significant hits (possibly with a percent identity and coverage threshold) in fungus2, or have hits that only cover part of the protein, and vice-versa.

(2) take the proteins in fungus1 and fungus2 that do not have a match in the other fungus, and tblastn (tfastx) them against the other fungal assembly. I would expect that many of the fungal proteins that do not match, or match only partially, will be found by the tblastn (tfastx) search. tfastx will be slower but much less sensitive to frameshift errors in the assembly.
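As a rough illustration of the two steps above, here is a minimal Python sketch, not a definitive pipeline. It assumes the blastp results were saved in BLAST+ tabular format with the columns `qseqid sseqid pident qstart qend qlen evalue`; the file names in the comments, the function names, and the threshold values (40% identity, 80% query coverage, E-value 1e-5) are all hypothetical choices for the example. The script lists the proteins that have no sufficiently good hit, so they can be passed to tblastn (or tfastx) against the other assembly.

```python
import io

# Assumed upstream commands (file names are placeholders):
#   makeblastdb -in fungus2_proteins.faa -dbtype prot -out fungus2_prot
#   blastp -query fungus1_proteins.faa -db fungus2_prot -evalue 1e-5 \
#          -outfmt "6 qseqid sseqid pident qstart qend qlen evalue" \
#          -out f1_vs_f2.tsv
# Assumed downstream commands for step (2):
#   makeblastdb -in fungus2_assembly.fna -dbtype nucl -out fungus2_genome
#   tblastn -query f1_unmatched.faa -db fungus2_genome -evalue 1e-5 \
#           -outfmt 6 -out f1_unmatched_vs_f2_genome.tsv


def well_matched_queries(tabular_handle, min_identity=40.0,
                         min_coverage=0.8, max_evalue=1e-5):
    """Return IDs of queries with at least one hit passing all thresholds.

    Coverage is computed per alignment (HSP), not merged across HSPs.
    """
    matched = set()
    for line in tabular_handle:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        qseqid, _sseqid, pident, qstart, qend, qlen, evalue = line.split("\t")
        coverage = (int(qend) - int(qstart) + 1) / int(qlen)
        if (float(pident) >= min_identity
                and coverage >= min_coverage
                and float(evalue) <= max_evalue):
            matched.add(qseqid)
    return matched


def read_fasta(handle):
    """Yield (identifier, sequence) pairs from a FASTA handle."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            header = line[1:].split()
            name, chunks = (header[0] if header else ""), []
        elif line:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def unmatched_proteins(fasta_handle, matched_ids):
    """Return proteins with no hit, or only weak or partial hits."""
    return [(name, seq) for name, seq in read_fasta(fasta_handle)
            if name not in matched_ids]


if __name__ == "__main__":
    # Tiny in-memory example: protA is fully covered, protB only partially,
    # protC has no hit at all.
    hits = io.StringIO(
        "protA\tf2_001\t95.0\t1\t100\t100\t1e-50\n"
        "protB\tf2_002\t90.0\t1\t30\t100\t1e-10\n"
    )
    proteins = io.StringIO(">protA\nMKV\n>protB\nMLL\n>protC\nMAA\n")
    matched = well_matched_queries(hits)
    for name, seq in unmatched_proteins(proteins, matched):
        print(f">{name}\n{seq}")
```

The same script can be run in the other direction (fungus2 proteins against fungus1) by swapping the input files. The thresholds would need tuning to the divergence between the two strains.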


guillaume.rbt replied, 14 months ago:

Thank you, Bill, for your answer. I will try that.