Question: retro-engineered annotation on genome assembly
0
gravatar for guillaume.rbt
22 months ago by
guillaume.rbt450
France
guillaume.rbt450 wrote:

Hi everyone,

I'm working on a fungus species, on which I have two genome assemblies, performed on two different strains, and also one annotation for each assembly.

By crossing the annotations peptide sequences results, with BDBH analysis, I get some common proteins, and some specific to each strain.

I know, by blasting them on genomes assembly, that most of the "strain specifics" genes are however also present on the other strain genome. (certainly due to the different annotation software)

What I would like to do is to retrieve the sequences of one strain specifics genes on the other strain genomes, so that I complete the annotation.

Would anybody have a clue on how doing such a thing?

Thanks

ADD COMMENTlink modified 22 months ago by Bill Pearson820 • written 22 months ago by guillaume.rbt450
1
gravatar for Bill Pearson
22 months ago by
Bill Pearson820
Bill Pearson820 wrote:

A possible strategy:

(1) blastp all of fungus1 vs fungus2 and vice versa. Find the proteins in fungus1 that do not have significant hits (possibly with a percent identity and coverage threshold) in fungus2, or have hits that only cover part of the protein, and vice-versa.

(2) take the proteins in fungus1 and fungus2 that do not have a match in the other fungus, and tblastn (tfastx) them against the other fungal assembly. I would expect that many of the fungal proteins that do not match, or match only partially, will be found by the tblastn (tfastx) search. tfastx will be slower but much less sensitive to frameshift errors in the assembly.

ADD COMMENTlink written 22 months ago by Bill Pearson820

thank you Bill for your answer, I will try that

ADD REPLYlink written 22 months ago by guillaume.rbt450
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1182 users visited in the last hour