Question: Issue with pseudogenes in a bacterial genome
3 months ago, hjafar wrote:

I submitted a bacterial genome to NCBI and received the following message about pseudogenes:

Before we can assign your accession number, there are a few issues that require your attention.

We have annotated your genome and found that the number of pseudogenes is greater than 10% of the called gene features. This suggests that there may be a problem with your sequence and should be investigated. Please check it to determine whether you need to submit a new assembly. Let us know if you would like to proceed with the existing information.

Note that the vast majority of the pseudogenes are due to frameshifts, which suggests that there are insertions and deletions in your sequence that are causing the excess pseudogenes. You should take this into account when looking into any problems with the genome.

Your genome has 775 pseudogenes out of 5806 CDS genes. 386 of these are frameshifts. 441 are incomplete.

We have uploaded a .sqn file called 'Current.sqn' to your portal submissions so you can see the annotation. We also set the FIX button(s) so that you can upload revised assemblies if necessary.
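As a quick sanity check on the figures quoted in the message, the arithmetic can be sketched as below. The variable names are mine, and the 10% threshold is taken from the wording of the message itself, not from any NCBI tool.

```python
# Figures copied from the NCBI message above.
pseudogenes = 775
cds_genes = 5806
frameshifted = 386
incomplete = 441

# Threshold as stated in the message ("greater than 10%").
threshold = 0.10

fraction = pseudogenes / cds_genes
exceeds_threshold = fraction > threshold

# 386 + 441 is larger than 775, so the two categories cannot be disjoint.
# By inclusion-exclusion, at least this many pseudogenes carry both flags.
min_overlap = max(0, frameshifted + incomplete - pseudogenes)

print(f"Pseudogene fraction: {fraction:.1%}")
print(f"Exceeds 10% threshold: {exceeds_threshold}")
print(f"Pseudogenes that are both frameshifted and incomplete (at least): {min_overlap}")
```

So roughly 13% of the called CDS features are flagged, and about half of the flagged features involve a frameshift.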

How can I solve this problem? Is there a procedure I should follow to fix it?

Thanks in advance,


Is this a PacBio or MinION assembly? Uncorrected indels in a long-read assembly cause frameshifts, which are then annotated as spurious pseudogenes. The usual solution is to run Illumina-based error correction (see Mick Watson's blog: http://www.opiniomics.org/a-simple-test-for-uncorrected-insertions-and-deletions-indels-in-bacterial-genomes/ )

written 3 months ago by Philipp Bayer
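One way to check for the indel problem mentioned above is to compare each predicted protein's length with the length of its best hit in a protein database: proteins truncated by a frameshift come out much shorter than their hits. The following is only a rough sketch of that idea, not the blog's own code. It assumes you already have a tab-separated table with query ID, query length, and subject length in the first three columns (a hypothetical layout; adjust it to whatever your aligner outputs), and the 0.9 cutoff is an arbitrary illustrative choice.

```python
from typing import Iterable


def truncated_fraction(rows: Iterable[str], min_ratio: float = 0.9) -> float:
    """Return the fraction of proteins whose length is less than
    min_ratio times the length of their best database hit.

    Each row is assumed to be tab-separated:
    query_id, query_length, subject_length (hypothetical layout).
    """
    total = 0
    truncated = 0
    for line in rows:
        line = line.strip()
        # Skip blank lines and comment lines.
        if not line or line.startswith("#"):
            continue
        _query_id, qlen, slen = line.split("\t")[:3]
        ratio = int(qlen) / int(slen)
        total += 1
        if ratio < min_ratio:
            truncated += 1
    return truncated / total if total else 0.0


# Made-up example rows, for illustration only.
example_rows = [
    "geneA\t300\t300",
    "geneB\t120\t310",
    "geneC\t295\t300",
    "geneD\t150\t305",
]
print(truncated_fraction(example_rows))
```

A well-polished assembly should give a small fraction here; a large fraction of short proteins points to uncorrected indels rather than real pseudogenes.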

See @h.mon's answer in this thread: Pseudogenes in bacteria genome

written 3 months ago by genomax