Question: IPA (Improved Phased Assembly) output questions
0
gravatar for slin023
4 weeks ago by
slin0230
slin0230 wrote:

Hello, I just finished IPA assembly for my genome assembly (https://github.com/PacificBiosciences/pbipa), but I received two output files. The tutorial didn't mention anything, I checked two fasta files, one is

  1. final.a_ctg.fasta, it has id named " >hap_ctg.000084F_1 HAPLOTIG", "hap", I assume is haploid?

The other is 2.final.p_ctg.fasta, it has id named ">ctg.000071F_1"

Can someone explain the difference between two files, and what the letters and numbers mean, please?

(ex: hap_ctg.000084F_1, what is ctg? what is 00084? what is F_1?, for two output files, what do "a" and "p" represent in fasta name?)

pacbio assembly genome • 113 views
ADD COMMENTlink modified 4 weeks ago by h.mon31k • written 4 weeks ago by slin0230
0
gravatar for h.mon
4 weeks ago by
h.mon31k
Brazil
h.mon31k wrote:

Some of this information can be found at an old Falcon documentation page:

The final output of this step is a fasta file of all of the primary contigs, p_ctg.fa as well as an associated contig fasta file, a_ctg.fa that consists of all of the structural variants from the primary contig assembly.

A cursory look at the wiki revealed no clues, I think you could open an issue at the IPA github page asking for better documentation - just be sure to search extensively before doing so.

ADD COMMENTlink written 4 weeks ago by h.mon31k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1751 users visited in the last hour