Question: FASTA to SAM for Tablet Assembly Visulisations
0
gravatar for peflanag
3 months ago by
peflanag0
peflanag0 wrote:

Hi,

I'm very new to the world of terminal command and bioinformatics. I currently use ONT for WGS in my lab. I have the FASTQ and FASTA files of my sequence reads that have been basecalled and trimmed with albacore and porechop. y query to the community is:

The file containing the FASTA reads I have used with a reference FASTA file to perform nucmer to generate a delta file that I can input into the web app Assemblytics. However I am now wondering if I need to use assembled files first before using Assemblytics.

I am also using unicycler to align the long reads to generate an assembled file. The output of which is a FASTA file. I want to visualise the assembly in Tablet but the input format has to be SAM or indexed BAM. Can I easily convert the assembled FASTA file to one of these formats? If so how?

I am using macOS

Cheers,

Peter

ADD COMMENTlink modified 3 months ago • written 3 months ago by peflanag0
1

It may be best if you spent some time learning command line basics. After doing that you could use minimap2 to align your data to the reference which will generate alignments in BAM format. You can then visualize them in genome browser of your choice.

ADD REPLYlink modified 3 months ago • written 3 months ago by genomax65k

Cheers for the quick reply.

I'm using minimap 2 in the Unicycler command " unicycler -l FASTQ.gz -o OUT_DIR " when generating the aligned file. However the output is in FASTA and the visualiser Tablet wants it in SAM or indexed BAM. Thats why I'm wondering can I convert the aligned outputted FASTA into one of these other formats.

ADD REPLYlink written 3 months ago by peflanag0

unicycler is an assembly pipeline. If you look in read_alignment/ output folder there should be long_read_alignments.sam file (according to unicycler manual). You can try using that to visualize the data.

ADD REPLYlink modified 3 months ago • written 3 months ago by genomax65k

Oh cheers, I didn't see that in my output folder! I appreciate the help.

ADD REPLYlink written 3 months ago by peflanag0

So are you saying that you don't have that .sam file?

Edit: Looks like you will need to run unicycler with --keep 3 option to generate that file.

ADD REPLYlink modified 3 months ago • written 3 months ago by genomax65k

I was never using the --keep command. Which I assumed kept everything but can't seem to find the .sam file. I am re-running the unicycler command now with the --keep 3 command

ADD REPLYlink written 3 months ago by peflanag0

I tried again with --keep 2 and --keep 3 but it hasn't worked for either. I think there was a typo on the github website. the text said --keep two but the table said --keep 3. I have posted on the github issues page.

Cheers for your help!

ADD REPLYlink written 3 months ago by peflanag0
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1853 users visited in the last hour