Quality check of sequences in fasta format
0
0
Entering edit mode
4.0 years ago
Fid_o ▴ 40

Hello,

I have .FASTA files of bacterial genomes. I want to check quality of the sequences. Which program accepts quality analysis of .FASTA files?

Cheers!

alignment assembly genome • 2.6k views
ADD COMMENT
0
Entering edit mode

What kind of quality are you referring to?

If it means how "complete" is the genome then there are tools like BUSCO for that purpose.

ADD REPLY
0
Entering edit mode

@genomax, I mean to check the quality like we would check using FASTQC. I realise FASTQC takes in Fastq files, I have .FASTA files.

ADD REPLY
0
Entering edit mode

You can't. Fasta format does not allow for encoding of quality scores.

Note: There is an [fasta/qual format] which basically splits a fastq files into sequence/quality files (used by 454 I think). I don't think that is very prevalent. Do you have files like that by any chance?

ADD REPLY
0
Entering edit mode

@genomax I don't have the fasta/qual files, it's just the fasta files.

This being the case, can I safely convert fasta files to fastq for purpose of FASTQC use? My first reservations with conversion would be possibility of introducing errors to the files.

ADD REPLY
1
Entering edit mode

can I safely convert fasta files to fastq for purpose of FASTQC use?

While you can do that it is meaningless in terms of analysis. You would set an artificial Q scores for each base. BTW, why do you need quality scores for your fasta files?

ADD REPLY
0
Entering edit mode

@genomax. Thank you for your replies. I needed quality check for my SNP analysis on these sequences.

ADD REPLY
1
Entering edit mode

Not much you can do if you just have fasta sequence. Look for multiple reads (bases) supporting a particular call.

ADD REPLY
0
Entering edit mode

Thank you for that help and insight. Absolutely helpful.

ADD REPLY

Login before adding your answer.

Traffic: 2701 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6