Aligning multiple species for primer
5.5 years ago

Good evening,

I am relatively new to bioinformatics and have been tasked with aligning roughly 50 to 60 closely related species with different formae speciales to find a unique genetic sequence that can be used as a probe. The species are all fungi, if that matters. The complete assemblies are available from NCBI in FASTA format. Given this:

1) What kind of program can I use to do this?
2) What kind of computer do I need to do this?

I have tried the MUSCLE tool in AliView, UGENE and MAFFT on a test run of about 3 of these formae speciales, but I always get an "out of memory" error.

Thank you.

alignment assembly • 1.2k views

Even extremely powerful computers will struggle to align that many fungal genomes. Aligning whole genomes isn’t trivial (or particularly accurate).

Do you already have a region you want to probe or are you just looking for any conserved site in the genomes?


I've been given a hint that I should focus on noncoding regions, since genes in the coding regions are most likely 99% similar between closely related f. sp.


1) Maybe a k-mer-based approach is an option.

2) With k-mers, any computer will do.

A k-mer tool: http://www.genome.umd.edu/jellyfish.html
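
To make the k-mer idea concrete, here is a minimal Python sketch of what "unique k-mers" means: collect the k-mers of the target sequence, then discard any that also occur (on either strand) in the other sequences. It assumes small sequences held in memory as plain strings; the function names `kmers`, `canonical` and `unique_kmers` are made up for illustration and are not part of Jellyfish. For whole fungal assemblies a dedicated k-mer counter would be needed instead.

```python
# Illustrative only: toy in-memory version of "k-mers unique to one genome".
# All function names here are hypothetical, not from any library.

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def kmers(seq, k):
    """Return the set of k-mers in seq, skipping any with non-ACGT characters."""
    seq = seq.upper()
    found = set()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= set("ACGT"):
            found.add(kmer)
    return found


def canonical(kmer):
    """Return the lexicographically smaller of a k-mer and its reverse complement."""
    reverse_complement = kmer.translate(_COMPLEMENT)[::-1]
    return min(kmer, reverse_complement)


def unique_kmers(target, others, k):
    """Return canonical k-mers present in target but absent from every sequence in others."""
    candidates = {canonical(x) for x in kmers(target, k)}
    for seq in others:
        candidates -= {canonical(x) for x in kmers(seq, k)}
    return candidates


if __name__ == "__main__":
    # Toy sequences, not real genomic data.
    target = "ACGTACGTTTGA"
    others = ["ACGTACGT"]
    print(sorted(unique_kmers(target, others, 4)))
```

In practice k would be much larger than in this toy example (probe-length or close to it), and the surviving k-mers would still need to be checked against a wider database before being trusted as a specific probe.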


Thank you, I will try.


Probe for what? What is the purpose of the probe?

You may want to look at databases of orthologous genes, such as OrthoDB or OMA, where the alignment has already been done for you.


To detect a specific organism while avoiding false positives as far as possible. In other words, it is for agricultural use.


Fungal amplicon taxonomy focuses heavily on the ITS region. Isn't that an option for you?


Unfortunately, I am barred from using the ITS region for this specific paper.

5.5 years ago
h.mon 35k

There is some literature on genomic identification of formae speciales. For fungal pathogens, a good approach seems to be identifying effector protein genes. See these papers:

Effector profiles distinguish formae speciales of Fusarium oxysporum.

Comparative genomics and prediction of conditionally dispensable sequences in legume-infecting Fusarium oxysporum formae speciales facilitates identification of candidate effectors.

Use of Comparative Genomics-Based Markers for Discrimination of Host Specificity in Fusarium oxysporum.

Initially, your problem was too loosely defined, and with each suggestion you added new constraints. Please be precise from the beginning; you will get better answers, faster.


Thank you very much, I will take note.
