Question: Does Kover work well for inter-species comparisons?
0
gravatar for jacekzkominek
7 weeks ago by
jacekzkominek0 wrote:

Hi, I intend to run a relatively large-scale genotype-to-phenotype assessment (hundreds of genomes. dozens of phenotypes) and would like to use kover to provide targets for the bases of the phenotypes. My problem is that the dataset covers different species, rather than strains of the same species, so the sequences are much divergent, and so I don't think that running kover the 'default' way would work too well. To overcome that, I was thinking about using the amino acid sequences of proteins identified in each species, but I am not sure if kover would accept that as a valid input to the '--from-contigs' option? Alternatively, would it work fine if I passed the k-mer matrix for amino-acid sequences to '--from-tsv', rather than nucleotides? Thanks in advance for all your help!

kover • 141 views
ADD COMMENTlink modified 6 weeks ago • written 7 weeks ago by jacekzkominek0

Thank you, that helps and clarifies a lot!

ADD REPLYlink written 6 weeks ago by jacekzkominek0
3
gravatar for Alexandre Drouin
7 weeks ago by
Montreal, Canada
Alexandre Drouin90 wrote:

Hi Jacek,

The --from-contigs option won't work for amino-acid sequences. In that case, I would recommend precomputing the k-mer matrix and creating the dataset with --from-tsv. The entries of this matrix can really represent the presence/absence of anything, e.g., amino-acid sequences, point mutations, etc. The only thing to watch out for is that the feature identifiers must have the same length (what is under the "kmers" column here http://aldro61.github.io/kover/doc_input_formats.html#k-mer-matrix).

I hope that this helps!

Alex

ADD COMMENTlink written 7 weeks ago by Alexandre Drouin90
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 588 users visited in the last hour