I am working on non-model organism. I have called SNPs by read mapping illumina reads to reference genome(assembled contigs/scaffolds). I have VCF file with SNP predicted for each contig/scaffold. Recently Ensembl released non-chromosomal genome assembly and also have GTF file(gene prediction based on Ensembl annotation). Now I am trying to find out synonymous and nonsynonymous mutation rate. I find tools that only available for chromosomal level, Is there any script or tool available already for contig level?.
The VAASST pipeline can annotate variants. All you need is the annotation set in GFF3 format, the reference sequence, the variants, and the lengths of the chromosomes. Converting GTF to GFF3 should also be trivial.
Will this pipeline accept contig lengths?
Yes the --build flag is a tab delimited file that has 3 columns: 1. seqid/contig_name/scaffold 2. start position (1). 3. length of sequence