Question

SSRs abundance calculation

0

Entering edit mode

7.0 years ago

moohit21 • 0

Hi everybody, I have a .misa file and i want to calculate abundace of classes of ssrs from that misa file, for example in dinucelotide repeats which one is repeating maximum time and same in tri, tetre,penta and hexanucleotides. is there any software or any script which can count this?

Thanks in advace for the help.

RNA-Seq ssrs abundance • 1.5k views

ADD COMMENT • link updated 7.0 years ago by Tm ★ 1.1k • written 7.0 years ago by moohit21 • 0

1

Entering edit mode

Do you want the abundance of repeat sequence or abundance of mono, di, tri, tetra etc repeats (not sequences just length of repeat)?

If you are interested in the abundance of the length of repeats then it's pretty simple. There is one column SSR type in which p1 means mono-nucleotide repeat p2 means di-nucleotide and so on. So open the file in excel and make a pivot table of SSR type column you will get repeat lengthwise abundance.

ADD REPLY • link 7.0 years ago by Nitin Narwade ★ 1.6k

score 0 · Answer 1 · 2018-07-11

0

Entering edit mode

7.0 years ago

Tm ★ 1.1k

When you run misa.pl on your scaffold file, two files are generated:

.misa file
.statistics file

So you can get different types of repeats along with the times they appeared in .statistics file

ADD COMMENT • link 7.0 years ago by Tm ★ 1.1k

0

Entering edit mode

Hey, I tried running a FASTA file in MISA using the below command:

misa.pl file.fasta

and it says

Use of uninitialized value $total in concatenation (.) or string

How do I troubleshoot this? It would of great help if someone could help me out on this.

TIA

ADD REPLY • link 6.2 years ago by sruthi ▴ 40