Question

Mirdeep2 Error: FASTA reads file is not in accordance with the fasta format specifications

0

Entering edit mode

9.5 years ago

Gabe Anderson ▴ 10

Hi All,

I get this error from while running mirdeep2. I have tried two data sets and the same error results. It says:

First line of FASTA reads file is not in accordance with the fasta format specifications
Please make sure your file is in accordance with the fasta format specifications and does not contain whitespace in IDs or sequences

***** Please check if the option you used (options c) designates the correct format of the supplied reads file earthwormShort1.fa *****

I ran a perl code to delete whitespaces. I also tried this same data sets on other tools like sRNAbench and I had no error. Do I need to tweak the fasta file in a particular way for mirdeep2 to work with it?

Thanks!

RNA-Seq software-error • 6.2k views

ADD COMMENT • link updated 2.7 years ago by asalimih ▴ 60 • written 9.5 years ago by Gabe Anderson ▴ 10

0

Entering edit mode

You should post the first line of the FASTA file.

ADD REPLY • link updated 5.6 years ago by Ram 45k • written 9.5 years ago by Matt Shirley 10k

0

Entering edit mode

Here is a few line from the fasta file. I believe Mirdeep wants it formatted in a some kind of format...

>cel1_count=3
TGCCTTGTCTGTCCTAAAAATC
>cel2_count=9
GTTAAGTGGGAAACGATGT
>cel3_count=7
CCGACCTTGAAATACCAC
>cel4_count=7
TAGAAATCCACTATGCTTTGG
>cel5_count=5
CGCGGGTGAGCAGCCTGGTAGCTCGTC
>cel6_count=3
TCCTGTTTTGTAATCGGCTGCA
>cel7_count=4
TACCACGTCCAAGGAAGGC
>cel8_count=3
GGCCGCGTGGCCTAATGGATAAGG

Kindly note that the first header line is not indented as shown. I cant tell why this note editor indents it after I hit submit.

ADD REPLY • link updated 5.6 years ago by Ram 45k • written 9.5 years ago by Gabe Anderson ▴ 10

score 1 · Answer 1 · 2020-02-28

1

Entering edit mode

5.4 years ago

zhang.firework ▴ 10

$ dos2unix reads.fa

sovled my problem!

ADD COMMENT • link 5.4 years ago by zhang.firework ▴ 10

0

Entering edit mode

This solved my problem too. the problem is the trailing ^M at each line of the file (you can see it using cat -v file.fa | head). The ^M is a carriage-return character which originates from windows. dos2unix replaces it with a single newline. it can be installed using apt-get install dos2unix

ADD REPLY • link 2.7 years ago by asalimih ▴ 60

Ram · Answer 2 · 2016-01-15

0

Entering edit mode

9.5 years ago

Matt Shirley 10k

You should look at the file format description in the documentation. It looks like the mapper.pl script expects the FASTA identifiers to contain two underscores which delimit three fields such as:

>PAN_123456_x969696
ATACAATCTACTGTCTTTCCT

ADD COMMENT • link updated 5.6 years ago by Ram 45k • written 9.5 years ago by Matt Shirley 10k

1

Entering edit mode

Thanks for your response, but then, how can we change our .fa files to fit the mirdeep2 description?

ADD REPLY • link 5.4 years ago by jomagrax ▴ 40