What is the "M" letter in the singlets output file generated by CAP3?
7.2 years ago
Hi friends,

I made several assemblies with different k-mer sizes, removed the redundant contigs, and then used CAP3 to build a meta-assembly. As you know, CAP3 generates a contigs file and a singlets file as outputs. Please take a look at the following excerpt from the singlets file. I was wondering what the "M" letter is. Since the contigs and singlets files are usually pooled into the final assembly, could you please let me know whether this "M" character is problematic?

>c1
CGTACAGGGCGAGTTATTATGTTTTTGATAATCCATAAATCTTCTGTGTTCGTTTTGCAG
^MCCTGAATTTAAAACTAGAGGTTCCATCGAACCATCCAATTTTGGAAGCATGACATATTA
T^MTTCTGAAGCATCTCGTTAGAACAACATAAATCTAAATCATTAATACAAACCTCAAAGC
AT^MATGGAAATACAAACAAACACAAATAAACGCATAGGAGGAAGATACAGAATCGTTTAG
TTT^MCCTCAGCATGCCGCCCTTTGCTTCCCTTATATTATAGCTAGTTATGATGATGAAAA
GCCT^MCCTTAATCAATCTATTAAAATTATTATAAGATTTTCCTCCAATATCCGTAGTTTC
TTCAG^MCCATCTTTTTCCATTCTTGAGCCTTCTGTCTCATTTGTTTTCCCCTCTCTCCTT
CCATCA^MTTTCCTTAACAAGAGCCTCAATGTCATCACGTTTCACATCTTCATTGACTTCC
ATGCCCA^MTGCCCCAAGTTGTGCATGCATATCGACGATTCGTTTGCTGCTCAGCAAAGAA
AGGCCAAC^MAAATTACAGGAACACCACCATTTACGGTTTCAATCGTAGAATTCCACCCGC
AATGTGTTA^MAAAATAGCCCAATTGAAGGGTGAGAAAGCACTTGGTCTTGCGGACACCAA
CTTACTATTA^MACCCTCTATCCTTTATCTCTTCAAAATATTCTTGAGGCAAGATCGCAGA
ATCGTCAGTAC^MCCTCCACCACATCAGGCCTAAGAATCCATAAAAAAGGGTGTTTGCTAT
TTGCAAGACCCC^MAGGCGAATTCTTTCAGGTGCTCGTCCGACATTACAGTAATGCTACCG


Thanks a lot for your feedback.

How are you viewing this? I often see this when I open a Windows file: https://en.wikipedia.org/wiki/Newline

I see it in the terminal, not in Windows.

Run "file your.cap3.file" in the terminal. What's the output?

It returned

ASCII text, with CRLF, CR, LF line terminators
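
To see how many of each terminator the file actually contains, the report above can be broken down with a few lines of Python. This is only a minimal sketch: the function name count_line_terminators is made up for illustration, and it assumes the whole file fits in memory.

```python
def count_line_terminators(data: bytes) -> dict:
    """Count CRLF pairs, lone CR bytes and lone LF bytes in raw file content."""
    crlf = data.count(b"\r\n")          # Windows-style pairs
    cr = data.count(b"\r") - crlf       # CR bytes not followed by LF
    lf = data.count(b"\n") - crlf       # LF bytes not preceded by CR
    return {"CRLF": crlf, "CR": cr, "LF": lf}


# Usage (the file name is a placeholder):
# with open("your.cap3.file", "rb") as fh:
#     print(count_line_terminators(fh.read()))
```

A non-zero "CR" count means there are stray carriage returns, which the terminal displays as ^M.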

The CR and LF terminators are your problem: the ^M you see is a carriage return (CR). Clean up your file with, e.g., dos2unix: http://linuxcommand.org/man_pages/dos2unix1.html
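
If dos2unix is not available, or if it leaves some ^M characters behind, a small Python script can delete every carriage return directly. This is a rough sketch rather than a tested pipeline step: the function names and file names are hypothetical, and it assumes the CRs are stray characters sitting next to or inside LF-terminated lines (as in the excerpt in the question), not the only line separator in the file.

```python
def strip_carriage_returns(data: bytes) -> bytes:
    """Delete every CR (0x0D) byte; LF line endings are left untouched."""
    return data.replace(b"\r", b"")


def clean_file(src: str, dst: str) -> int:
    """Write a CR-free copy of src to dst and return the number of CRs removed."""
    with open(src, "rb") as fh:
        raw = fh.read()
    cleaned = strip_carriage_returns(raw)
    with open(dst, "wb") as fh:
        fh.write(cleaned)
    return len(raw) - len(cleaned)


# Usage (both file names are placeholders):
# removed = clean_file("your.cap3.file", "your.cap3.clean.fasta")
# print(removed, "carriage returns removed")
```

Writing to a new file keeps the original CAP3 output intact, so the two can be compared before the cleaned one is pooled into the final assembly.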

Thanks Pierre.