I have a txt file with sequences
1 TACCCTGTAGAACCGAATTTGT miRNA mmu-mir-10b PM 2 GCATTGGTGGTTCAGTGGTAGAATTCTCGCCT tRNA Mus_musculus_tRNA-Gly-GCC-4-1 PM 3 TACCCTGTAGATCCGAATTTGT miRNA mmu-mir-10a PM 4 GCATTGTGGTTCAGTGGTAGAATTCTCGCCT tRNA Mus_musculus_tRNA-Gly-GCC-2-2 IM 5 ACCCTGTAGAACCGAATTTGT other other NA 6 TACCCTGTAGAACCGAATTTG other other NA 7 GCATTGGTTCAGTGGTAGAATTCTCGCCT tRNA Mus_musculus_tRNA-Gly-GCC-2-7 IM 8 GCATTTGTGGTTCAGTGGTAGAATTCTCGCCT tRNA Mus_musculus_tRNA-Gly-GCC-4-1 IM 9 TACCCTGTAGAACCGAATTTGTG miRNA mmu-mir-10b PM 10 GGTGAATATAGTTTACAAAAAACATTAGACTGTGAATC tRNA tRNA-His IM
I'd like a count matrix based on the 3rd value in each line such that I have something like. What's the best way to do this?
mmu-mir-10b 2 tRNA-His 1 other 2