Question: (Closed) How to retrieve unique IDs from a txt file?
4.7 years ago by
tcf.hcdg60
European Union
tcf.hcdg60 wrote:

Hello

I have a text file containing sequence IDs. The file contains some duplicate IDs, and a few IDs occur more than twice. I want to write the unique IDs to one file and the repeated IDs to another. I would also like to know how many times each repeated ID occurs in the file.

I found the duplicated IDs using the following command:

$ grep '^>' id.txt | sort | uniq -d > dupid.txt

This gives me the duplicated IDs in the "dupid.txt" file. But it does not tell me which IDs occur more than twice, or how many times each of them occurs. Secondly, how do I find the unique IDs?

Please suggest how this can be handled.

Thanks in advance

 

duplicate sequence id • 1.2k views
written 4.7 years ago by tcf.hcdg60

See 'uniq -c'.
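
As a rough sketch of how that suggestion could be applied to the question above: `uniq -c` prefixes each distinct line with its number of occurrences, and `uniq -u` prints only the lines that are not repeated. The sample file `sample_ids.txt` and the output file names below are made up for illustration and do not come from the original post; substitute your own `id.txt`. The sketch also assumes, as in the question, that every ID line starts with `>`.

```shell
# Hypothetical sample input for illustration; use your own id.txt instead.
printf '%s\n' '>seq1' '>seq2' '>seq1' '>seq3' '>seq2' '>seq1' > sample_ids.txt

# Count how often each ID occurs (count first, then the ID),
# sorted so that the most frequent IDs come first.
grep '^>' sample_ids.txt | sort | uniq -c | sort -k1,1nr > counts.txt

# Keep only IDs that occur more than once, together with their counts.
awk '$1 > 1' counts.txt > repeated_counts.txt

# IDs that occur exactly once (uniq -u drops every repeated line).
grep '^>' sample_ids.txt | sort | uniq -u > unique_ids.txt

# Every distinct ID listed once, however often it occurs.
grep '^>' sample_ids.txt | sort -u > distinct_ids.txt
```

Which of the last two files you want depends on what "unique" means for you: `unique_ids.txt` holds IDs that were never duplicated, while `distinct_ids.txt` is the deduplicated list of all IDs.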

written 4.7 years ago by Pierre Lindenbaum 127k

Hello tcf.hcdg!

We believe that this post does not fit the main topic of this site.

This is mostly a basic Linux command question.

For this reason we have closed your question. This allows us to keep the site focused on the topics that the community can help with.

If you disagree please tell us why in a reply below, we'll be happy to talk about it.

Cheers!

written 4.7 years ago by Pierre Lindenbaum 127k
The thread is closed. No new answers may be added.
