Hi everybody,
I'm trying to download the GI list of plant taxid to run blastx against only plant sequences of nr database. But, I found several plant taxids, including green plants, flowering plants, land plants, ... with different number records. Also, it sounds that some records is the same between two different taxids, so I cannot download more than one because of redundancy. Could you please help me to make right decision? Also, please let me know is there any way to count the number of downloaded GI to make sure the related GI list was completely downloaded. Sorry, if you find this post is so basic, but it's a kind of challenge for me as a beginner. Thanks
Thanks friend. I would like to download all plant record to make plant specific nr database. As I mentioned in post, there are several plant taxid (like, green plants, flowering plants, land plants, ... ) that sounds that some records are the same among them, so there is redundancy with downloading GI list of all related plant taxids. Could you please let me know if there is a way to detect identical GI and remove them?