Hi everyone, I would like to create a pool of shRNA sequences (20nts each) for the specific set of 1000 genes, making it up to 30000 sequences in total, the gene sequences are available in Genbank, but I don't have a single file with all of them together. There are online tools that design shRNAs for single genes, but I can't figure out how to do this for 1000 sequences. My confusion comes from the fact that although I perfectly see the task, I'm new to the bioinformatics field and do not immediately see the way to design such pipeline.
I would appreciate it if you can provide some answers to my questions: 1. Where do I start? 2. What software/tool or combination of tools should I use for these large datasets? 3. What is the format of the input file? Is there a tool to assembly such file automatically or it has to be done manually?
Thank you very much!!!
Which species are you targeting ? For such a large number, it might be cheaper to use an existing library if available.
Thank you for your reply. I'm planning to target diatom algae.