We are doing RNAi based screenings. There, we generate a lot of data for GenXY of different types (many microscopically images, many continuously growing data from different experiments...). In addition to this, there are a lot of sequence, expression, functional data information existing from other sources, which needs to be linked to our tested genes. At the moment, all these data are stored in hundreds of excel files. Now, we would like to create a database out of the files and display all information in a proper way. Since I know Python very well, I thought about using Django. Do you thing that Django is suitable for this? Does anybody used Django successfully for such a purpose? We also would like to include some applications, such as BLAST, ClustalW...
Any tips are very appreciated! Stefanie
Thanks for all the answers! I'll try to make things more clear:
We cloned about 1000 RNAi constructs. Each of this constructs has an unique ID. Already at this step, there is a lot of information (primer sequence, RNAi sequence, FL sequence, contig memberships, BLASTX...). After this, the constructs are tested in plants by particle bombardment. Each experiments consist of about 16 constructs plus some controls. To each of them, we get a value plus many images. If a construct shows some effect, it's will be repeated at least 4 more times and we'll get at least 4 more values (the final number of values is unknown). After we have at least 5 values, we calculate some statistics. If it's significant, we make transgenic plants. Of course to each construct we want to add as much information as possible (NCBI, Proteomics, whatever. This is allways growing.).
ALL this information, we have in different excel files and we would like to show it in a web interface. I started already to combine all this information in a few excel files. I agree, this is the most laborious work. The advantages is, that I'm the technician which produced the data, so I know very good what is what. I created 2 excel files, one with all the experimental data per unique ID and one with all other information per unique ID. I think this is a good basic to add it to a database.
What I would like to have: Paste anything (the unique ID, all values<50%, an image link, a function, a TCA...) and get ALL linked information. For instance, an overview about the corresponding unique ID(s), plus some tabs which contain all the other information (tab for experimental data, tab for sequence information, tab for images and so on). Maybe something like this: http://www.gabipd.org/database/cgi-bin/GreenCards.pl.cgi?BioObjectId=664184&Mode=ShowBioObject&QueryKey=fb2a0d954e12a90b7810b5d445957eb0&Start=1&Rows=50
In addition, it would be great, if data could be easily edited or commented.
I hope I made things more clear. Thanks again! Stefanie