Question

Analysis Of Putative Causitive Mutations

2

Entering edit mode

13.0 years ago

Kamila001 ▴ 120

I have set of predicted causitive missense mutations by using Insilico tools like SIFT, SNAP, SNPEffect, Polyphen etc.. The question is What would be the best way to further analyze them (bioinformatics tools or statistical methods or grafically)? e.g. could do the gene ontology analysis to see their biological role.

Any suggestions about softwares and literature would be useful.

Thanks in advance.

function statistics • 3.1k views

ADD COMMENT • link updated 13.0 years ago by Khader Shameer 18k • written 13.0 years ago by Kamila001 ▴ 120

Ram · Answer 1 · 2011-04-23

Hi Kamila,

This is Pauline Ng, creator of SIFT (please note new website located at: http://sift-dna.org). The answer would depend on what type of analysis you are doing.

Genome-wide: If you are doing a genome-wide analysis, for seeing what genes in a Drosophila species are affected and have evolved differently, then use GO ontology like GOrilla to find any patterns. Also, if it's a genome paper, use Ka/Ks to see if the genes which contain these predicted-deleterious predictions are under relaxed selection, or the gene family size to see if the substitutions occur in large gene families. Ka/Ks and gene family size can be obtained from Ensembl Biomart. I like scanning the list of genes manually, see if any patterns strike out at me, and then code it up accordingly.

Finding a specific gene: If you are looking for a specific gene (for example, looking for a disease gene, and you know the disease), then you would list the gene descriptions, and whether any are in OMIM (just use the checkboxes on the website), and then look through them manually to see if any are interested hits. This may or may not get what you want, if you know the disease name, then use tools like Endeavour where you can enter the disease, or genes involved in the disease, and then all your genes that were predicted to affect protein function, for prioritization of candidate disease genes. There are a class of tools out there that do this (Endeavour, G2D, etc.)

Hope this helps. Again, please check out our new website at http://sift-dna.org where I am adding the latest updates.

Best, Pauline Ng

score 2 · Answer 2 · 2011-04-23

Short answer: you can perform impact of mutation in transcriptome, epistasis analysis, pathway analysis, protein-protein interaction analysis, TF analysis, eQTL analysis, phenotype analysis are some potential ideas that you could explore.

Long answer: I would point you to three recent review articles and one community articles that discuss various potential ideas on post-gwas bioinformatics and statistical strategies. Most of the ideas are discussed with atleast one example, so you can definitely get a clear idea by reading the key-papers discussed in the review articles.

Review 1: Prioritizing GWAS Results: A Review of Statistical Methods and Recommendations for Their Application

Review 2: Bioinformatics challenges for genome-wide association studies

Review 3: Using biological knowledge to uncover the mystery in the search for epistasis in genome-wide association studies.

Post-gwas article : Principles for the post-GWAS functional characterisation of risk loci