Question: Cis- eQTL Question
0
gravatar for mms140130
2.8 years ago by
mms14013060
mms14013060 wrote:

I'm trying to do a cis eqtl analysis, so I have a linear regression where gene expression is the dependent variable and the snps are the independent variable , I have 20,000 genes with 1000 patients and 700,000 snps with 1000 patients , I need to reduce the dimension of the snps so I will choose the snps that is 1000 bp above TSS and 1000 bp below TSS of the gene (cis-eqtl) then I will combine the snps that are above TSS (not sure how yet ???)in one variable, and combine the snps that are below TSS in one variable in aim of reducing the snps in the model ,,and those two combined variables will be added to the model.

Does this make sense ?

snp gene • 1.3k views
ADD COMMENTlink modified 2.8 years ago by andrew.j.skelton735.9k • written 2.8 years ago by mms14013060
1

I think it would make more sense to use a published method such as FastQTL.

Although it might be very interesting to reinvent the wheel, often that's not necessary.

ADD REPLYlink written 2.8 years ago by WouterDeCoster43k

Thanks for your answer, but my advisor wants me to write my own code, so Im trying to think how to do that ??

ADD REPLYlink written 2.8 years ago by mms14013060

Then it makes sense to look up how published methods do their job, and try to replicate that.

ADD REPLYlink written 2.8 years ago by WouterDeCoster43k
2
gravatar for andrew.j.skelton73
2.8 years ago by
London
andrew.j.skelton735.9k wrote:

I'd highly recommend MatrixEQTL, it's phenomenally quick for the scale of tests you're describing. Follow the tutorial, once you've got your head around it, it should be relatively simple to apply to your data.

ADD COMMENTlink written 2.8 years ago by andrew.j.skelton735.9k

well, I have used MatrixEQTL before but my advisor doesn't want me to use it , he wants me to write my own code !!

ADD REPLYlink written 2.8 years ago by mms14013060
1

I don't see the point in reinventing the wheel. Your logic sounds fine, but typically cis distances are around 1e6 bases. Take a look at the matrixEQTL source if you're doing this in R, applying these operations one by one will take forever, but if you can get your head around the matrix operations, then that's what allows matrixEQTL to be so fast. There's other smaller speed ups such as parlapply, which are parallel implementations of apply.

then I will combine the snps that are above TSS (not sure how yet ???)in one variable, and combine the snps that are below TSS in one variable in aim of reducing the snps in the model ,,and those two combined variables will be added to the model.

This bit doesn't make a lot of sense... If you're just looking for cisQTLs then choose your distance from the gene (TSS is fine) apply the tests and be done. If you then want to look for trans associations (stuff that doesn't fall in your cis distance), then you'll need to test those too.

ADD REPLYlink written 2.8 years ago by andrew.j.skelton735.9k

Thank you for your answer ..

ADD REPLYlink written 2.8 years ago by mms14013060
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1754 users visited in the last hour