How to get SNP 6.0 array data in R?
1
0
Entering edit mode
8.3 years ago
agicict ▴ 190

Hi.

I currently downloaded a number of SNP datasets from GEO (CCLE)

http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE36138

Also I downloaded Genomewide affymetrix 6.0 SNP array annotation data from affymetrix site.

http://www.affymetrix.com/support/technical/byproduct.affx?product=genomewidesnp_6

I imported the SNP 6.0 CEL files using affy package's ReadAffy.

Strangely, every CEL files has approximately 7,000,000 rows but affymetrix annotation data has around 900,000 rows.

What is right way to match probes?

R SNP annotation • 5.6k views
ADD COMMENT
0
Entering edit mode

You could use the crlmm Bioconductor package

ADD REPLY
5
Entering edit mode
8.2 years ago

The GenomeWideSNP_6 chip type has 6,892,960 probes (never changes). These probes are arranged in probe sets ("units") corresponding to 934,946 bi-allelic SNPs and 946,371 non-polymorphic single-probe CN loci. The exact number differ slightly between genome builds.

Check out the aroma.affymetrix R package, cf. http://aroma-project.org/. It has several ready pipelines for GenomeWideSNP_6, especially for copy-number analysis.

/Henrik
(author of aroma.affymetrix)

ADD COMMENT
0
Entering edit mode

Thank you for your kind explanation.

ADD REPLY

Login before adding your answer.

Traffic: 2417 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6