Question: Automate Illumina GenomeStudio With Python Script
2.7 years ago, travis.m.couture wrote:

Hey everyone. I am currently working on automating a SNP pipeline and need some help figuring out how to interface Python with GenomeStudio. I want to be able to pass input from a Python script into GenomeStudio so that the entire analysis process is automated and manual intervention is rarely required. Does anyone have any experience with this? I've done a few Google searches and haven't come up with anything.

Thanks, Travis Couture

modified 2.3 years ago by blrhgeruser1 • written 2.7 years ago by travis.m.couture

I've been in this scenario before, and if you have a choice of language, I'd recommend using bash shell scripts (Unix/Linux) to build the pipeline workflow for your application. In my experience Python can be difficult to use for multi-processing or for driving other processes, both because its global interpreter lock limits multi-threading and because it is usually not the most efficient language for subprocessing.

That being said, here is the module you will have to use if you utilize Python: https://docs.python.org/3/library/subprocess.html

You'll likely also want to be familiar with the multiprocessing module.
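For anyone following along, here is a minimal sketch of what a bash-style "run each step, stop on the first failure" workflow might look like with the subprocess module. The `run_steps` helper is a name made up for this example, and the steps just launch the Python interpreter as placeholders for real tools. It is a sketch, not a tested GenomeStudio recipe.

```python
import subprocess
import sys
from typing import List, Sequence


def run_steps(steps: Sequence[List[str]]) -> List[str]:
    """Run each command in order and return the captured stdout of each.

    run_steps is a hypothetical helper for illustration only.
    """
    outputs = []
    for cmd in steps:
        # check=True raises CalledProcessError on a non-zero exit code,
        # so later steps never run after a failure (similar to `set -e`).
        result = subprocess.run(cmd, capture_output=True, text=True, check=True)
        outputs.append(result.stdout)
    return outputs


if __name__ == "__main__":
    # Placeholder steps: swap in the real command lines for your tools.
    steps = [
        [sys.executable, "-c", "print('step 1 done')"],
        [sys.executable, "-c", "print('step 2 done')"],
    ]
    for out in run_steps(steps):
        print(out, end="")
```

Passing each command as a list rather than a single string avoids shell quoting problems, and it works the same way on Windows and Unix.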

written 2.7 years ago by Steven Lakin

Unfortunately I'm working in a Windows environment for this project. Could the same approach for bash be taken in MS-DOS? If so, where would I start? If not, I will have to read up on the multiprocessing module. I am familiar with the subprocess module.

written 2.7 years ago by travis.m.couture

I called Illumina and they were no help. I am still researching this but I'm hoping someone out there can give me some direction.

written 2.7 years ago by travis.m.couture

Apologies for my vagueness - I don't have experience with your exact situation, but here are my thoughts:

I've used Python for this, but only in combination with other bash scripts on a Unix-based OS. Python's subprocess methods work fine, but it's probably easier if you find a way to create and run bash scripts. There are several resources (though I don't have experience with them personally) for emulating bash on Windows if you decide to go that route.

Also, what kind of input does GenomeStudio expect? Theoretically, if the second application in the pipeline can read from stdin, simply have Python print the required input (this goes to stdout) and the downstream application can capture this.
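A minimal sketch of the stdin idea, assuming the downstream program really does read from standard input (whether GenomeStudio does is not established in this thread). The `pipe_text` helper is a hypothetical name, and the downstream command here is a small Python one-liner standing in for the real application.

```python
import subprocess
import sys
from typing import List


def pipe_text(cmd: List[str], text: str) -> str:
    """Send `text` to the stdin of `cmd` and return what it wrote to stdout.

    pipe_text is a hypothetical helper for illustration only.
    """
    result = subprocess.run(
        cmd,
        input=text,           # delivered to the child process on stdin
        capture_output=True,  # collect the child's stdout and stderr
        text=True,            # work with str rather than bytes
        check=True,           # raise if the child exits with an error
    )
    return result.stdout


if __name__ == "__main__":
    # Stand-in downstream program: counts the lines it receives on stdin.
    downstream = [
        sys.executable,
        "-c",
        "import sys; print(len(sys.stdin.read().splitlines()))",
    ]
    print(pipe_text(downstream, "rs1\nrs2\nrs3\n"), end="")
```

If the downstream tool only accepts input through its GUI or through files, this approach does not apply and writing an input file for it is the more likely route.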

Best of luck!

written 2.6 years ago by Jonathan King
2.3 years ago, blrhgeruser1 (United States) wrote:

Not exactly automating GenomeStudio, but what you could do is turn on AutoConvert on the iScan instrument and have it generate gtc files for each sample in your experiment. Then you can use the recently published open-source Python gtc file parser from Illumina to write a script for generating reports or whatever else you want to do with the data: https://github.com/Illumina/BeadArrayFiles. In the examples folder of that repo, there's an example script that lets you generate final report files from gtc files, thereby letting you avoid having to use GenomeStudio at all.

Or if you're working with idat files instead, what you could do is install and run AutoConvert off the instrument and write some Python code to go from idat + bpm + egt --AutoConvert--> gtc --python gtc file parser--> reports
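As a rough sketch of the last stage (gtc files to a report), the driver below walks a folder of gtc files and writes one tab-delimited row per sample. It deliberately does not call the BeadArrayFiles parser directly: `parse_gtc` is a hypothetical callable that you would implement on top of that library, and `write_report` and the report columns are likewise invented for this example.

```python
import csv
from pathlib import Path
from typing import Callable, Dict, Union

PathLike = Union[str, Path]


def write_report(
    gtc_dir: PathLike,
    parse_gtc: Callable[[Path], Dict[str, object]],
    out_path: PathLike,
) -> int:
    """Write one tab-delimited row per gtc file and return the row count.

    parse_gtc is a hypothetical, user-supplied function that turns one gtc
    file into a dict of report fields (for example, built on the Illumina
    BeadArrayFiles parser). Every dict must use the same keys.
    """
    # Sort so the report order is stable from run to run.
    rows = [parse_gtc(path) for path in sorted(Path(gtc_dir).glob("*.gtc"))]
    if not rows:
        return 0  # nothing to report; no output file is created

    with open(out_path, "w", newline="") as handle:
        writer = csv.DictWriter(
            handle, fieldnames=list(rows[0].keys()), delimiter="\t"
        )
        writer.writeheader()
        writer.writerows(rows)
    return len(rows)
```

Keeping the parser as a parameter means the folder-walking and report-writing code can be exercised with a dummy parser before any real gtc files are available.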

written 2.3 years ago by blrhgeruser1

I didn't know about this. Thanks for the info! We'll give it a try.

written 2.3 years ago by travis.m.couture