Question: How does a bioinformatician employ Python in data analysis?
gravatar for F
16 months ago by
F3.4k wrote:


When I am asked to mention for which I have recently used a programming language like Python in data analysis, I just can say that I used Python once to extract some genome coordinates from gene bank. But definitely this is not a wise answer and disappointing for a bioinformatician. How then a medium data analyst could use Python? I mean for which purpose as a good answer.

Thank you very much

bioinformatics python • 526 views
ADD COMMENTlink modified 16 months ago by Alex Reynolds27k • written 16 months ago by F3.4k

If someone is asking you in what context you've used something it's best not to answer that you've used it in some context in which you haven't (i.e., don't lie). If you start mentioning things you've never done then if they ask any follow up questions you'll be immediately screwed.

ADD REPLYlink written 16 months ago by Devon Ryan88k

Is the focus of the question to figure out if you know python (and how to code) because otherwise that question does not make a lot of sense. If you are an end-user/consumer of code (python or else) that others make so generously available, then I don't see any problem in making that clear when you answer the question.

ADD REPLYlink modified 16 months ago • written 16 months ago by genomax63k

Thank you, exactly as what you stated I am an end-user/consumer of code and as WouterDeCoster mentioned I use Python and R for differential expression, plotting and data processing. you are always helpful

ADD REPLYlink written 16 months ago by F3.4k
gravatar for WouterDeCoster
16 months ago by
WouterDeCoster37k wrote:

If this is hypothetical (and not explicitly about how you have used Python) then you can say that Python can be used for pretty much everything, although it has it's strong points and weaker points.

You could use python to create a pipeline (snakemake), make plots (matplotlib, seaborn), interface with databases (sqlite), perform processing of fasta/fastq (biopython) and create websites (django).

But even more important than knowing what you can is knowing the limitations. While you have a lot of statistical modules in Python - usually R is the more appropriate language here, definitely for differential expression analysis. But there is nothing that you can do in R that you cannot do in Python. For making production ready code for which speed is an issue then Python is often not your best guess, rather something compiled such as C or Java, or more recent languages such as Go or Rust. It's often 1) having a language you are comfortable in and 2) using the right tool for the job at hand.

I really dislike R, but I do differential expression analysis in R.

ADD COMMENTlink written 16 months ago by WouterDeCoster37k
gravatar for Alex Reynolds
16 months ago by
Alex Reynolds27k
Seattle, WA USA
Alex Reynolds27k wrote:

You might keep an interactive notebook that is Python-based, like Jupyter, or IPython, or Google's recent Colaboratory effort, which also appears to tie in Tensorflow-based machine learning components.

ADD COMMENTlink modified 16 months ago • written 16 months ago by Alex Reynolds27k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1213 users visited in the last hour