Question: Use R to find overlapping positions from transcription start sites
igperez wrote (12 months ago):

I am trying to use R to find Pdx1/NeuroD transcription factor positions that overlap transcription start sites within a given distance (here, 1000 bp on either side). The SELECT/WHERE query has to pull data from two tables that come from two different Excel files. This is what I've attempted:

promoter2 <- sqldf("SELECT pdx_neuroD6.Chr, pdx_neuroD6.neuroD.pos-1000, pdx_neuroD6.neuroD.pos+1000, promoter1.known_gene chrom, promoter1.tx_start from pdx_neuroD6 AND promoter1 WHERE Chr = known_gene chrom AND tx_start BETWEEN NeuroD.pos-1000 AND neuroD.pos+1000")

This is the error:

Error in result_create(conn@ptr, statement) : near "AND": syntax error

Tags: sql, R

zx8754 wrote (12 months ago):

If you format your SQL code, it is easy to spot why this is happening: you are using AND instead of a comma (,) between the table names. Also, one of the column names contains a space, so it needs to be wrapped in square brackets ([...]):

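# Note: neuroD.pos contains a dot. sqldf versions that keep dots in column
# names need it bracketed too, otherwise SQLite reads a.b.c as
# database.table.column.
sqldf("
SELECT pdx_neuroD6.Chr,
       pdx_neuroD6.[neuroD.pos] - 1000,
       pdx_neuroD6.[neuroD.pos] + 1000,
       promoter1.[known_gene chrom],
       promoter1.tx_start
FROM pdx_neuroD6, promoter1
WHERE Chr = [known_gene chrom] AND
      tx_start BETWEEN [neuroD.pos] - 1000 AND [neuroD.pos] + 1000
")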
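
To check the query on something small, here is a minimal sketch with two made-up data frames. The column names are copied from the question, but the values, the table aliases (p, g) and the output names (win_start, win_end, gene_chrom) are assumptions for illustration only. It also assumes a sqldf version that keeps dots in column names, so neuroD.pos is bracketed; an explicit JOIN ... ON is used in place of the comma join, which should return the same rows.

```r
library(sqldf)

# Hypothetical toy tables; only the column names come from the question.
pdx_neuroD6 <- data.frame(
  Chr = c("chr1", "chr1", "chr2"),
  neuroD.pos = c(5000, 90000, 12000),
  stringsAsFactors = FALSE
)
promoter1 <- data.frame(
  "known_gene chrom" = c("chr1", "chr2", "chr2"),
  tx_start = c(5400, 12900, 40000),
  check.names = FALSE,          # keep the space in the column name
  stringsAsFactors = FALSE
)

# Keep TSSs that fall within +/- 1000 bp of a NeuroD position
# on the same chromosome.
promoter2 <- sqldf("
SELECT p.Chr,
       p.[neuroD.pos] - 1000 AS win_start,
       p.[neuroD.pos] + 1000 AS win_end,
       g.[known_gene chrom]  AS gene_chrom,
       g.tx_start
FROM pdx_neuroD6 AS p
JOIN promoter1 AS g
  ON p.Chr = g.[known_gene chrom]
WHERE g.tx_start BETWEEN p.[neuroD.pos] - 1000 AND p.[neuroD.pos] + 1000
")

print(promoter2)
```

With this toy data, only the chr1 site at 5000 and the chr2 site at 12000 have a transcription start inside their window, so two rows come back. Naming the computed columns with AS makes the result easier to work with afterwards in R.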