Question: Use R to find overlapping positions from transcription start sites
1
gravatar for igperez
8 months ago by
igperez10
igperez10 wrote:

I am trying to use R to find the transcription factors Pdx1/NeuroD overlapping positions that are variable distances from transcription start sites. For the SELECT/WHERE query to work, you must SELECT data from two different Excel files. this is what I've attempted

promoter2 <- sqldf("SELECT pdx_neuroD6.Chr, pdx_neuroD6.neuroD.pos-1000, pdx_neuroD6.neuroD.pos+1000, promoter1.known_gene chrom, promoter1.tx_start from pdx_neuroD6 AND promoter1 WHERE Chr = known_gene chrom AND tx_start BETWEEN NeuroD.pos-1000 AND neuroD.pos+1000")

this is the error

Error in result_create(conn@ptr, statement) : near "AND": syntax error

sql R • 266 views
ADD COMMENTlink modified 8 months ago by zx87548.7k • written 8 months ago by igperez10
2
gravatar for zx8754
8 months ago by
zx87548.7k
London
zx87548.7k wrote:

If you format your SQL code, then it is easy to spot why this is happening. You are using AND instead of a comma (,) between the table names. Also you have spaces in column names, so we need to wrap them into square brackets ([...]):

sqldf("
SELECT pdx_neuroD6.Chr,
       pdx_neuroD6.neuroD.pos-1000,
       pdx_neuroD6.neuroD.pos+1000,
       promoter1.[known_gene chrom],
       promoter1.tx_start 
FROM pdx_neuroD6, promoter1 
WHERE Chr = [known_gene chrom] AND 
      tx_start BETWEEN NeuroD.pos-1000 AND neuroD.pos+1000
")
ADD COMMENTlink modified 8 months ago • written 8 months ago by zx87548.7k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1938 users visited in the last hour