Question: Bedtools intersect error in record file
0
gravatar for jamie.pike
11 weeks ago by
jamie.pike50
jamie.pike50 wrote:

I keep getting an error with bedtools intersect when comparing a .bed file to a .gff, e.g.

Error: Invalid record in file /home/u1983390/Fusarium_data/MIMPS/MIMP_Searches/IntersectBED/GFFS_SameNameFormula/AGND.SNF.gff. Record is 
JH658272    ena PROMOTER    0   0   .   +    ID=gene:FOIG_00001;biotype=protein_coding;description=hypothetical

I think that the problem is in the labelling in column 1 in the .bed file doesn't match that of the labelling in column 1 of the gff:

.bed file

AGND01000115.1  654599  658809  .   .   +
AGND01000115.1  654599  658809  .   .   +
AGND01000115.1  654599  658634  .   .   -
AGND01000115.1  654774  658809  .   .   +
AGND01000218.1  289972  294170  .   .   +
AGND01000218.1  289972  294170  .   .   +
AGND01000218.1  289972  294170  .   .   +
AGND01000218.1  289972  294170  .   .   -
AGND01000243.1  34534   38759   .   .   +
AGND01000243.1  34534   38759   .   .   +

.gff

JH658272    ena PROMOTER    0   0   .   +   .   ID=gene:FOIG_00001;biotype=protein_coding;description=hypothetical
JH658272    ena PROMOTER    0   1338    .   +   .   ID=gene:FOIG_00002;biotype=protein_coding;description=hypothetical
JH658272    ena PROMOTER    5777    6538    .   -   .   ID=gene:FOIG_00003;biotype=protein_coding;description=hypothetical
JH658272    ena PROMOTER    6692    7297    .   -   .   ID=gene:FOIG_00004;biotype=protein_coding;description=hypothetical
JH658272    ena PROMOTER    4062    5109    .   +   .   ID=gene:FOIG_00005;biotype=protein_coding;description=hypothetical
JH658272    ena PROMOTER    5522    6494    .   +   .   ID=gene:FOIG_00006;biotype=protein_coding;description=hypothetical
JH658272    ena PROMOTER    7461    9101    .   +   .   ID=gene:FOIG_00007;biotype=protein_coding;description=hypothetical
JH658272    ena PROMOTER    13652   15129   .   -   .   ID=gene:FOIG_00008;biotype=protein_coding;description=hypothetical
JH658272    ena PROMOTER    15366   16052   .   -   .   ID=gene:FOIG_00009;biotype=protein_coding;description=hypothetical
JH658272    ena PROMOTER    13017   14774   .   +   .   ID=gene:FOIG_00010;biotype=protein_coding;description=hypothetical

Is this the case? And if so, how might I go about fixing it?

ADD COMMENTlink written 11 weeks ago by jamie.pike50
2
gravatar for ATpoint
11 weeks ago by
ATpoint36k
Germany
ATpoint36k wrote:

A GFF is by definition 1-based so a start coordinate of 0 is an invalid record. Your GFF has at least two 0 start coordinate.

ADD COMMENTlink modified 11 weeks ago • written 11 weeks ago by ATpoint36k

Great! This fixed it, thanks!

ADD REPLYlink written 11 weeks ago by jamie.pike50
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1394 users visited in the last hour