Convert gff to bed, including non coding elements
1
0
Entering edit mode
17 months ago
LDT ▴ 340

I am using agat atm to convert a gff file to bed using the following command.

agat_convert_sp_gff2bed.pl -gff Arabidopsis_thaliana_TAIR10.gff3  -o test.bed

When I convert the gff to bed I find NA values (aka .) in the thickStart and thickEnd column of the bed file for the non-coding RNAs. Is there a way to convert gff to bed and acquire a thickStart and thickEnd values for these elements?

Thank you for your time
In the link, I post the gff3 file that I am working on https://drive.google.com/drive/folders/1-wmbc9gKtbXFJ95E0n41WgPL-G313SNe?usp=sharing

bed RNA non-coding gff • 608 views
ADD COMMENT
4
Entering edit mode
17 months ago
Juke34 8.6k

thickStart and thickEnd are usulally used to define coding part (CDS), which does not exist for non-coding genes. A possibility is to fill thickStart and thickEnd with chromStart and chromEnd values using a awk command:

awk 'BEGIN{OFS="\t"}{if($7 == "."){$7=$2; $8=$3} print $0 }' file.bed
ADD COMMENT
0
Entering edit mode

Thank you so much Juke34 for your help! I highly appreciate it

ADD REPLY

Login before adding your answer.

Traffic: 1759 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6