Hi y'all,
I have a .bed file file currently formatted like this:
chr start stop
1 14520 14812
1 65409 65725
1 65731 66073
1 69381 69700
1 721281 722042
1 752816 753135
I would like to get something that looks like this (where for overlapping exons, use the exon boundary, unless this boundary was <10bp, in which case expand the probes to include at minimum 10bp of sequence):
chr start stop name 1 69090 70008 OR4F5 1 565876 566576 1 801642 802733 1 861321 861393 SAMD11 1 865534 865716 SAMD11
Is there a way that I can use UCSC or another tool to accomplish this?