Hi folks -- haven't had much success in figuring this out through Googling. Hoping for some help here:
My inputs are as follows: a BAM where each alignment has tags for a barcode and UMI. I also have an annotation file in .gff3 format.
My desired output is a table where each alignment in the BAM is broken down into three columns: barcode, UMI, and a given feature from the annotation file (gene, exon, feature, etc.).
Thanks to recent advice obtained here, I'm familiar with how to parse through the BAM file to pluck out the barcode and UMI, but the process of taking the alignment data and returning the genome feature is beyond my thought process. Any ideas? Many thanks.
AWESOME. Thanks for this!