Question: How do I get the lengths of different regions targeted by the Nextera expanded rapid capture exome kit for b37?
1
gravatar for anp375
4.5 years ago by
anp375170
anp375170 wrote:

I want the total lengths of regions that are exons, splicing, UTRs, and ncRNA. I can't distinguish them from the provided bed file. If there is a way using the bed file, please let me know. Thank you.

nextera lengths exome • 1.2k views
ADD COMMENTlink modified 4.5 years ago by Carlos Borroto1.9k • written 4.5 years ago by anp375170
2
gravatar for Carlos Borroto
4.5 years ago by
Carlos Borroto1.9k
Washington Metropolitan Area
Carlos Borroto1.9k wrote:

You can use a GTF with annotated intervals for the categories you want. For example this one from Ensembl. Use something like unix grep to create files for the categories you are interested. Then use bedtools intersect to get the regions of your BED file overlapping with each category GTF file. Finally use the procedure described here to find total lengths.

Also, if you are only interested on the ratio for each category, you could simplify this by using bedtools jaccard. In this case you would only need your BED file and the categories GTF files.

ADD COMMENTlink written 4.5 years ago by Carlos Borroto1.9k

Thank you, this helped a lot.

ADD REPLYlink modified 4.5 years ago • written 4.5 years ago by anp375170
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1883 users visited in the last hour