Question: intersectBed a gtf file with a vcf file
0
gravatar for Susmita Mandal
10 weeks ago by
Bangalore
Susmita Mandal0 wrote:

Hey all,

I have been trying to intersect a gtf file with a vcf file using intersectBed without using any options. I have edited my VCF, it has the 8 necessary fields and I have added two columns of my own which are read counts from bed files when intersected with a vcf file ( intersectBed -c ). And the intersectBed with the gtf and vcf is working but something weird is happening. All the gtf files are same. Is that weird or usual? I'm checking the difference between gtf files using plain diff. Am i doing some mistake in creating the VCF file or in the intersectBed step? help me

Susmita

snp intersectbed bedtools • 154 views
ADD COMMENTlink written 10 weeks ago by Susmita Mandal0

9 th column is format column and tags should be defined in vcf header and 10th column on wards sample information as per fields/tags defined in 9th column. I think your VCF is not standard vcf as you defined in OP. Please post your edited vcf to know better. Susmita Mandal

ADD REPLYlink written 10 weeks ago by cpad011210k
##fileformat=VCFv4.2
#CHROM  POS ID  REF ALT QUAL    FILTER  INFO    129S1   JF1
1   3000176 .   T   G   40.00   PASS    .   0   0
1   3000223 .   T   A   40.00   PASS    .   0   0
1   3000248 rs32640266  G   T   237.81  PASS    .   0   0
1   3000289 rs32137367  T   G   255.31  PASS    .   0   0
ADD REPLYlink modified 9 weeks ago by finswimmer8.2k • written 9 weeks ago by Susmita Mandal0

Hellohttps://www.biostars.org/u/49214/ ,

Please use the formatting bar (especially the code option) to present your post better. I've done it for you this time.
code_formatting

Thank you!

ADD REPLYlink written 9 weeks ago by finswimmer8.2k

Hello Susmita Mandal ,

could you please post the commands you use and how your input and output files looks like? I didn't understand what you have tried and what your problem is.

Thanks.

fin swimmer

ADD REPLYlink written 9 weeks ago by finswimmer8.2k

This command seems to give me same results in all the samples:

intersectBed -a /home1/Mouse_Reference_genome/NCBIM37.67/Mus_musculus.NCBIM37.67_plus.gtf -b /home1/temp/8/8_SRR_133_forward_intersect.vcf > /home1/Intersect_by_GTF/plus/SRR_133_plus.gtf

The gtf files are identical!

And this command i used to get the VCF before i did some tinkering:

intersectBed -c -split -sorted -a /home1/Strain_specific_SNPs/SNPs_merged.vcf -b /home1/Sorted_BED_SplitbyStrand/129/SRR_133_129_forward_sort.bed > /home1/Intersect_by_VCF/129/SRR_133_129_forward_intersect.bed

Read counts are different for each sample!

ADD REPLYlink modified 9 weeks ago by finswimmer8.2k • written 9 weeks ago by Susmita Mandal0
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1181 users visited in the last hour