Question: Validating pre-processing pipeline from FASTQ to BAM
gravatar for Maxime
12 months ago by
Maxime0 wrote:

GATK ver. : Picard ver. : 2.21.4 samtools ver. : 1.10


I'm learning to create a pipeline for variant calling. I started with an arbritrary chosen exome from 1000genomes in form of two FASTQ files.

I pre-processed the data using the GATK Best Practice workflow

And ended-up with a supposedly "analysis-ready" bam file.

Since 1000genomes also provides a .cram file (aswell as a .cram.crai and a .bam.bas). How would I be able to compare my file with what is provided? I converted the .cram into a .bam file and I'm looking for a way to compare the two.

Next, for the variant calling, 1000genomes provides a .vcf file for each chromosome. How can I know wich type of variant calling was done? (SNP, SNV, Indels, CNV, ... ) Would I be able to check the validity of my .vcf result?

Any help would be appreciated, don't hesitate to ask for more informations.

Thank you in advance,


samtools picard gatk • 335 views
ADD COMMENTlink modified 12 months ago • written 12 months ago by Maxime0

Hi Maxime,

I strongly suggest you edit your post to make it notably shorter. Most users (including myself) will not read through such a long post to even understand what your problem is. Please try to work out the core part of your question while providing the minimal necessary information to understand it. This is of course a suggestion and you are free to decide if you follow it or not, but I think shortening your post will increase your chance of a good response.

ADD REPLYlink modified 12 months ago • written 12 months ago by ATpoint44k

Hi, Thank you for your consideration, I actually though that it might be too long. Isn't there a way to make some sort of spoiler section to add information without overloading the post?

ADD REPLYlink written 12 months ago by Maxime0

Unfortunately not :(

ADD REPLYlink modified 12 months ago • written 12 months ago by ATpoint44k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1041 users visited in the last hour