Entering edit mode
                    12 months ago
        ashaneev07
        
    
        ▴
    
    50
    Hello all,
I have been working with a sample dataset and have developed a workflow that generates a VCF file through GATK. This is typically used for identifying genetic variants in human samples. It would mean a lot to me if anyone could kindly review my commands, as your feedback would help boost my confidence in my approach. Only then can I apply this to the original sample.
Thank you..
    bwa mem -t 4 -R "@RG\tID:sample003\tPL:ILLUMINA\tSM:sample003" /hg38/hg38.fa sample_R1.fastp.fastq.gz sample_R2.fastp.fastq.gz  > s
```
\tPL:ILLUMINA\tSM:sample003" /hg38/hg38.fa sample_R1.fastp.fastq.gz sample_R2.fastp.fastq.gz | samtools view -Sb - > sample003.paired.bam 
samtools view -Sb sample003.paired.sam > sample003.paired.bam
samtools sort sample003.paired.bam -o sample003.paired_sorted.bam
gatk MarkDuplicatesSpark -I sample003.paired_sorted.bam -O sample003.sorted_dedup_reads.bam 
gatk BaseRecalibrator -I sample003.sorted_dedup_reads.bam -R  /hg38/hg38.fa  --known-sites  /hg38/Homo_sapiens_assembly38.dbsnp138.vcf -O recal_data.table 
gatk ApplyBQSR -I sample003.sorted_dedup_reads.bam -R /hg38/hg38.fa --bqsr-recal-file recal_data.table -O sample003_sorted_dedup_bqsr_reads.bam 
gatk CollectAlignmentSummaryMetrics R=/hg38/hg38.fa I=sample003_sorted_dedup_bqsr_reads.bam O=alignment_metrics.txt
gatk HaplotypeCaller -R /hg38/hg38.fa -I sample003_sorted_dedup_bqsr_reads.bam -O 53_raw_variants.vcf 
                    
                
                2
            
        Entering edit mode
                    
- gatk-funcotator data source
 - Cutadapt&FastQC:sequence length disturbution failed after adapter trimming
 - Server or aws cloud- which one is better for gatk pipeline
 - Fetch table from clinvar database according to a list of rsid
 - Minimum requirements for a laptop to do rna seq analysis
 - Exonerate installation on server
 - Error while running gatk
 - Augustus and SNAP training in Maker-p pipeline
 - Retrieving sequence from NCBI
 - Snoscan#error#make: Nothing to be done for 'all'
 - Maker Annotation pipeline Output error
 - E: Package 'libbamtools-dev' has no installation candidate
 - Increase memory limit in SPAdes-3.6.1
 - SNP filtering from specific regions
 
    ADD REPLY
    • link
    
    
    
    
    
    
        
    
    
        12 months ago by
    
    
        Pierre Lindenbaum
    
    
    
        
    
    166k