Step 5: Post Alignment QC - GATK CollectAlignmentSummaryMetrics and CollectInsertSizeMetrics
22
Create multiQC report of post alignment metrics
23
Step 6: Call variants - GATK HaplotypeCaller
Description:
Dive into a comprehensive tutorial on variant calling from whole genome sequencing (WGS) data using the GATK best practice workflow. Learn how to set up a pipeline in bash (Linux) to pre-process and align reads, ultimately generating a VCF file. Follow step-by-step instructions for quality control with FastQC, alignment using BWA-MEM, marking duplicate reads, performing Base Quality Score Recalibration (BQSR), and calling variants with HaplotypeCaller. Gain insights into the intuition behind each step, runtime expectations, and memory requirements. Access provided code, data sources, and additional resources to enhance your understanding of SAM file formats, SAM flags, and VCF file formats. Perfect for bioinformaticians and researchers looking to master variant calling techniques in genomic analysis.
WGS Variant Calling - Variant Calling with GATK - Part 1 - Detailed NGS Analysis Workflow