Question

Herald:The Biostar Herald for Thursday, May 11, 2023

3

Entering edit mode

2.2 years ago

Biostar 3.6k

The Biostar Herald publishes user submitted links of bioinformatics relevance. It aims to provide a summary of interesting and relevant information you may have missed. You too can submit links here.

This edition of the Herald was brought to you by contribution from Istvan Albert, and was edited by Istvan Albert,

Benchmarking integration of single-cell differential expression | Nature Communications (www.nature.com)

Here, we benchmark 46 workflows for differential expression analysis of single-cell data with multiple batches. We show that batch effects, sequencing depth and data sparsity substantially impact their performances. Notably, we find that the use of batch-corrected data rarely improves the analysis for sparse data, whereas batch covariate modeling improves the analysis for substantial batch effects.

submitted by: Istvan Albert

A draft human pangenome reference | Nature (www.nature.com)

Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals1. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels.

submitted by: Istvan Albert

We are excited to announce a collection of new human reference genome sequences that reflect much more global diversity! https://t.co/a39cFG8dOu
— Human Pangenome Reference Consortium (@HumanPangenome) May 10, 2023

submitted by: Istvan Albert

Comprehensive de novo mutation discovery with HiFi long-read sequencing | Genome Medicine | Full Text (genomemedicine.biomedcentral.com)

Long-read sequencing (LRS) techniques have been very successful in identifying structural variants (SVs). However, the high error rate of LRS made the detection of small variants (substitutions and short indels < 20 bp) more challenging. The introduction of PacBio HiFi sequencing makes LRS also suited for detecting small variation. Here we evaluate the ability of HiFi reads to detect de novo mutations (DNMs) of all types, which are technically challenging variant types and a major cause of sporadic, severe, early-onset disease.

submitted by: Istvan Albert

AirLift: A Fast and Comprehensive Technique for Remapping Alignments between Reference Genomes | bioRxiv (www.biorxiv.org)

AirLift is the first read remapping tool that enables users to quickly and comprehensively map a read set, that had been previously mapped to one reference genome, to another similar reference. Users can then quickly run downstream analysis of read sets for each latest reference release. Compared to the state-of-the-art method for remapping reads (i.e., full mapping), AirLift reduces the overall execution time to remap read sets between two reference genome versions by up to 27.4×. We validate our remapping results with GATK and find that AirLift provides high accuracy in identifying ground truth SNP/INDEL variants.

submitted by: Istvan Albert

Transformation of alignment files improves performance of variant callers for long-read RNA sequencing data | Genome Biology | Full Text (genomebiology.biomedcentral.com)

Long-read RNA sequencing (lrRNA-seq) produces detailed information about full-length transcripts, including novel and sample-specific isoforms. Furthermore, there is an opportunity to call variants directly from lrRNA-seq data. However, most state-of-the-art variant callers have been developed for genomic DNA. Here, there are two objectives: first, we perform a mini-benchmark on GATK, DeepVariant, Clair3, and NanoCaller primarily on PacBio Iso-Seq, data, but also on Nanopore and Illumina RNA-seq data; second, we propose a pipeline to process spliced-alignment files, making them suitable for variant calling with DNA-based callers. With such manipulations, high calling performance can be achieved using DeepVariant on Iso-seq data.

submitted by: Istvan Albert

Want to get the Biostar Herald in your email? Who wouldn't? Sign up righ'ere: toggle subscription

herald • 1.2k views

ADD COMMENT • link 2.2 years ago by Biostar 3.6k

Benchmarking integration of single-cell differential expression | Nature Communications (www.nature.com)

A draft human pangenome reference | Nature (www.nature.com)

We are excited to announce a collection of new human reference genome sequences that reflect much more global diversity! https://t.co/a39cFG8dOu— Human Pangenome Reference Consortium (@HumanPangenome) May 10, 2023

Comprehensive de novo mutation discovery with HiFi long-read sequencing | Genome Medicine | Full Text (genomemedicine.biomedcentral.com)

AirLift: A Fast and Comprehensive Technique for Remapping Alignments between Reference Genomes | bioRxiv (www.biorxiv.org)

Transformation of alignment files improves performance of variant callers for long-read RNA sequencing data | Genome Biology | Full Text (genomebiology.biomedcentral.com)

We are excited to announce a collection of new human reference genome sequences that reflect much more global diversity! https://t.co/a39cFG8dOu
— Human Pangenome Reference Consortium (@HumanPangenome) May 10, 2023