Journal: medRxiv
Article Title: Clinical, in vitro, and in vivo evidence of WAPL as a novel cohesinopathy gene and phenotypic driver of 10q22.3q23.2 genomic disorder
doi: 10.64898/2026.02.23.26346364
Figure Lengend Snippet: a. Simplified depiction of cohesin balance. Cohesin-mediated genome organization is initiated by loading of the cohesin ring (dark grey) by NIPBL and MAU2 onto chromatin (dark blue), followed by loop extrusion. WAPL resets this process by removing cohesin from chromatin. PDS5 facilitates both cohesin release by WAPL and cohesin stability by sororin, among other functions . b-c. Probability of haploinsufficiency (pHaplo, b) of 18,641 autosomal genes, via , and loss of function observed/expected upper bound fraction (LOEUF, c) of 18,567 genes, via gnomAD . Red lines, average of known autosomal dominant CdLS genes NIPBL , RAD21 , and SMC3 . Blue lines, average of cohesin release factor genes WAPL , PDS5A , and PDS5B . See also Table S1. d. 27 WAPL variants identified in subjects including predicted damaging missense (ms, top) and loss-of-function (pLoF, bottom), plotted in protein space in genomic orientation. Domains of the canonical protein (1190 amino acids) via UniProt ( https://www.uniprot.org ) and Ensembl ( https://ensembl.org ). gnomAD pLoF variants passing quality filters are denoted by black (frameshift, stop gain) and grey (splice) circles. The diamond denotes the point after which escape from nonsense-mediated decay would be predicted (codon 1151). The triangle is the start position for minor 3′ isoforms in (h). The cut site for CRISPR/Cas9 experiments is in gold text. The p.Val863AlafsTer2 variant was seen in two individuals. Missense and pLoF variants both cluster in the C-terminal half of WAPL (p=0.0020 and p=0.0024, respectively by two-tailed binomial test). e. Location of WAPL within the 7.8 Mb 10q22.3q23.2 reciprocal genomic disorder region (black bar; via ClinGen Pathogenic CNV track of the UCSC Genome Browser ( https://genome.ucsc.edu/ )). f. AlphaFold per-residue structural confidence (pLDDT, or predicted local distance difference test), via . Y-axis range is 0-100, with scores <50 (orange) being a reasonably strong predictor of disorder ( https://alphafold.ebi.ac.uk/ ). g. Deep protein language model-derived predicted pathogenicity of every possible missense change in WAPL , via . Y-axis, amino acid. X-axis, residue. Higher LLR (log-likelihood ratio) indicates more likely deleteriousness h. Regional missense constraint in amino acid space, via gnomAD v2.1.1 ( https://gnomad.broadinstitute.org/ ). Scores indicate the fraction of expected amino acid variation in a population cohort. i. WAPL transcript expression by tissue, via gnomAD. C-terminal isoforms exist but are more lowly expressed. j-k. 3D structures of WAPL via AlphaFold (j) ( https://www.alphafold.ebi.ac.uk/entry/Q7Z5K2 ) that demonstrates low-confidence structure for its N-terminal half, and via crystallography of its C-terminal half (k) (pdb ID 4k6j, amino acids 631-1190). Case missense variants (red) show possible clustering. Plots created via ( https://g2p.broadinstitute.org ; ).
Article Snippet: 10q dels and dups were genotyped via droplet digital PCR as in , with a Thermo-Fisher Taq-Man target probe (ID Hs00599013_cn).
Techniques: CRISPR, Variant Assay, Two Tailed Test, Residue, Derivative Assay, Expressing