Skip to content

MyoSpectrum

MyoSpectrum is an integrative transcriptional atlas of human skeletal muscles spanning healthy and myopathic conditions.

WARNING

This project V1.0 has been archived and is no longer updated.

Overview

On April 10, 2024, Communications Biology published a collaborative study by the neuromuscular disease team at Huashan Hospital together with the University of Helsinki, University of Campania, University of Western Australia, and Karolinska Institute. This study revealed the myopathy spectrum linking transcriptional expression and clinical phenotypes across 1,221 human skeletal muscles through integrative analysis.

Background

Myopathies refer to a large group of debilitating diseases affecting the human skeletal muscle system. They are generally genetic in origin and mostly classified as rare diseases. Bulk RNA-seq, as a fundamental sequencing method, can capture the instantaneous transcriptional physiological state of muscles. Although it has been applied in myopathy research, published data lack integration, and the association between transcription and clinical features has not been systematically studied.

This study utilized RNA-seq data from GTEx, GEO databases, and local Helsinki human skeletal muscles (292 myopathy muscles and 929 control muscles), analyzing the transcriptomic data (9,231 genes) of 1,221 muscles and their shared associations with clinical phenotypes.

Study Design

This study incorporated Bulk RNA-seq data from different myopathies, sequencing methods, sequencing depths, and phenotypes, resulting in considerable dataset heterogeneity.

Similar to the "baimiao" technique in traditional Chinese ink painting, this study innovatively used unimportant blank space (controls) to highlight the main subject (myopathies). Because myopathies are mostly rare diseases (incidence typically below 1 in 100,000), population-based data from GTEx (n = 803) can serve as healthy controls. By leveraging GTEx-recorded donor causes of death and categorizing muscles accordingly, three types of controls were derived:

  • Very healthy controls — accidental death without accompanying disease (n = 234)
  • Generally unhealthy controls — death accompanied by general diseases (n = 470)
  • Severely unhealthy controls — death accompanied by wasting diseases (n = 87)

This ultimately revealed a stepwise progressive transcriptional pattern.

Workflow

Dataset

This study comprises 23 muscle phenotypes (8 control types, 15 myopathy types) and two sequencing methods: mRNA and total RNA.

PhenotypenSequencingSource
Control (accident death)31mRNAGTEx
Control (unexpected death)203mRNAGTEx
Control (intermediate death)46mRNAGTEx
Control (ventilator case)424mRNAGTEx
Control (slow death)87mRNAGTEx
Control (others)111mRNA/total RNAGTEx/GEO
Control (amputee)24mRNAHelsinki
Control (hyperCKemia)3mRNAHelsinki
FSHD61total RNAGEO
DM144total RNAGEO
LGMD R1241total RNAGEO
CDM36total RNAGEO
Titinopathy31mRNAHelsinki
IBM28mRNA/total RNAHelsinki/GEO
DMD5total RNAGEO
BMD5total RNAGEO
Actinin-2 myopathy5mRNAHelsinki
Myopathy (HNRNPA1)5mRNAHelsinki
SMPX myopathy4mRNAHelsinki
Myopathy (unsolved)24mRNAHelsinki

Total: 1,221 muscles (929 controls + 292 myopathy)

The Transcriptional Myopathy Spectrum

After integrating all data, a distinctive transcriptional pattern of disease progression emerges:

Myopathy Spectrum

Spectrum progression order:

  1. Very healthy donors (death from accidents, suicide, etc., n = 234)
  2. Donors with general diseases (unhealthy at death, accompanied by various diseases, n = 470)
  3. Donors with wasting diseases (accompanied by wasting conditions such as cancer, tuberculosis, etc., n = 87)
  4. Myopathy patients (various common muscle diseases, n = 292)

Key Finding

Notably, myopathy muscles on the far right are not entirely unhealthy — some muscles (hyperCKemia, asymptomatic myopathy) intrude downward into the very healthy range, displaying transcriptional patterns similar to healthy controls. This is analogous to the asymptomatic pre-clinical stages observed in clinical practice.

Neither age, sex, sequencing method, nor myopathy type is a driving factor of the myopathy spectrum. Notably, Helsinki myopathy muscles (mRNA sequencing) and GEO myopathy muscles (total RNA sequencing) both cluster in the upper right, suggesting they share the same transcriptional pattern.

Clinical Validation

To verify whether the transcriptional progressive results have clinical significance, published myopathy data were used for multi-level validation.

Clinical Features and Transcriptional Spectrum

Both data-level simulations and clinical feature associations across different myopathies showed the same stepwise disease progression as the myopathy spectrum:

MyopathyClinical Featurep-valueResult
CDMCTG repeat expansion (peripheral blood)1.07e-03Significant stepwise progression
LGMD R12Mercuri score (conventional MRI)2.09e-06Significant stepwise progression
LGMD R1210-meter walk test0.014Significant stepwise progression
LGMD R126-minute walk test0.011Significant stepwise progression
FSHDFat fraction (quantitative MRI)0.193Consistent trend
FSHDPathology score0.36Consistent trend
FSHDClinical severity score0.753Consistent trend

About FSHD

Only CDM and LGMD R12 showed more pronounced stepwise progression, likely related to biopsy muscle heterogeneity. CDM and LGMD R12 were biopsied from the same muscle (e.g., vastus lateralis), while FSHD samples came from diverse sources (tibialis anterior, gastrocnemius, portions of quadriceps or biceps femoris).

Using 234 very healthy GTEx muscles as a novel healthy control, DEG analysis on the integrated data yielded transcriptional signatures for each myopathy type.

Featured Genes and qPCR Validation

About Control Selection

For ethical reasons, neither Helsinki control muscles (from hyperCKemia and diabetic amputee patients) nor GEO control muscles (histologically normal muscles from suspected myopathy patients) occupy a truly healthy position — they fall in the intermediate, not-quite-healthy range. The 234 very healthy muscles at the bottom (from donors without known diseases) serve as a novel healthy control.

Top 5 up-regulated genes in general myopathy:

GeneFull NameFunction
MGST1Microsomal Glutathione S-Transferase 1Protection against oxidative stress, potential role in aging
AOX1Aldehyde Oxidase 1Contributor to myogenesis
FASNFatty Acid SynthaseMarker of fat infiltration in myopathic muscles
PRKCDProtein Kinase C DeltaRegulation of lipogenesis
CHRNA1Cholinergic Receptor Nicotinic Alpha 1Increases in aging skeletal muscle, induces sarcopenia

MGST1, AOX1, FASN, PRKCD, and CYP4B1 — these five genes had never been reported as significant in prior myopathy studies, yet were found to be differentially expressed in cell and animal models, suggesting that controls used in current myopathy research are not truly "healthy."

Helsinki muscle qPCR validation:

  • General myopathy: MGST1 (RQ 10.34/2.09, p = 0.02), AOX1, FASN, PRKCD
  • Titinopathy: CYP4B1 (RQ 19.71/1.41, p = 0.03)
  • IBM: CD163 (RQ 27.87/10.05, p = 0.23)

Pathway analysis indicates that myopathies primarily affect metabolism-related pathways.

Tissue Deconvolution

Using the TAPE deep learning autoencoder with two human skeletal muscle single-cell datasets (Tabula Sapiens and GSE143704) for tissue deconvolution:

  • Myopathy groups showed fewer vasculature structure cell proportions, higher satellite stem cells, natural killer T cells, and fibroblast (tendon) cells
  • Greater proportion of adipocytes and COL1A+ fibroblasts were found in myopathy groups

Comparison with Published Results

Re-analyzing both integrated data and raw data from published studies using the same software pipeline revealed a "pyramid" pattern in muscle sequencing results:

  • Top-tier highly differentially expressed genes are nearly unaffected by sample size or control changes
  • Bottom-tier differentially expressed genes shift continuously with these factors

Featured genes were consistently preserved across different control phenotype comparisons (FSHD 80%, LGMD R12 80%, IBM 73.3%), with retention rates increasing to 100% when sample sizes were expanded.

Conclusions

  1. Unprecedented controls: Identified 234 genuinely healthy GTEx muscles as a novel control group
  2. Continuous spectrum: The transcriptomic landscape reveals continuous disease progression from healthy to severe myopathy
  3. Novel featured genes: MGST1, AOX1, FASN, PRKCD, and CHRNA1 were rarely reported in previous myopathy studies
  4. Clinical correlation: The transcriptional spectrum aligns with CTG repeats, MRI fat infiltration, walk tests, and other clinical features

Limitations

  1. The integration process itself introduces interference in results, particularly given the substantial heterogeneity of this dataset
  2. Due to ethical constraints, experimental validation lacked "healthy" muscle controls and could only use "histologically healthy" muscles as substitutes
  3. Muscle selection for biopsies also introduces variability in results
  4. Flow cytometry or large-scale single-cell datasets could better characterize muscle composition

Publication

Zhong, H., Sian, V., Johari, M., Katayama, S., Oghabian, A., Jonson, P. H., Hackman, P., Savarese, M., & Udd, B. Revealing myopathy spectrum: integrating transcriptional and clinical features of human skeletal muscles with varying health conditions. Communications Biology, 2024, 7(1):438. DOI

Acknowledgments

Funding support: FHRC, Academy of Finland, Jane and Aatos Erkko Foundation, Magnus Ehrnrooth Foundation, AFM Téléthon, China Scholarship Council.

Source Code

最近更新