Multi-omics data integration identifies novel biomarkers and patient subgroups in inflammatory bowel disease.
Background: Inflammatory bowel disease (IBD), comprising Crohn's disease (CD) and ulcerative colitis (UC), is a complex condition with diverse manifestations; recent advances in multi-omics technologies are helping researchers unravel its molecular characteristics to develop targeted treatments.
Objective: In this work, we explored one of the largest multi-omics cohorts in IBD, the Study of a Prospective Adult Research Cohort (SPARC IBD), with the goal of identifying predictive biomarkers for CD and UC and elucidating patient subtypes.
Methods: We analyzed genomics, transcriptomics (gut biopsy samples), and proteomics (blood plasma) from hundreds of patients from SPARC IBD. We trained a machine learning model that classifies UC versus CD samples. In parallel, we integrated multi-omics data to unveil patient subgroups in each of the 2 indications independently and analyzed the molecular phenotypes of these patient subpopulations.
Results: The high performance of the model showed that multi-omics signatures are able to discriminate between the 2 indications. The most predictive features of the model, both known and novel omics signatures for IBD, can potentially be used as diagnostic biomarkers. Patient subgroup analysis in each indication uncovered omics features associated with disease severity in UC patients and with tissue inflammation in CD patients. This culminates with the observation of 2 CD subpopulations characterized by distinct inflammation profiles.
Conclusions: Our work unveiled potential biomarkers to discriminate between CD and UC and to stratify each population into well-defined subgroups, offering promising avenues for the application of precision medicine strategies.