A novel machine learning framework – Mal-ID – can decipher an individual’s immune system’s record of past infections and diseases, according to a new study, providing a powerful tool with the potential for diagnosing autoimmune disorders, viral infections, and vaccine responses with precision.

T_Lymphocyte,_also_known_as_a_T_cell_(yellow_color) (1)

Source: NIAID

Colorized scanning electron micrograph of a T lymphocyte(also known as a T cell)

Traditional clinical diagnostic methods for autoimmune diseases or other immunological pathologies tend to rely on a combination of physical examination, patient history, and various laboratory testing for cellular or molecular abnormalities – a lengthy process often complicated by initial misdiagnoses and ambiguous systems.

READ MORE: From Spanish flu to today: how immune cells keep up with a changing virus

READ MORE: Study uncovers the specialized immune cells that fight COVID-19

These approaches make limited use of data from the patient’s individual adaptive immune system’s B cell receptors (BCRs) and T cell receptors (TCRs).

In response to pathogens, vaccines, and other antigenic stimuli, BCR and TCR repertoires undergo changes through clonal expansion, somatic mutation, and selective reshaping of immune cell populations.

Sequencing BCRs and TCRs could provide a comprehensive diagnostic tool, potentially enabling simultaneous detection of infectious, autoimmune, and immune-mediated diseases in a single test. However, the extent to which immune receptor repertoire sequencing alone can reliably and broadly classify diseases remains uncertain.

Disease signatures

To address this, Maxim Zaslavsky and colleagues developed Mal-ID (MAchine Learning for Immunological Diagnosis) – a 3-model machine learning framework that analyzes immune receptor datasets to identify signatures of infectious and immunological diseases and vaccine responses in patients.

Zaslavsky et al. trained Mal-ID on BCR and TCR data systematically collected from 593 individuals, including patients with COVID-19, HIV, and type-1 diabetes, as well as influenza vaccine recipients and healthy controls.

According to the findings, Mal-ID effectively distinguished six distinct disease states in 550 paired BCR and TCR samples with a multiclass AUROC score of 0.986, indicating exceptionally high classification accuracy. This metric reflects the model’s ability to rank positive cases above negative ones across all disease comparisons.

Although the model was successful in differentiating COVID-19, HIV, lupus, T1D, and healthy individuals – illustrating its potential as a powerful diagnostic tool – Zaslavsky et al. note that the approach still needs to be refined, using clinical information, before it could be used with confidence in clinical applications.