Joint AIML/FHAIVE Workshop Talks – The AI Modelling Lab

Date: 4th October 2023

Time: 09:00 - 11:00

Location: IF 1.15 and online

Talks

We will be running a joint workshop with members of FHAIVE, Dario Greco’s research group based in Tampere, Finland. We will explore some of the applications of AI in the biomedical domain.

Local Organiser: Paola Galdi

The Programme is as follows:

Opening: 9:00 – 9:05

9:05 – 9:20: Building knowledge graphs from electronic health records for adverse event prediction (Paola, AIML)
Abstract: With older age, there is an increased chance of being diagnosed with more than one long-term condition. The medical treatment of patients with multiple conditions is challenging because the interactions of symptoms and medications are complex and hard to predict. In this talk, I will discuss an ongoing project using knowledge-graph methods to detect people who are likely to have unexpected health problems (like falls or bleeding), with the ultimate goal of supporting doctors in the choice of proper treatment and preventive care. I will briefly introduce the Clinical Practice Research Datalink (CPRD) dataset and the data model underlying the knowledge graph. I will then present a first attempt at repurposing a knowledge graph recommender system (KGAT) for adverse event prediction. I will conclude with an overview of the challenges and open questions left to address.
9:20 – 9:35: Graph learning for toxicology and chemical safety assessment (Angela, FHAIVE)
Abstract: Integrating diverse data sources has become crucial to accurately predict the characteristics of drugs and chemicals and uncover novel associations between chemical exposure and human diseases. We developed a knowledge graph that contains manually curated toxicological information related to drugs and chemicals. We started investigating the effectiveness of different network embedding algorithms and the predictive power of their features. We exploited the content of our knowledge graph in multiple applications, including the retrieval of relevant genes involved in COVID-19 pathogenesis and the therapeutics targeting them, prediction of the potential side effects of drugs, and comparison of tissue and cell-line transcriptomic alterations.
9:35 – 9:50: Systems pharmacology for drug discovery (Tonino, FHAIVE)
Abstract: In recent years, the explosion in the number of designed chemicals has caused a “big bang” of the chemical universe. Such a wealth of chemical structures has significantly increased the chances of identifying molecules with pharmacological properties for the treatment of a plethora of complex human diseases. Moreover, we hypothesize that modelling the complexity of the molecular buildup of diseases can be a concrete means to identify effective drug candidates. To this end, network models are at the forefront of this challenge, since they make it possible to investigate the molecular interactions sustaining physiological and pathological processes. By investigating aberrant patterns of connectivity in disease network models, it is possible to pinpoint known and unknown molecular determinants of complex phenotypes, driving drug discovery predictions towards concrete pharmacological solutions.

Break: 9:50 – 10:00

10:00 – 10:15: A multi-dimensional disease map (Lena, FHAIVE)
Abstract: To overcome phenotype-based disease definitions and gain a mechanistic disease understanding, a multi-dimensional view is needed. Combining multiple data layers generates a more informed picture of disease similarity than looking at single dimensions. We mapped the relationships of 500 diseases based on a consensus of six data dimensions including genomic, clinical, and pharmacological data.
10:15 – 10:30: A Bayesian network approach for a robust estimate of disease co-occurrence (Guillermo, AIML)
Abstract: Examining associations between long-term conditions may be important in identifying opportunities for intervention in multimorbidity, but is challenging when data is limited. Previous literature in multimorbidity typically relies on association measures that are flawed at small sample sizes, and does not report confidence intervals or otherwise appropriately account for uncertainty. This is crucial for measures such as Relative Risk, which are known to overestimate rare events. We have developed a Bayesian inference framework that is robust to small data samples and used it to quantify morbidity associations in the oldest old, a population with limited available data. We analysed associations obtained with Relative Risk (RR) and compared them with our proposed measure, Associations Beyond Chance (ABC), examining both pairwise associations and network aggregations. Our Bayesian framework was appropriately more cautious in attributing association when evidence is lacking, particularly in less common conditions. This caution in reporting association was also present in reporting differences in associations between sexes, and affected the aggregated measures of multimorbidity and network representations. Incorporating uncertainty into multimorbidity research is crucial to avoid misleading findings when data is limited, a problem that particularly affects small but important subgroups. Our proposed framework improves the reliability of estimations of associations and, more generally, of research into disease mechanisms and multimorbidity.
10:30 – 10:45: MARS: A Neurosymbolic System for Biomedical Mechanism-of-Action Retrieval (Lauren, AIML)
Abstract: Recently, several machine learning approaches have aided drug discovery by identifying promising candidates and predicting potential indications. However, understanding the ways in which drugs achieve their therapeutic effects, otherwise known as their mechanisms-of-action (MoA), is important for understanding potency, side effects, and interactions with various tissue types, among other things. We leveraged and improved the interpretability of a neurosymbolic reinforcement learning method in an attempt to reveal MoAs. While doing so, we observed that our findings raised several concerns with the reasoning process. Specifically, we debate situations in which patterns following a “guilt-by-association” trend are useful for predictions regarding novel compounds. We present our results to facilitate discussion about how generalizable ML-based models are to the drug discovery process as well as how important interpretability can be to such models.

Discussion/Closing: 10:45 – 11:00