The Shepherd Group

Overview

Computational immunology (or immunoinformatics) is the application of computational methods to research problems in the field of immunology. The Shepherd Group uses diverse computational methods — ranging from molecular dynamics to deep learning — to elucidate (mainly) human immune responses to pathogens, protein therapeutics and cancer.

Key research themes are:

The analysis of adaptive immune repertoires — large sets of B-cell receptor (antibody) and T-cell receptor sequences serived from Next Generation Sequencing. We host and contribute to the ongoing development of OGRDB: the Open Germline Receptor Database (Lees, et al., NAR, 2020) and VDJbase: the adaptive immune receptor Genotype and Haplotype database (Omer et al., NAR, 2020). Currently we are developing novel deep learning methods to address key challenges, such as how to identify the subset of receptors targeted at a specific antigen (e.g. virus). We are also active members of The AIRR Community (part of The Antibody Society), developing tools and resources to support the analysis of B-cell and T-cell receptor repertoire data.
Using molecular dynamics (MD) simulations to elucidate the function of important molecules. For example, we've use MD shed light on whether mutations in the stalk of haemagglutinin may enable influenza A virus to escape from broadly-neutralising antibodies (Lees et al., Frontiers in Immunology, 2017) and to gain insight into the dynamic properties of the hepatitis C virus cell entry machinery (Stejskal et al., PLOS Computational Biology, 2020).
Exploring the boundary between self and non-self. For example, we have developed a novel method called proteome scanning for predicting whether small sequence differences between an individual's endogenous Factor VIII and therapeutic Factor VIII are likely to break self-tolerance (Hart et al., Haematologica, 2019).

Some images from past research

Analysis of an antibody repertoire after vaccination. Left: Network diagram showing the abundance of a small subset of heavy chain CDR3s in the vicinity of a neutralising antibody of interest. Sequences identified using the AbMining ToolBox (D'Angelo et al., 2014). Centre: Length distribution of heavy chain CDR3s. Right: Lineage plot showing light chain CDR3s after sequence assembly.

Epitopes and antigenic escape. Top left: Distribution of key energetic residues on three structural epitopes (PDBs 1NBZ, 1VBF and 1JRH). Bottom left: Amino acid frequency plot for a single haemagglutinin residue showing antigenic activity in the period 1988-97 prior to N-glycosylation. Right: Canonical antigenic sites of influenza A H3N2 haemagglutinin.

Haemophilia and hepatitis B virus. Left: Heatmap showing the predicted risk of antibody development associated with different combinations of HLA-DR allele (y-axis) and Factor VIII mutation (x-axis). Right: The frequency (y-axis) of epitopes in the hepatitus B virus core antigen with different binding affinities (x-axis) for different class II HLA alleles that are known to be protective.

On this page...

Overview
Images

Research Presentations

"What can we learn from immune repertoire data?", at the Institute of Hapatology, King's College London (27/09/2022)

"Assessing the risk of alloimmune responses by scanning the human proteome", at the 5th Immunogenicity and Tolerance Seminar, Amsterdam, Netherlands (15/11/2019)

"Identifying haemophiliacs with anti-FVIII Abs: a case study in repertoire sequencing analytics", at the Antibody Engineering & Therapeutics Europe conference, Amsterdam, Netherlands (13/06/2019)

"Alloimmunity, the proteome and antibody repertoire analytics: predicting inhibitor risk in haemophilia A patients", research seminar, School of Biosciences, University of Kent, Canterbury (27/03/2018)

"Antibody repertoire analytics: deep sequencing meets the evolution within us", at a BIDA lunchtime workshop, Birkbeck (20/06/2017)

"The contribution of antibody repertoire sequencing to HIV vaccine design and beyond", at the Using Next Generation Sequencing for Improved Therapeutic Antibody Design meeting, University of Oxford (8/03/2017)

"Informing HIV immunogen design by characterizing clonally related sequence sets with NGS", VAAVV 2015, the Vaccines Against Antigenically Variable Viruses Symposium, Ames, Iowa, US (6/11/2015)

"Case studies in computational immunology: protein therapeutics,surveillance and vaccine design", PHE (Public Health England) Porton, Porton Down, Wiltshire (29/04/2015)

"Self-tolerance, surveillance and vaccine design: case studies in computational immunology", The Pirbright Seminar Series, Pirbright Institute, Surrey (5/03/2015)

"Immunoinformatics and computational virology: making a contribution to surveillance and vaccine design", Respiratory Viruses 2014, St Hilda's College, Oxford (8/09/2014)

"Immunoinformatics and computational virology — what they can do for you", Institute of Hepatology, London (4/06/2014)

"Searching for epitopes: case studies in computational virology", Center for Advanced Host Defense, Immunobiotics, and Translational Comparative Medicine (CAHDIT), Iowa State University, Ames, Iowa, USA (14/11/2013)

"Selective pressure in the influenza A H3 stalk — a challenge for broad-spectrum vaccines?" at Influenza2013 conference, Oxford (William Lees, 18/09/2013)

"Stratifying patients with mild/moderate haemophilia A: a computational pharmacogenetic pipeline", The Centre for Systems and Synthetic Biology, Brunel University (7/11/2012)

"Conformational epitope prediction for Influenza A haemagglutinin", Bio-therapeutics Workshop, The EBI Industry Programme, EBI, Hinxton, Cambridge (12/04/2011)

"The automated construction of gene/protein networks from the literature", University of Dundee (28/01/2010)

"Trends in Biomedical Text Mining", Institute of Cancer Research, Sutton (12/05/2009)

"A Computational Grid Framework for Immunological Applications", UK e-Science All Hands Meeting 2008, Edinburgh (10/09/2008)

"A fast and flexible approach to data integration using a generic schema", poster, UK e-Science All Hands Meeting 2008, Edinburgh (9/09/2008)

"Protein Name Tagging in the Immunological Domain", poster and spotlight presentation, Third International Symposium on Semantic Mining in Biomedicine (SMBM 2008), Turku, Finland (2/09/2008)

"Biomedical Text Mining: Some Examples and Challenges", Biomedical
Textmining/Machine Learning Clinic, Centre for Computational Statistics and Machine Learning, UCL (30/04/2008)

"ImmunoGrid-PI2S2 Integration Architecture", Grid Open Days, Università di Palermo, Sicily (7/12/2007)

"ImmunoGrid — Towards a Simulation of the Human Immune System on a Natural Scale", Department of Biological Sciences, University of Essex (8/11/2007)

"ProSpecTome: A new tagged corpus for protein named entity recognition" at BioLINK 2007, Vienna, Austria (19/07/2007)

"ImmunoGrid Technology Report" at ICT for BIO-Medical Sciences Conference 2006, Bruxelles, Belgium (29/06/2006)

"Neural networks for protein structure prediction" at the Unilever Centre for Molecular Informatics, University of Cambridge (29/04/2003)

"The PFDB — a generic protein family database" at III Jornadas de Bioinformatica Symposium on Bioinformatics & Computational Biology at the Centro de Investigación del Cáncer, Universidad de Salamanca, Spain (18/09/2002).