Updated on Sat 29 November 2014


See also talks, slides & videos for more recent post-academia work.


Alison Cuff, Ian Sillitoe, Tony Lewis, Andrew B. Clegg, Robert Rentzsch, Nicholas Furnham, Marialuisa Pellegrini-Calace, David T. Jones, Janet Thornton and Christine A. Orengo, Extending CATH: Increasing Coverage of the Protein Structure Universe and Linking Structure with Function, in Nucleic Acids Research Database Issue 39 (2010).

Juan A. G. Ranea, Ian Morilla, Jon G. Lees, Adam J. Reid, Corin Yeats, Andrew B. Clegg, Francisca Sánchez Jiménez and Christine Orengo, Finding the 'Dark Matter' in Human and Yeast Protein Network Prediction and Modelling, in PLoS Computational Biology 6:9 (2010).

Contributor, The management of bacterial meningitis and meningococcal septicaemia in children and young people younger than 16 years in primary and secondary care (National Institute for Health and Clinical Excellence, 2010).

Adam J. Reid, Juan A. G. Ranea, Andrew B. Clegg and Christine A. Orengo, CODA: Accurate Detection of Functional Associations between Proteins in Eukaryotic Genomes Using Domain Fusion, in PLoS ONE 5:6 (2010).

Steve Pettifer, Jon Ison, Matus Kalas, Dave Thorne, Philip McDermott, Inge Jonassen, Ali Liaquat, Jose M. Fernandez, Jose M. Rodriguez, INB-Partners, David G. Pisano, Christophe Blanchet, Mahmut Uludag, Peter Rice, Edita Bartaseviciute, Kristoffer Rapacki, Maarten Hekkelman, Olivier Sand, Heinz Stockinger, Andrew B. Clegg, Erik Bongcam-Rudloff, Jean Salzemann, Vincent Breton, Teresa K. Attwood, Graham Cameron and Gert Vriend, The EMBRACE Web Service Collection, in Nucleic Acids Research Web Servers Issue (2010).


Corin Yeats, Jon Lees, Oliver Redfern, Andrew Clegg and Christine Orengo, Gene3D: Merging Structure and Function For a Thousand Genomes, in Nucleic Acids Research Database Issue 38:D296-D300 (2009).

Pascal Kahlem, Andrew Clegg, Florian Reisinger, Ioannis Xenarios, Henning Hermjakob, Christine Orengo and Ewan Birney, ENFIN — A European network for integrative systems biology, in Comptes Rendus Biologies 332:11 (2009).

Contributor, Reducing differences in the uptake of immunisations (National Institute for Health and Clinical Excellence, 2009).

Jose M. G. Izarzugaza, Anja Baresic, Lisa E. M. McMillan, Corin Yeats, Andrew B. Clegg, Christine A. Orengo, Andrew C. R. Martin and Alfonso Valencia, An integrated approach to the interpretation of Single Amino Acid Polymorphisms within the framework of CATH and Gene3D, in BMC Bioinformatics 10 (Suppl 8):S5 (2009).

Renata Kabiljo, Andrew B. Clegg and Adrian J. Shepherd, A realistic assessment of methods for extracting gene/protein interactions from free text, in BMC Bioinformatics 10:233 (2009).


Andrew B. Clegg, Computational-Linguistic Approaches to Biological Text Mining (730KB PDF), PhD thesis, Birkbeck College (London: 2008).

Andrew B. Clegg and Adrian J. Shepherd, Syntactic pattern matching with GraphSpider and MPL, in Proceedings of the Third International Symposium on Semantic Mining in Biomedicine (SMBM'08) (Turku, Finland: 2008). Code available on SourceForge.

Andrew B. Clegg and Debbie Pledge, Streamlining the clinical guideline production process with fuzzy citation matching, in Proceedings of the First Conference on Text and Data Mining of Clinical Documents (Louhi'08) (Turku, Finland: 2008).

Andrew B. Clegg and Adrian J. Shepherd, Text mining, in Jon Keith (ed.), Bioinformatics Volume II: Structure, Function and Applications (Humana Press, New Jersey: 2008).


Christian Guy, Emma Goddard, Emily Milner, Lisa Murch, and Andrew B. Clegg, Looking into the core of the sun, in Hasok Chang and Catherine Jackson (eds.), An Element of Controversy: The Life of Chlorine in Science, Medicine, Technology and War (British Society for the History of Science: 2007).

Andrew B. Clegg and Adrian J. Shepherd, Benchmarking natural-language parsers for biological applications using dependency graphs, in BMC Bioinformatics 8:24 (2007).

Andrew B. Clegg and Adrian J. Shepherd, Evaluating and integrating treebank parsers on a biomedical corpus, in Proceedings of the Association for Computational Linguistics Workshop on Software (Ann Arbor, Michigan: 2005).

No comments? I'm no longer sure blog comments are relevant. I'd rather you replied on Twitter, or wrote a response on your own blog or a site like Medium.

All content (cc) Andrew Clegg, under Creative Commons Attribution-ShareAlike 4.0 License. Built on Pelican & Python. Theme based on svbhack by Giulio Fidente.