--- title: 'My Work' type: page --- On this page you will find summaries of projects that I've worked on including both software and scientific research. ### Contents - [Software](#software) - [General/Open Source](#generalopen-source) - [TimeTrack](#timetrack) - [NLP / ML](#nlp--ml) - [CDCRTool](#cdcrtool) - [Partridge](#partridge) - [Sapienta](#sapienta) - [Academic Publications](#academic-publications) - [2021](#2021) - [2019](#2019) - [2018](#2018) - [2017](#2017) - [2016](#2016) - [2013](#2013) # Software ## General/Open Source ### TimeTrack [](https://github.com/ravenscroftj/timetrack) A small command-line tool I wrote for monitoring my time spent on projects - it has API integration with the popular SaaS timesheet tool [Harvest](https://www.getharvest.com/) ## NLP / ML ### CDCRTool [](https://github.com/ravenscroftj/cdcrtool) A tool for annotating co-references of entities that occur in linked news paper article/scientific paper pairings. Some 'sharp' code but this was my first venture into 'full stack' using ReactJS on the frontend and Flask on the backend. The repository also contains the final corpus which we made available as part of our EACL21 publication. ### Partridge [Website](https://papro.org.uk/) | [](https://github.com/ravenscroftj/partridge) A scientific paper indexing system that uses machine learning to enrich papers in order to make them more easy to search and filter. Originally written in Python 2 with xml-rpc worker processes and recently updated to use Python 3 and [dramatiq](https://dramatiq.io/) for concurrency. ### Sapienta [Website](http://www.sapientaproject.com/) | [Live Instance](https://sapienta.papro.org.uk/) | [](https://github.com/ravenscroftj/sapienta) An NLP pipeline for processing and enriching scientific papers with sentence-level information about their core scientific concepts (CoreSCs). This is a Python 3 implementation of Prof Maria Liakata's 2010 paper. We provide a free web service for low volume requests and a simple to use docker configuration for those who want to run the software over a larger number of papers. # Academic Publications Below are links to my various publishing profiles in case you prefer to follow me on an external site/silo: - [ORCID](https://orcid.org/0000-0003-0293-9452) - [Google Scholar Profile](https://scholar.google.co.uk/citations?user=e43P2foAAAAJ&hl=en) - [Semantic Scholar](https://www.semanticscholar.org/author/145681615) ### 2021
Ravenscroft J., Oellrich A., Saha S., & Liakata M. Multi-label Annotation in Scientific Articles - The Multi-label Cancer Risk Assessment Corpus
Duma D., Liakata M., Clare A., Ravenscroft J., & Klein E. Applying Core Scientific Concepts to Context-Based Citation Recommendation
Duma D., Liakata M., Clare A., Ravenscroft J., & Klein E. Rhetorical Classification of Anchor Text for Citation Recommendation