Corpora of patient information sheets and consent forms for UK cancer trials 2007-2017

Isaacs, Talia and Murdoch, Jamie and Demjén, Zsófia and Stevenson, Fiona (2019). Corpora of patient information sheets and consent forms for UK cancer trials 2007-2017. [Data Collection]. Colchester, Essex: UK Data Service. 10.5255/UKDA-SN-853933

Obtaining informed consent is an ethical imperative when conducting research involving human participants. However, participants’ actual level of understanding is often difficult and impractical to assess in operational research. One setting where the stakes for understanding are high due to the potential consequences of research participation is randomised controlled trials (RCTs), which test the effectiveness and safety of medical treatments. However, ethics committees' gatekeeping mechanisms often mean that legalese is mandated in consent forms, which can work against patients’ understanding. The goal of this text-based study was, therefore, to build and analyse a corpus of patient information sheets (PIS) and consent forms (CF) from RCTs conducted in the UK.

Data description (abstract)

This data collection consists of 27 participant information sheets and 23 consent forms freely available on-line. Materials were collected following a comprehensive search for publicly available ethical materials from randomised control trials (RCTs) targeting cancer (2007-17), primarily by systematically searching key on-line databases and monograph series. These corpora, which are different, to our knowledge, than any existing collection of medical English, could further research on information provision for patients in RCTs specifically and in healthcare settings more generally, in addition to advancing the study of the language of written ethical documents. Secondary analyses of these data could be undertaken using techniques from corpus linguistics, computational linguistics, and/or discourse analysis, for example, to investigate the nature and complexity of the language used and/or broach participants’ understanding of ethical principles or preference for how different language functions are expressed.

Data creators:
Creator Name Affiliation ORCID (as URL)
Isaacs Talia University College London
Murdoch Jamie University of East Anglia
Demjén Zsófia University College London
Stevenson Fiona University College London
Sponsors: IOE Seed Funding Scheme 2017, UCL Institute of Education
Topic classification: Media, communication and language
Keywords: ethics, research, cancer, health, linguistics
Project title: What does it take to understand? Using techniques from discourse analysis and corpus linguistics to analyse the language demands of ethical materials in health intervention research
Grant holders: Talia Isaacs, Zsófia Demjén, Fiona Stevenson
Project dates:
1 January 201731 December 2017
Date published: 14 Nov 2019 16:32
Last modified: 14 Nov 2019 16:33

Available Files



Read me


data downloads and page views since this item was published

View more statistics


No resources to display

Edit item (login required)

Edit Item Edit Item