Birmingham Elsevier interdisciplinary research discourse datasets

Thompson, Paul (2016). Birmingham Elsevier interdisciplinary research discourse datasets. [Data Collection]. Colchester, Essex: UK Data Archive. 10.5255/UKDA-SN-852198

This project investigated the discourse of interdisciplinary research (IDR) through comprehensive linguistic analyses of the full holdings of a successful IDR journal, Global Environmental Change (GEC) in the period 1990-2010, and of ten other comparison journals published by Elsevier. The ten were chosen to represent other interdisciplinary (ID) journals and monodisciplinary (MD) journals. The corpus data cannot be included in the repository as it belongs to Elsevier – individual files can all be consulted through the Elsevier website.

The main lines of analysis were multidimensional analysis (MDA) for which Doug Biber (Northern Arizona University) acted as a consultant. From the MDA, we derived six constellations in which papers with similar MDA profiles clustered. We then examined the N-grams and P-frames in each constellation – the raw numerical data are available in this repository.

A second computational approach taken was to use topic modelling to establish, in an inductive manner, what the papers in the GEC corpus are ‘about’. The TopicModel folder contains data for this investigation, some of which are discussed in our paper that appears in the Corpora journal (publication mid 2016).

We also conducted survey and interview data analysis and the data are presented here.

Data description (abstract)

This project investigated the discourse of interdisciplinary research (IDR) through comprehensive linguistic analyses of the full holdings of a successful IDR journal, Global Environmental Change (GEC) in the period 1990-2010, and of ten other comparison journals published by Elsevier. The ten were chosen to represent other interdisciplinary (ID) journals and monodisciplinary (MD) journals. The corpus data cannot be included in the repository as it belongs to Elsevier – individual files can all be consulted through the Elsevier website.
The main lines of analysis were multidimensional analysis (MDA). From the MDA, we derived six constellations in which papers with similar MDA profiles clustered. We then examined the N-grams and P-frames in each constellation – the raw numerical data are available in this repository.
A second computational approach taken was to use topic modelling to establish, in an inductive manner, what the papers in the GEC corpus are ‘about’. The TopicModel folder contains data for this investigation.
We also conducted survey and interview data analysis and the (anonymised) data are presented here.

Data creators:
Creator Name Affiliation ORCID (as URL)
Thompson Paul University of Birmingham https://orcid.org/0000-0002-9595-3757
Contributors:
Name Affiliation ORCID (as URL)
Hunston Susan University of Birmingham
Sponsors: ESRC
Grant reference: ES/K007300/1
Topic classification: Natural environment
Media, communication and language
Science and technology
Keywords: multidimensional analysis; topic modelling; interdisciplinary research
Project title: Investigating Interdisciplinary Research Discourse: the case of Global Environmental Change
Grant holders: Paul Thompson, S Hunston
Project dates:
FromTo
30 August 20133 November 2015
Date published: 09 Feb 2016 11:05
Last modified: 09 Feb 2016 11:05

Available Files

Data

Documentation

Read me

Downloads

data downloads and page views since this item was published

View more statistics

Altmetric

Edit item (login required)

Edit Item Edit Item