Thompson, Paul (2016). Birmingham Elsevier interdisciplinary research discourse datasets. [Data Collection]. Colchester, Essex: UK Data Archive. 10.5255/UKDA-SN-852198
This project investigated the discourse of interdisciplinary research (IDR) through comprehensive linguistic analyses of the full holdings of a successful IDR journal, Global Environmental Change (GEC) in the period 1990-2010, and of ten other comparison journals published by Elsevier. The ten were chosen to represent other interdisciplinary (ID) journals and monodisciplinary (MD) journals. The corpus data cannot be included in the repository as it belongs to Elsevier – individual files can all be consulted through the Elsevier website.
The main lines of analysis were multidimensional analysis (MDA) for which Doug Biber (Northern Arizona University) acted as a consultant. From the MDA, we derived six constellations in which papers with similar MDA profiles clustered. We then examined the N-grams and P-frames in each constellation – the raw numerical data are available in this repository.
A second computational approach taken was to use topic modelling to establish, in an inductive manner, what the papers in the GEC corpus are ‘about’. The TopicModel folder contains data for this investigation, some of which are discussed in our paper that appears in the Corpora journal (publication mid 2016).
We also conducted survey and interview data analysis and the data are presented here.
Data description (abstract)
This project investigated the discourse of interdisciplinary research (IDR) through comprehensive linguistic analyses of the full holdings of a successful IDR journal, Global Environmental Change (GEC) in the period 1990-2010, and of ten other comparison journals published by Elsevier. The ten were chosen to represent other interdisciplinary (ID) journals and monodisciplinary (MD) journals. The corpus data cannot be included in the repository as it belongs to Elsevier – individual files can all be consulted through the Elsevier website.
The main lines of analysis were multidimensional analysis (MDA). From the MDA, we derived six constellations in which papers with similar MDA profiles clustered. We then examined the N-grams and P-frames in each constellation – the raw numerical data are available in this repository.
A second computational approach taken was to use topic modelling to establish, in an inductive manner, what the papers in the GEC corpus are ‘about’. The TopicModel folder contains data for this investigation.
We also conducted survey and interview data analysis and the (anonymised) data are presented here.
Data creators: |
|
||||||
---|---|---|---|---|---|---|---|
Contributors: |
|
||||||
Sponsors: | ESRC | ||||||
Grant reference: | ES/K007300/1 | ||||||
Topic classification: |
Natural environment Media, communication and language Science and technology |
||||||
Keywords: | multidimensional analysis; topic modelling; interdisciplinary research | ||||||
Project title: | Investigating Interdisciplinary Research Discourse: the case of Global Environmental Change | ||||||
Grant holders: | Paul Thompson, S Hunston | ||||||
Project dates: |
|
||||||
Date published: | 09 Feb 2016 11:05 | ||||||
Last modified: | 09 Feb 2016 11:05 | ||||||