Sensier, Marianne and Gök , Abdullah and Shapira, Philip
(2017).
Sustaining growth for innovative new enterprises: UK firm data.
[Data Collection]. Colchester, Essex:
UK Data Archive.
10.5255/UKDA-SN-851779
This project probes the growth strategies of innovative small and medium-size enterprises (SMEs). Our research focuses on emerging green goods industries that manufacture outputs which benefit the environment or conserve natural resources, with an international comparative element involving the UK, the US, and China.
The project investigates the contributions of strategy, resources and relationships to how innovative British, American, and Chinese SMEs achieve significant growth. The targeted technology-oriented green goods sectors are strategically important to environmental rebalancing and have significant potential (in the UK) for export growth. The research examines the diverse pathways to innovation and growth across different regions. We use a mix of methodologies, including analyses of structured and unstructured data on SME business and technology performance and strategies, case studies, and modelling. Novel approaches using web mining are pioneered to gain timely information about enterprise developmental pathways. Findings from the project will be used to inform management and policy development at enterprise, regional and national levels.
The project is led by the Manchester Institute of Innovation Research at the University of Manchester, in collaboration with Georgia Institute of Technology, US; Beijing Institute of Technology, China, and Experian, UK.
Data description (abstract)
To select the group of UK firms we initially searched in the FAME database (available from the University of Manchester Library) with keywords relating to the green goods sector, please see the publication Shapira, et al (2014, in Technological Forecasting & Social Change, vol. 85, pp. 93-104) for further details on the keywords. This database contains anonymized firm data from a sample of UK firms in the green goods production industry. We combine data from structured sources (the FAME database, patents and publications) with unstructured data mined from firm's web-sites by saving key words in text and summing up counts of these to create additional explanatory variables for firm growth. The data is in a panel from 2003-2012 with some observations missing for firms. We collect historical data from firm's web-sites available in an archive from the Wayback machine.
Data creators: |
Creator Name |
Affiliation |
ORCID (as URL) |
Sensier Marianne |
University of Manchester |
|
Gök Abdullah |
University of Manchester |
|
Shapira Philip |
University of Manchester |
|
|
Sponsors: |
Economic and Social Research Council
|
Grant reference: |
ES/J008303/1
|
Topic classification: |
Economics Trade, industry and markets
|
Keywords: |
firms, sustainable production, web mining
|
Project title: |
Sustaining Growth for Innovative New Enterprises
|
Grant holders: |
Professor Philip Shapira, Professor Alan Harding, FAME Database, Contains National Statistics data © Crown copyright and database right (2015)
|
Project dates: |
From | To |
---|
1 January 2012 | 31 December 2014 |
|
Date published: |
13 May 2015 15:57
|
Last modified: |
09 Nov 2017 17:12
|
Temporal coverage: |
From | To |
---|
1 January 2003 | 31 December 2012 |
|
Collection period: |
Date from: | Date to: |
---|
1 January 2012 | 31 December 2014 |
|
Geographical area: |
Manchester |
Country: |
United Kingdom |
Spatial unit: |
European Union Geographies > NUTS-I Areas European Union Geographies > NUTS-II Areas |
Data collection method: |
We collected the financial information on the UK firms by downloading Companies House data from the FAME database available through the University of Manchester Library (see http://www.library.manchester.ac.uk/searchresources/databases/f/). Grant information on companies came from the Technology Strategy Board. Patent information was from the Derwent database and publication information was from the Web of Science. The Consumer Price index was from the Office for National Statistics (http://www.ons.gov.uk/ons/rel/cpi/consumer-price-indices/index.html). The Human Resources in Science and Technology variable was from the Eurostat database (http://ec.europa.eu/eurostat/data/database). Unstructured data was mined from firm's web-sites. The UK Intellectual Property Office has clarified that the data mining we are doing and the way we are doing it is permissible. See: https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/375954/Research.pdf |
Observation unit: |
Organization |
Kind of data: |
Numeric, Text |
Type of data: |
Business microdata
, Cohort and longitudinal studies, Geospatial data
|
Resource language: |
English |
|
Data sourcing, processing and preparation: |
Data on patents, publications and grants were linked to the relevant firms. Unstructured data was mined from the firm's live web-sites and from archives on the Wayback machine. The data was collated in IBM Content Analytics software in XML format. The data was then imported into Vantage Point software where text fields were merged and the data cleaned. Variables were extracted from Vantage Point and imported into Stata 12 for analysis. Data transformations were undertaken in Stata 12 and the description of these are listed in column 3 of Table 1 in DataTechnicalReportESRC_SI. We standardise the web variables by dividing them by the number of phrases counted across all the firm’s web-pages for the year and then multiply this by 100.
|
Rights owners: |
Name |
Affiliation |
ORCID (as URL) |
Shapira Philip |
University of Manchester |
|
|
Contact: |
Name | Email | Affiliation | ORCID (as URL) |
---|
Sensier, Marianne | marianne.sensier@manchester.ac.uk | University of Manchester | Unspecified |
|
Publisher: |
UK Data Archive
|
Last modified: |
09 Nov 2017 17:12
|
|
Available Files
Data
Documentation
Publications
Probing “green” industry enterprises in the UK: A new identification approach. Philip Shapira, Abdullah Gök, Evgeny Klochikhin and M. Sensier. Technological Forecasting & Social Change, June 2014, vol. 85, pp. 93-104 |
Use of web mining in studying innovation. A. Gök, A. Waterworth and P. Shapira, Scientometrics, January 2015, vol. 102, issue 1, pp 653-671 |
The UK Intellectual Property Office |
Website
Edit item (login required)
 |
Edit Item |