Dalrymple, Mary and Mofu, Suriel (2016). On-line language documentation for Biak (Austronesian). [Data Collection]. Colchester, Essex: UK Data Archive. 10.5255/UKDA-SN-851820
The purpose of this project was to create an on-line database of digital audio texts and their analysed and annotated transcriptions in Biak, an Austronesian language spoken in Papua by 50,000-70,000 speakers. The project benefits the academic linguistic community by making digitised Biak audio and annotated transcriptions freely available for further linguistic analysis and theory development; the community of Biak speakers in West Papua by creating a permanent on-line storehouse of a representative variety of Biak texts in both audio and written form; and the project partners at Universitas Negeri Papua and Universitas Cenderawasih by providing training and experience in the tools and best-practice methods of language documentation, along with the practical skills to undertake future documentation efforts for the hundreds of under-described languages of the region.
Data description (abstract)
This collaborative project involved the University of Oxford and two universities in Papua, Universitas Cenderawasih and Universitas Negeri Papua, in the creation of an on-line database of 52 digital audio and video texts and the linguistically annotated transcriptions and translations of 23 of the texts for the Austronesian language Biak, a language with about 50,000-70,000 speakers in Papua. These resources provide a snapshot of audio and textual data on the language, and are useful for language preservation efforts, for ongoing efforts to produce teaching materials in the indigenous languages of Papua, and as a basis for the creation of dictionaries and glossaries in the language. Since they are linguistically annotated, they are also useful for linguists conducting research on Biak and related Austronesian languages.
Data creators: |
Creator Name | Affiliation | ORCID (as URL) |
---|
Dalrymple Mary | University of Oxford | |
Mofu Suriel | Universitas Negeri Papua | |
|
Contributors: |
Name | Affiliation | ORCID (as URL) |
---|
Alfons Arsai | State University of Papua | |
Rumbrawer Franc | Cenderawasih University | |
|
Sponsors: |
ESRC
|
Grant reference: |
RES-000-22-3788
|
Topic classification: |
Education; Society and culture
|
Keywords: |
Biak
|
Project title: |
On-line language documentation for Biak (Austronesian)
|
Alternative title: |
Biak (Austronesian): Annotated, transcribed and translated audio texts and wordlists
|
Grant holders: |
Mary Dalrymple, Suriel Mofu
|
Project dates: |
From | To |
---|
1 October 2009 | 30 September 2010 |
|
Date published: |
14 Apr 2015 13:33
|
Last modified: |
26 Feb 2016 12:51
|
Collection period: |
Date from: | Date to: |
---|
1 October 2009 | 30 September 2010 |
|
Geographical area: |
Various locations in Papua/West Papua |
Country: |
Indonesia |
Spatial unit: |
No Spatial Unit |
Data collection method: |
The recordings were the result of face-to-face and telephone interviews.
Besides the 52 digital audio and video files, we provide annotated transcriptions of 23 of the texts, produced using Toolbox, a freely available data management and analysis tool for language documentation. Toolbox supports the creation of resources in various forms: transcribed texts with free translations into Indonesian and English (of most use to the Biak-speaking community and for pedagogical use in Papua), and linguistically annotated transcriptions in two forms. The first is a standard human-readable form like the paper-based corpora familiar to linguists; the second is a conversion of this form to XML via the utility tools for Toolbox, suitable for computer analysis and database search. |
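As a rough illustration of the two annotated forms mentioned above, the sketch below reads backslash-marked plain text of the kind Toolbox typically stores and turns it into a simple XML tree. It is not the project's own conversion utility. The marker names (`ref`, `tx`, `ft`), the XML element names, and the sample text are all assumptions made for the example; the placeholder words are not Biak data, and the markers used in the deposited files may differ.

```python
import xml.etree.ElementTree as ET


def parse_toolbox(text, record_marker="ref"):
    """Split backslash-marked text into records.

    Each record is a list of (marker, value) pairs. A new record starts
    whenever `record_marker` is seen (assumed here to be "ref").
    """
    records = []
    current = None
    for raw in text.splitlines():
        line = raw.rstrip()
        if not line:
            continue  # blank lines only separate records visually
        if line.startswith("\\"):
            marker, _, value = line[1:].partition(" ")
            if marker == record_marker or current is None:
                current = []
                records.append(current)
            current.append((marker, value.strip()))
        elif current:
            # A line without a marker continues the previous field.
            marker, value = current[-1]
            current[-1] = (marker, (value + " " + line.strip()).strip())
    return records


def records_to_xml(records):
    """Build a simple XML tree: one <record> per record, one child per field."""
    root = ET.Element("text")  # hypothetical root element name
    for fields in records:
        rec = ET.SubElement(root, "record")
        for marker, value in fields:
            ET.SubElement(rec, marker).text = value
    return root


# Invented placeholder sample; not taken from the Biak corpus.
SAMPLE = """\\ref sample.001
\\tx word1 word2
\\ft placeholder free translation

\\ref sample.002
\\tx word3
   word4
\\ft another placeholder
"""

records = parse_toolbox(SAMPLE)
root = records_to_xml(records)
print(ET.tostring(root, encoding="unicode"))
```

Keeping each record as an ordered list of pairs, rather than a dictionary, preserves repeated markers and the original field order, which matters for interlinear text where the lines are aligned with one another.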
Observation unit: |
Individual |
Kind of data: |
Audio, Text, Video |
Type of data: |
Qualitative and mixed methods data |
Resource language: |
English |
|
Data sourcing, processing and preparation: |
The digitised audio files were produced from audio tapes recorded in Papua and digitised at Oxford's Phonetics Laboratory. The annotated transcriptions were produced using Toolbox.
Data was collected in line with the standards of Oxford’s Central University Research Ethics Committee (CUREC), which provides rules and guidance on data collection from both literate and nonliterate speakers. Signed consent forms were obtained from all literate participants in the audio recordings. Nonliterate participants indicated their consent by thumbprint on the consent form, which was read and explained to them. Data has been anonymised.
|
Rights owners: |
Name | Affiliation | ORCID (as URL) |
---|
Dalrymple Mary | University of Oxford | |
Mofu Suriel | State University of Papua | |
Rumbrawer Franc | Cenderawasih University | |
|
Contact: |
Name | Email | Affiliation | ORCID (as URL) |
---|
Dalrymple, Mary | mary.dalrymple@ling-phil.ox.ac.uk | University of Oxford | Unspecified |
|
Publisher: |
UK Data Archive
|