British Academic Written English Corpus

Title

British Academic Written English Corpus

Author

Nesi, Hilary; Gardner, Sheena; Thompson, Paul; Wickens, Paul

Availability

Available for non-commercial use on condition that this header is included in its entirety with any copy distributed. Registration or request via our order form is required.

As this resource is restricted in some way, you will have to apply for approval to get a copy.

Languages

English

Editorial Practice

Encoding format: TEI XML

OTA keywords

Linguistic corpora
Corpus

LC keywords

Linguistics
Linguistic analysis (Linguistics)

Extent

  • designation: CollectionText
  • size: 206 files: ca. 61.9 MB

Creation Date

The resource was created between 2004 and 2007.

Source Description

The original assignments are held anonymised as PDF documents by the project team.

Notes

Title proper taken from OTA Catalogue Form

The BAWE corpus contains 2761 pieces of proficient assessed student writing, ranging in length from about 500 words to about 5000 words. Holdings are fairly evenly distributed across four broad disciplinary areas (Arts and Humanities, Social Sciences, Life Sciences and Physical Sciences) and across four levels of study (undergraduate and taught masters level). Thirty-five disciplines are represented. The assignments have been annotated using a system devised in accordance with the TEI guidelines. A DTD file, tei_bawe.dtd, accompanies the corpus and must be kept in the same folder as the corpus files. The holdings are described in an Excel spreadsheet, 'BAWE.xls'. The transcription and mark-up conventions are described in the BAWE manual, which is in PDF format.
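
Because the corpus files depend on tei_bawe.dtd being in the same folder, it can be worth checking a local copy before parsing it. The following is a minimal Python sketch, not part of the distributed resource. It assumes the corpus files carry an .xml extension; the function name check_corpus_folder and the folder path in the usage note are hypothetical.

```python
from pathlib import Path

# File name taken from the note above.
DTD_NAME = "tei_bawe.dtd"


def check_corpus_folder(folder):
    """Return (dtd_present, xml_files) for a folder of corpus files.

    dtd_present is True when tei_bawe.dtd sits directly in the folder.
    xml_files is a sorted list of the *.xml file names found there
    (the .xml extension is an assumption, not stated in the record).
    """
    folder = Path(folder)
    xml_files = sorted(p.name for p in folder.glob("*.xml"))
    dtd_present = (folder / DTD_NAME).is_file()
    return dtd_present, xml_files
```

For example, calling check_corpus_folder("path/to/corpus") on a local copy returns a pair whose first element is False if the DTD has been separated from the corpus files, in which case a validating parser may fail to resolve it.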