GerManC. A Historical Corpus of German Newspapers 1650-1800

Title

GerManC. A Historical Corpus of German Newspapers 1650-1800 [Electronic resource]

Editor Durrell, Martin (ed.); Ensslin, Astrid (ed.); Bennett, Paul (ed.)
Availability This resource is freely available, you should be able to download it now.
Languages

German

Editorial Practice

Encoding format: TEILite XML

OTA keywords Linguistic corpora
Corpus
LC keywords

Linguistic analysis (Linguistics)
German language--Written German

Extent
  • designation: Text data
  • size: 154 files : ca. 5.54 MB
Creation Date 2006- April 2007
Source Description

Various. See source file : sources.doc : . Various German newspapers Germany: 1650-1800

Note: For updated information visit the web site http://www.llc.manchester.ac.uk/research/projects/germanc/

Notes

Mode of access: Online. OTA website

The corpus consists of 45 text samples of some 200 words each from German newspapers of the early modern period 1650-1800. There are three texts each from five German regions - North Germany, West Germany, East Central Germany, South-West Germany (including Switzerland) and South-East Germany (including Austria) - for each fifty year sub-period. The corpus consists first of a set of unannoted text files contained in sets of folders for each region and period, and also of files with each individual text, and secondly of a set of xml files each of which contains one fully annotated text, organised into folders according to sub-period and region. In addition there is a documentation file which provides a full account of each stage of the corpus construction and annotation, together with any necessary modification of TEI standards. This file also contains a complete reference list of all the names of places, organisations and historical personages occuring in the corpus. Finally, a source file provides full bibiographic details of the original texts