gerdracor-coref

German Drama Corpus for Coreference
Log | Files | Refs | README | LICENSE

commit 7bdbb8ea4c045fc46757304c143963f9d8a862f7
parent 7f5bf5ce0b5c9107df8a7f98e60e9aa1ecba967d
Author: Janis Pagel <janis.pagel@ims.uni-stuttgart.de>
Date:   Mon,  7 Oct 2019 16:47:10 +0200

Update README

Diffstat:
MREADME | 12++++++++++++
1 file changed, 12 insertions(+), 0 deletions(-)

diff --git a/README b/README @@ -25,6 +25,14 @@ We provide several formats to represent the corefence annotations: For the texts that have not been fully annotated, we only provide CoNLL output for the parts that have been annotated. The XMI and TEI output always contain the full text. +### XMI + +As the XMI files can become quite large, they have been compressed using `gzip`. Uncompress them by going into a command line and run + +```sh +$ gzip -d <FILENAME>.xmi.gz +``` + ## Organization The annotations are sorted into folders according to the different output formats. @@ -46,3 +54,7 @@ $ tree -d $ git branch * gold ``` + +## Contribution + +We appreciate contributions regarding extensions, bug fixes and the like. Please feel free to create issues or pull requests.