gerdracor-coref

German Drama Corpus for Coreference

commit 6a548c7fbe38b6194262b75e12bd501555bd81dc
parent 894e3d9f401eee31957d308bd026fd133ee328cd
Author: Janis Pagel <janis.pagel@ims.uni-stuttgart.de>
Date:   Mon,  2 Dec 2019 00:26:33 +0100

Update Readme

Diffstat:
M README.md | 6 +++---
1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/README.md b/README.md
@@ -2,7 +2,7 @@
 
 ## General Information
 
-The GerDraCor-Coref (German Drama Corpus for Coreference) is a fork of the [GerDraCor](https://github.com/dracor-org/gerdracor) and contains coreference annotations for a subset of the GerDraCor texts. The texts are all German dramatic texts, written between 1730 and 1920. Annotated are all noun phrases, singletons were removed though. Additionally, generic entities, abstract anaphora and amiguous mentions are also marked explicitely. In case of the latter two, only a part of the corpus has been annotated.
+The GerDraCor-Coref (German Drama Corpus for Coreference) is a fork of the [GerDraCor](https://github.com/dracor-org/gerdracor) and contains coreference annotations for a subset of the GerDraCor texts. The texts are all German dramatic texts, written between 1730 and 1920. Annotated are all noun phrases, singletons were removed. Additionally, generic entities, abstract anaphora and amiguous mentions are also marked explicitely. In case of the latter two, only a part of the corpus has been annotated.
 
 ### File Naming
 
@@ -33,7 +33,7 @@ For the texts that have not been fully annotated, we additionally provide TEI ou
 
 ### XMI
 
-As the XMI files can become quite large, they have been compressed using `gzip`. Uncompress them by going into a command line and run
+As the XMI files can become quite large, they have been compressed using `gzip`. Uncompress them by entering a command line and run
 
 ```sh
 $ gzip -d <FILENAME>.xmi.gz
@@ -46,7 +46,7 @@ DIRNDL is a file format based on the CoNLL format, but additionally also contain
 ## Organization
 
 The annotations are sorted into folders according to the different output formats.
-Parallel annotations by different annotators are organized into branches in the git tree. The main annotations are located in the `gold` branch. Partial annotations are sorted under the main folder in a folder called `part`. 
+Parallel annotations by different annotators are organized into branches in the git tree. The main annotations are located in the `gold` branch. Partial annotations are sorted under the main folder in a subfolder called `part`.
 
 ### Folder structure
 