Improving the Performance of Coreference Resolvers Across Datasets by Leveraging Target Entities

This repository contains code for self-labeling method which uses Named Entities from target data to automatically add coreference annotations and improve coreference resolver performance.

Install requirements

Setup a Python virtual environment

virtualenv venv-my --python=python3.6 or python3 -m venv venv-my 
source venv-my/bin/activate

Install the requirements:

pip install -r requirements.txt

Running the experiments

1. Start the Stanford CoreNLP server

nohup java -mx16g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer -port 9000 -timeout 30000 &

2. Run self-labeling process, the parameter should be a json file consistent with the dataset above:

python3 annotate.py manual.json

3. Split the annoated data into training subset

python3 split_data.py path/to/annotated/data

Now the data is ready for training coreference resolution models.

4. Train e2e-coref model on the split data

Please refer to e2e-coref for detailed instructions about training and evaluating the model.

5. Pronoun evaluation

python3 scorer.py key_file prediction_file pro

Please refer to CoVal for more details about pronoun evaluation.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data		data
pro_eval		pro_eval
self-labeling		self-labeling
.gitignore		.gitignore
README.md		README.md
get_conll_static.py		get_conll_static.py
get_statics.py		get_statics.py
local_minimize_check.py		local_minimize_check.py
random_choose_documents.py		random_choose_documents.py
requirements.txt		requirements.txt
retrieve_document.py		retrieve_document.py
split_data.py		split_data.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Improving the Performance of Coreference Resolvers Across Datasets by Leveraging Target Entities

Install requirements

Running the experiments

About

Uh oh!

Releases

Packages

Languages

mingzhu-wu/self-labeling-coref-annotation

Folders and files

Latest commit

History

Repository files navigation

Improving the Performance of Coreference Resolvers Across Datasets by Leveraging Target Entities

Install requirements

Running the experiments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages