AQMAR Arabic Wikipedia Supersense Corpus
This is a 65,000-token corpus of 28 Arabic Wikipedia articles hand-annotated for nominal supersenses. It extends the
Named Entity Corpus
and was developed by
Nathan Schneider
,
Behrang Mohit
,
Kemal Oflazer
, and
Noah Smith
as part of the
AQMAR
project.
Download
AQMAR_Arabic_SST_corpus-1.0.zip
(
README
,
VERSION
,
LICENSE
,
guidelines
,
examples
)
Further Reading
Please cite the following if you write any papers involving the use of the data above:
Coarse Lexical Semantic Annotation with Supersenses: An Arabic Case Study
Nathan Schneider
,
Behrang Mohit
,
Kemal Oflazer
, and
Noah A. Smith
.
In
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics
, Jeju, South Korea, July 2012.
Acknowledgments
This research was supported by
Qatar National Research Fund
grant
NPRP
08-485-1-083.
Contact
Please e-mail
nschneid [strudel] cs.cmu.edu
or
behrang [strudel] cmu.edu
with questions.