An official website of the European Union How do you know?      
European Commission logo
JRC Publications Repository Menu

JRC Eurovoc Indexer JEX – A freely available multi-label categorisation tool

cover
Eurovoc (2011) is a highly multilingual thesaurus consisting of over 6,700 hierarchically organised subject domains used by European Institutions and many authorities in Member States of the European Union (EU) for the classification and retrieval of official documents. JEX is JRC-developed multi-label classification software that learns from manually labelled data to automatically assign Eurovoc descriptors to new documents in a profile-based category-ranking task. The JEX release consists of trained classifiers for all 23 official EU languages, of parallel training data in the same languages, of an interface that allows viewing and amending the assignment results, and of a module that allows users to re-train the tool on their own document collections. JEX allows advanced users to change the document representation so as to possibly improve the categorisation result through linguistic pre-processing. JEX can be used as a tool for interactive Eurovoc descriptor assignment to improve speed and consistency of the human categorisation process, or it can be used fully automatically. The output of JEX is a language-independent Eurovoc feature vector lending itself to tasks such as cross-lingual clustering and classification.
2013-03-22
European Language Resources Agency (ELRA)
JRC67293
http://www.lrec-conf.org/proceedings/lrec2012/index.html,    http://www.lrec-conf.org/proceedings/lrec2012/pdf/875_Paper.pdf,    https://publications.jrc.ec.europa.eu/repository/handle/JRC67293,   
Language Citation
NameCountryCityType
Datasets
IDTitlePublic URL
Dataset collections
IDAcronymTitlePublic URL
Scripts / source codes
DescriptionPublic URL
Additional supporting files
File nameDescriptionFile type 
Show metadata record  Copy citation url to clipboard  Download BibTeX
Items published in the JRC Publications Repository are protected by copyright, with all rights reserved, unless otherwise indicated. Additional information: https://ec.europa.eu/info/legal-notice_en#copyright-notice