I would like to include a link to a wordnet file that can be downloaded by the Python wn module, if such a file exists, and make it visible from the Wordnets in the World page. We can then serve these files in the OMW interface (and possibly elsewhere).
The minimal information would be something like:
pkg:
  url: https://github.com/omwn/omw-data/releases/download/v1.4/omw-ja-1.4.tar.xz
  id: omw-ja
  version: 1.4
The URL should link to a package (or collection): either an XML file, or a tarred directory containing an XML file and, optionally, a license, README, and citation.bib. See:
https://wn.readthedocs.io/en/latest/guides/lexicons.html#wn-lmf-files-packages-and-collections
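To make the proposed entry concrete, here is a minimal sketch of how a page build script might validate it. The names `WordnetPackage`, `parse_pkg`, and `ALLOWED_SUFFIXES` are hypothetical, and the list of accepted file suffixes is my assumption based on the description above, not an agreed rule. The `id:version` string follows the lexicon specifier form used by wn.

```python
from dataclasses import dataclass
from urllib.parse import urlparse

# Assumption: accepted suffixes for a bare XML file or a tarred directory,
# optionally compressed. Adjust to whatever the project agrees on.
ALLOWED_SUFFIXES = (".xml", ".xml.gz", ".xml.xz", ".tar", ".tar.gz", ".tar.xz")


@dataclass(frozen=True)
class WordnetPackage:
    """Hypothetical container for the minimal `pkg` fields."""
    url: str
    id: str
    version: str

    @property
    def specifier(self) -> str:
        # Lexicon specifier in 'id:version' form, e.g. 'omw-ja:1.4'.
        return f"{self.id}:{self.version}"


def parse_pkg(entry: dict) -> WordnetPackage:
    """Validate a `pkg` mapping and return it as a WordnetPackage."""
    missing = [k for k in ("url", "id", "version") if entry.get(k) in (None, "")]
    if missing:
        raise ValueError(f"missing pkg fields: {', '.join(missing)}")
    url = str(entry["url"])
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an http(s) URL: {url}")
    if not parsed.path.lower().endswith(ALLOWED_SUFFIXES):
        raise ValueError(f"unexpected file type: {url}")
    # A YAML loader may read `version: 1.4` as a float, so normalise to str.
    return WordnetPackage(url=url, id=str(entry["id"]), version=str(entry["version"]))
```

For the example entry above, `parse_pkg(...)` would give a package whose `specifier` is `omw-ja:1.4`.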
I'm not sure whether we should also have a status field: is this the most up-to-date wordnet for this language? If so, we would need to define the criteria, ...
But I think it would help someone who knows nothing about wordnets to know that this is the recommended wordnet for this language, ...
Should we allow multiple packages? Or should this page list only the best one, so that people who want older versions can go to the project URL?
The goal of this is to create a community-curated list of available wordnets, ...
We can then have a script that downloads them all and computes some statistics (similar to the summary script in the omw-data package).
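A rough sketch of what such a script could look like, assuming each entry is a mapping with the `url`, `id`, and `version` fields proposed above. The function names are hypothetical, and this is not the omw-data summary script. The defaults use `wn.download` and `wn.Wordnet` from the wn module (imported lazily, so the loop itself can be tried without wn installed or network access).

```python
from typing import Callable, Iterable


def default_fetch(url: str) -> None:
    """Download and add a package with the third-party wn module."""
    import wn
    wn.download(url)


def default_count(specifier: str) -> dict:
    """Count words, senses, and synsets for one installed lexicon."""
    import wn
    w = wn.Wordnet(lexicon=specifier)
    return {
        "words": len(w.words()),
        "senses": len(w.senses()),
        "synsets": len(w.synsets()),
    }


def summarize(
    packages: Iterable[dict],
    fetch: Callable[[str], None] = default_fetch,
    count: Callable[[str], dict] = default_count,
) -> list:
    """Try every package; record counts on success, the error otherwise."""
    rows = []
    for pkg in packages:
        spec = f"{pkg['id']}:{pkg['version']}"
        try:
            fetch(pkg["url"])
            rows.append({"lexicon": spec, "status": "ok", **count(spec)})
        except Exception as exc:  # keep going so one bad link does not stop the run
            rows.append({"lexicon": spec, "status": f"failed: {exc}"})
    return rows
```

Recording failures instead of aborting seems useful here, since a broken download link is itself something the curated list would want to report.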