ΕΕΛΛΑΚ - Λίστες Ταχυδρομείου

Re: First version of the Morphological Dictionary

  • Subject: Re: First version of the Morphological Dictionary
  • From: Diomidis Spinellis <dds [ at ] aueb [ dot ] gr>
  • Date: Sat, 29 Jun 2019 18:45:03 +0300
Dear Konstantine,

Indeed, great and useful work! As for PEP8, the best practice is to enforce it by running a Python style checker as part of an e.g. Travis CI job.

Kind regards,

Diomidis Spinellis

On 29-Jun-19 17:25, Marios Papachristou wrote:
Great work, Konstantine!

As feedback I would add the following, not directly related to the morphological lexicon itself: Please try to adhere with Python coding conventions (PEP8) and writing good commit messages.

I am also thinking of using it on 3gm to improve the lemmatization procedure on the topic models since the Greek language requires lemmas to produce satisfying results.

Keep up the good work,
Marios

Στις Σάβ, 29 Ιουν 2019 στις 4:25 μ.μ., ο/η Konstantinos Agiannis <k [ dot ] agiannis [ at ] yandex [ dot ] com <mailto:k [ dot ] agiannis [ at ] yandex [ dot ] com>> έγραψε:

    Hello,

    I have uploaded the first version of a Greek Morphological
    Dictionary. A second version will be released in July.

    A link for Download is located here
    https://github.com/eellak/gsoc2019-greek-morpho

    It contain almost 500.000 distinct forms with morphological
    information described according to Universal Dependencies.
    It also contains Etymologies and Definitions.

    Things you can do using this SQL database.

    Lemmatizer table

    SELECT DISTINCT form,lemma FROM words WHERE form != lemma

    Lookup POS tagger (only non ambiguous POS)

    SELECT DISTINCT lemma AS l,pos FROM words WHERE (SELECT
    count(DISTINCT pos) FROM words WHERE l = lemma) = 1;

    In the current version there is a bug for Tags. In future version
    you will able to do

    SELECT DISTINCT form FROM words WHERE tags like '%Ant%' AND gender =
    'Fem';

    To extract all greek female name forms.

    I hope you find it useful.

    Also I am looking for feedback about the next release.

    Possible new features are synonyms and antonyms.
    ----
    Λαμβάνετε αυτό το μήνυμα απο την λίστα: Λίστα αλληλογραφίας και
    συζητήσεων που απευθύνεται σε φοιτητές developers \& mentors έργων
    του Google Summer of Code - A discussion list for student developers
    and mentors of Google Summer of Code projects.,
    https://lists.ellak.gr/gsoc-developers/listinfo.html

    Μπορείτε να απεγγραφείτε από τη λίστα στέλνοντας κενό μήνυμα ηλ.
    ταχυδρομείου στη διεύθυνση <gsoc-developers+unsubscribe [ at ] ellak [ dot ] gr
    <mailto:gsoc-developers%2Bunsubscribe [ at ] ellak [ dot ] gr>>.



--
Undergraduate Student | School of Electrical and Computer Engineering | National Technical University of Athens
~marios <http://papachristoumarios.github.io>

----
Λαμβάνετε αυτό το μήνυμα απο την λίστα: Λίστα αλληλογραφίας και συζητήσεων που απευθύνεται σε φοιτητές developers \& mentors έργων του Google Summer of Code - A discussion list for student developers and mentors of Google Summer of Code projects.,
https://lists.ellak.gr/gsoc-developers/listinfo.html

Μπορείτε να απεγγραφείτε από τη λίστα στέλνοντας κενό μήνυμα ηλ. ταχυδρομείου στη διεύθυνση <gsoc-developers+unsubscribe [ at ] ellak [ dot ] gr>.
 
----
Λαμβάνετε αυτό το μήνυμα απο την λίστα: Λίστα αλληλογραφίας και συζητήσεων που απευθύνεται σε φοιτητές developers \& mentors έργων του Google Summer of Code - A discussion list for student developers and mentors of Google Summer of Code projects.,
https://lists.ellak.gr/gsoc-developers/listinfo.html

Μπορείτε να απεγγραφείτε από τη λίστα στέλνοντας κενό μήνυμα ηλ. ταχυδρομείου στη διεύθυνση <gsoc-developers+unsubscribe [ at ] ellak [ dot ] gr>.

πλοήγηση μηνυμάτων