ΕΕΛΛΑΚ - Λίστες Ταχυδρομείου

Re: First version of the Morphological Dictionary

Great work, Konstantine!

As feedback I would add the following, not directly related to the
morphological lexicon itself:
Please try to adhere with Python coding conventions (PEP8) and writing good
commit messages.

I am also thinking of using it on 3gm to improve the lemmatization
procedure on the topic models since the Greek language requires lemmas to
produce satisfying results.

Keep up the good work,
Marios

Στις Σάβ, 29 Ιουν 2019 στις 4:25 μ.μ., ο/η Konstantinos Agiannis <
k [ dot ] agiannis [ at ] yandex [ dot ] com> έγραψε:

> Hello,
>
> I have uploaded the first version of a Greek Morphological Dictionary. A
> second version will be released in July.
>
> A link for Download is located here
> https://github.com/eellak/gsoc2019-greek-morpho
>
> It contain almost 500.000 distinct forms with morphological information
> described according to Universal Dependencies.
> It also contains Etymologies and Definitions.
>
> Things you can do using this SQL database.
>
> Lemmatizer table
>
> SELECT DISTINCT form,lemma FROM words WHERE form != lemma
>
> Lookup POS tagger (only non ambiguous POS)
>
> SELECT DISTINCT lemma AS l,pos FROM words WHERE (SELECT count(DISTINCT
> pos) FROM words WHERE l = lemma) = 1;
>
> In the current version there is a bug for Tags. In future version you will
> able to do
>
> SELECT DISTINCT form FROM words WHERE tags like '%Ant%' AND gender = 'Fem';
>
> To extract all greek female name forms.
>
> I hope you find it useful.
>
> Also I am looking for feedback about the next release.
>
> Possible new features are synonyms and antonyms.
> ----
> Λαμβάνετε αυτό το μήνυμα απο την λίστα: Λίστα αλληλογραφίας και συζητήσεων
> που απευθύνεται σε φοιτητές developers \& mentors έργων του Google Summer
> of Code - A discussion list for student developers and mentors of Google
> Summer of Code projects.,
> https://lists.ellak.gr/gsoc-developers/listinfo.html
>
> Μπορείτε να απεγγραφείτε από τη λίστα στέλνοντας κενό μήνυμα ηλ.
> ταχυδρομείου στη διεύθυνση <gsoc-developers+unsubscribe [ at ] ellak [ dot ] gr>.
>


-- 
Undergraduate Student | School of Electrical and Computer Engineering |
National Technical University of Athens
~marios <http://papachristoumarios.github.io>
 
----
Λαμβάνετε αυτό το μήνυμα απο την λίστα: Λίστα αλληλογραφίας και συζητήσεων που απευθύνεται σε φοιτητές developers \& mentors έργων του Google Summer of Code - A discussion list for student developers and mentors of Google Summer of Code projects.,
https://lists.ellak.gr/gsoc-developers/listinfo.html

Μπορείτε να απεγγραφείτε από τη λίστα στέλνοντας κενό μήνυμα ηλ. ταχυδρομείου στη διεύθυνση <gsoc-developers+unsubscribe [ at ] ellak [ dot ] gr>.

απαντήσεις

αναφορές

πλοήγηση μηνυμάτων