ΕΕΛΛΑΚ - Λίστες Ταχυδρομείου

[opensource-devs] Interest in the Goverment Gazette

Dear Sirs,

I am writing this email to you with regards to my interest in the project named «Extraction of Responsibilities per unit in public sector organizations from the Government Gazette». Having read through the details of the project I would like to ask some questions so that I can understand better the requirements. I would be very grateful if you have the time to answer these questions before I submit my proposal.

First, I see that the knowledge prerequisites include Python, Java and Machine Learning. I’m more familiar to Java, Machine learning and Data mining. I haven’t worked with Python, but I am willing to sit and work with this language before Google Summer of Code starts. Is Python going to be used for Machine learning purposes? Secondly, am I right in understanding that Machine Learning is used to automatically find and match «specific Named Entities types with references to assigned responsibilities-services per unit and links between the two must be extracted» is one of the main issues of this project?

If so, am I right in thinking the steps required include: Preprocessing the data, Data integration, Hierarchical or partitioned clustering, Categorization and correlation rules?

Finally, I am bit confused about the NER module. Is there any more information on this subject?

Thank you in advance.
Best regards
Ioakeim
 
----
Λαμβάνετε αυτό το μήνυμα απο την λίστα: Γενική λίστα αλληλογραφίας που απευθύνεται σε developers/contributors έργων ανοικτού λογισμικού - A general discussion list for developers/contributors of open-source projects,
https://lists.ellak.gr/opensource-devs/listinfo.html

Μπορείτε να απεγγραφείτε από τη λίστα στέλνοντας κενό μήνυμα ηλ. ταχυδρομείου στη διεύθυνση <opensource-devs+unsubscribe [ at ] ellak [ dot ] gr>.

απαντήσεις

πλοήγηση μηνυμάτων