I am writing this email to you with regards to my interest in the
project named «Extraction of Responsibilities per unit in public sector
organizations from the Government Gazette». Having read through the
details of the project I would like to ask some questions so that I can
understand better the requirements. I would be very grateful if you have
the time to answer these questions before I submit my proposal.
First, I see that the knowledge prerequisites include Python, Java and
Machine Learning. I’m more familiar to Java, Machine learning and Data
mining. I haven’t worked with Python, but I am willing to sit and work
with this language before Google Summer of Code starts. Is Python going
to be used for Machine learning purposes?
Secondly, am I right in understanding that Machine Learning is used to
automatically find and match «specific Named Entities types with
references to assigned responsibilities-services per unit and links
between the two must be extracted» is one of the main issues of this
If so, am I right in thinking the steps required include: Preprocessing
the data, Data integration, Hierarchical or partitioned clustering,
Categorization and correlation rules?
Finally, I am bit confused about the NER module. Is there any more
information on this subject?
Thank you in advance.
Λαμβάνετε αυτό το μήνυμα απο την λίστα: Γενική λίστα αλληλογραφίας που απευθύνεται σε developers/contributors έργων ανοικτού λογισμικού - A general discussion list for developers/contributors of open-source projects,
Μπορείτε να απεγγραφείτε από τη λίστα στέλνοντας κενό μήνυμα ηλ. ταχυδρομείου στη διεύθυνση <opensource-devs+unsubscribe [ at ] ellak [ dot ] gr>.