ΕΕΛΛΑΚ - Mailing Lists

Flex GovDoc Scanner Project Queries - GSoC 2025

Dear Mentors,

I'm Sushil Pandey, a third-year CSE student specializing in AI & ML. I'm
writing to express my interest in contributing to the Flex GovDoc Scanner
project for GSoC 2025.

With my background in Node.js, REST APIs, OCR systems, and NLP techniques,
I believe I can effectively contribute to transforming Greek ΓΕΜΗ portal
documents into structured, searchable data.

Based on my initial research, I see room for optimization in:
- Implementing intelligent rate limiting for sustainable crawling
- Utilizing language-specific OCR models to improve Greek text recognition
accuracy
- Developing a hybrid search approach combining full-text and
metadata-based queries

I've begun drafting my project proposal and exploring the portal structure.
I would greatly appreciate your insights on:
- Database architecture recommendations (MongoDB vs Elasticsearch)
- Expected document volume and scaling considerations
- Preferred tools for processing Greek language documents

I'm eager to refine my proposal with your guidance to ensure it aligns with
the project's goals.

Thank you for your consideration.

Regards,
Sushil Pandey
----
You are receiving this message from the list: A discussion list for student developers and mentors of Google Summer of Code projects,
https://lists.ellak.gr/gsoc-developers/listinfo.html
You can unsubscribe from the list by sending an empty email to <gsoc-developers+unsubscribe [ at ] ellak [ dot ] gr>.
