Hello everyone, My name is Muien, and I am planning to apply for GSoC 2026 under the project “GlossAPI: Needs-Driven Evolution of the Dataset Production Pipeline for Greek Language Data.” I have an intermediate Python background, experience with pytest-based testing, and prior open-source contribution experience (one merged PR in MalariaGEN). I am particularly interested in improving pipeline modularity, maintainability, and testing practices. I have started reviewing the GlossAPI repository and would appreciate clarification on a few points: 1. What are currently the main bottlenecks or pain points in the dataset production pipeline? 2. Are there specific components that would benefit most from refactoring or restructuring? 3. Is there a recommended workflow or dataset I should use locally while exploring the system? I plan to begin contributing before the proposal deadline and would appreciate guidance on where early contributions would be most helpful. Thank you, Mohammed Muien GitHub: https://github.com/muien5080
---- Λαμβάνετε αυτό το μήνυμα απο την λίστα: Λίστα αλληλογραφίας και συζητήσεων που απευθύνεται σε φοιτητές developers \& mentors έργων του Google Summer of Code - A discussion list for student developers and mentors of Google Summer of Code projects., https://lists.ellak.gr/gsoc-developers/listinfo.html Μπορείτε να απεγγραφείτε από τη λίστα στέλνοντας κενό μήνυμα ηλ. ταχυδρομείου στη διεύθυνση <gsoc-developers+unsubscribe [ at ] ellak [ dot ] gr>.