Once you're OK, let's have a discussion on the design before you start writing code. On Wed, Mar 3, 2021, at 10:37, Giorgos wrote: > Hello, > > I am currently studying the resources included in the wiki (multihash, > ssdeep), while simultaneously thinking about the database schema. > Once I feel comfortable with the material, should I create a new GitHub > repository for the project and start committing code? > > Thanks in advance, > Giorgos > > On 1/2/21 7:29 μ.μ., Alexios Zavras wrote: > > Thanks for your interest in getting involved with hashesDB. > > > > Please note that the db keeps only meta-information on files. > > Each hash is a few bytes; 250 bytes for 1 billion files > > amount to 250GB (nowadays available on a USB stick). > > Even if we have that many data (which would be a nice problem to have), > > one could partition the data, keeping different hashes > > on different SQLite data files. > > > > The advantages of SQLite are obviously the single-file data store > > and the serverless implementation. > > I believe its ease of deployment it offers outweigh > > any potential reservation of scale. > > > > To your last point, of course I'd welcome any involvement > > to the project whenever possible. > > However, it should be clear that any work outside the GSoC framework > > will not affect any decisions related to GSoC. > > > > > > On Mon, Feb 1, 2021, at 16:41, Giorgos wrote: > >> Hello everyone, > >> > >> I am Giorgos Kosmas and I am a third-year undergraduate student at > >> National Technical University of Athens, studying towards a degree in > >> Electrical and Computer Engineering. > >> > >> I am interested in participating in Google Summer of Code during the > >> summer. Particularly, I would like to get involved in the development of > >> the "hashesDB" project. > >> > >> I have some clarifying questions regarding the library in which the > >> database will be implemented. I am familiar with SQLite, however I am > >> not sure if SQLite is appropriate for this project, due to the fact that > >> SQLite is a single-disk database. Given the fact that this project > >> should support huge archives of files, is it likely that the total size > >> of our database exceeds the memory size of a single disk? In this case > >> the user would be required to buy a larger and more expensive hard-disk, > >> instead of just buying a much cheaper disk that would provide the > >> additional memory space required. > >> > >> I would like to begin contributing to this project as soon as the fall > >> exam period ends(late February), before the announcement of the accepted > >> mentoring organizations. If this is possible, I would be glad to arrange > >> a skype call in order to discuss further technical decisions in detail. > >> > >> Thanks in advance, > >> > >> Giorgos > >> > -- -- zvr -
---- Λαμβάνετε αυτό το μήνυμα απο την λίστα: Λίστα αλληλογραφίας και συζητήσεων που απευθύνεται σε φοιτητές developers \& mentors έργων του Google Summer of Code - A discussion list for student developers and mentors of Google Summer of Code projects., https://lists.ellak.gr/gsoc-developers/listinfo.html Μπορείτε να απεγγραφείτε από τη λίστα στέλνοντας κενό μήνυμα ηλ. ταχυδρομείου στη διεύθυνση <gsoc-developers+unsubscribe [ at ] ellak [ dot ] gr>.