ΕΕΛΛΑΚ - Λίστες Ταχυδρομείου

Re: [GSoC 2021]Questions about HashesDB

Once you're OK, let's have a discussion on the design
before you start writing code.

On Wed, Mar 3, 2021, at 10:37, Giorgos wrote:
> Hello,
> 
> I am currently studying the resources included in the wiki (multihash, 
> ssdeep), while simultaneously thinking about the database schema.
> Once I feel comfortable with the material, should I create a new GitHub 
> repository for the project and start committing code?
> 
> Thanks in advance,
> Giorgos
> 
> On 1/2/21 7:29 μ.μ., Alexios Zavras wrote:
> > Thanks for your interest in getting involved with hashesDB.
> >
> > Please note that the db keeps only meta-information on files.
> > Each hash is a few bytes; 250 bytes for 1 billion files
> > amount to 250GB (nowadays available on a USB stick).
> > Even if we have that many data (which would be a nice problem to have),
> > one could partition the data, keeping different hashes
> > on different SQLite data files.
> >
> > The advantages of SQLite are obviously the single-file data store
> > and the serverless implementation.
> > I believe its ease of deployment it offers outweigh
> > any potential reservation of scale.
> >
> > To your last point, of course I'd welcome any involvement
> > to the project whenever possible.
> > However, it should be clear that any work outside the GSoC framework
> > will not affect any decisions related to GSoC.
> >
> >
> > On Mon, Feb 1, 2021, at 16:41, Giorgos wrote:
> >> Hello everyone,
> >>
> >> I am Giorgos Kosmas and I am a third-year undergraduate student at
> >> National Technical University of Athens, studying towards a degree in
> >> Electrical and Computer Engineering.
> >>
> >> I am interested in participating in Google Summer of Code during the
> >> summer. Particularly, I would like to get involved in the development of
> >> the "hashesDB" project.
> >>
> >> I have some clarifying questions regarding the library in which the
> >> database will be implemented. I am familiar with SQLite, however I am
> >> not sure if SQLite is appropriate for this project, due to the fact that
> >> SQLite is a single-disk database. Given the fact that this project
> >> should support huge archives of files, is it likely that the total size
> >> of our database exceeds the memory size of a single disk? In this case
> >> the user would be required to buy a larger and more expensive hard-disk,
> >> instead of just buying a much cheaper disk that would provide the
> >> additional memory space required.
> >>
> >> I would like to begin contributing to this project as soon as the fall
> >> exam period ends(late February), before the announcement of the accepted
> >> mentoring organizations. If this is possible, I would be glad to arrange
> >> a skype call in order to discuss further technical decisions in detail.
> >>
> >> Thanks in advance,
> >>
> >> Giorgos
> >>
>

-- 
-- zvr -
----
Λαμβάνετε αυτό το μήνυμα απο την λίστα: Λίστα αλληλογραφίας και συζητήσεων που απευθύνεται σε φοιτητές developers \& mentors έργων του Google Summer of Code - A discussion list for student developers and mentors of Google Summer of Code projects.,
https://lists.ellak.gr/gsoc-developers/listinfo.html
Μπορείτε να απεγγραφείτε από τη λίστα στέλνοντας κενό μήνυμα ηλ. ταχυδρομείου στη διεύθυνση <gsoc-developers+unsubscribe [ at ] ellak [ dot ] gr>.

απαντήσεις

αναφορές

πλοήγηση μηνυμάτων