ΕΕΛΛΑΚ - Λίστες Ταχυδρομείου

Re: Extend deepbots to support RL environments

Hello Manos,
Thank you for your feedback. I have tried to follow up with your reviews
and added few concerns in the proposal itself for you to comment on.

Best,
Sanket


On Mon, Mar 29, 2021 at 8:00 AM Manos Kirtas <manolis [ dot ] kirt [ at ] gmail [ dot ] com> wrote:

> Thank you Sanket,
>
> I have added some comments directly on the PDF. Feel free contact me in
> order to discuss them. Excuse me for the delay!
>
> Best,
>
> Manos.
> On 23/3/21 7:52 μ.μ., Sanket Thakur wrote:
>
> Hello Manos,
> Thanks for your reply.
> I have modified my proposal accordingly. We can still iterate over it.
> Let me know your thoughts on it.
>
> Best,
> Sanket
>
> On Tue, Mar 23, 2021 at 12:15 PM Manos Kirtas <manolis [ dot ] kirt [ at ] gmail [ dot ] com>
> wrote:
>
>> Hello,
>>
>> Glad to hear that you are interested on contributing in deepbots project.
>> I have some comments on you proposal
>>
>>    - It will be helpful if you specify in that you will work on first.
>>    For example, there is an extensive list of examples
>>    <https://github.com/aidudezzz/deepbots/issues/85> that you can work
>>    on. Being more specific can help us to guide you in order to compose a
>>    strong proposal
>>    - What types of problems you are interested for? What kind of robots
>>    can be used to replicate those gym examples?
>>    - Are you interested on contributing RL algorithms? Is there any
>>    existing implementation that can be used or we should develop it from
>>    scratch. I totally recommend to take on a look on existing implementation
>>    (such as stable-baselines
>>    <https://stable-baselines.readthedocs.io/en/master/>)
>>    - If you are interested on implementing RL algorithms from scratch,
>>    it would be great to cite the respective papers.
>>    - Which framework you are going to use in order to implement those
>>    algorithms (etc. pytorch, tensorflow)?
>>    - Elaborate as more as possible the custom testbets that you are
>>    interested to develop. What's you ideas? What type of task we want to
>>    solve? What robot can be used? Can this problem be solved with both
>>    discrete and continuous action space?
>>    - I found very interesting idea to have an infrastructure for
>>    hyperparameter optimization! Can we use an existing framework for that, for
>>    example ray? <https://ray.io/>
>>
>> In my prospective it will be better to stick on 3 specific examples/tasks
>> in order to further examine what can be used. Additionally, I feel that
>> firstly we should take a look on existing implementations of RL algorithms
>> and integrate them on those specific examples. Stable-baselines are already
>> supported from deepbots and can be easily integrated on any example with
>> not so much effort. Regarding the hyperparameter optimization, I will
>> recommend ray since a have some experience and I can guide you. Of course
>> any other ideas it is more than welcome to be discussed!
>>
>> Finally, I find very useful to include a timeline in which you can
>> schedule your different ideas and develop a plan that can be feasible on
>> the given timeline.
>>
>> Those some comments can extend our discussion in order to develop a
>> strong proposal. I'm glad to hear your thoughts about the above comments!
>>
>> Best regards,
>>
>> Manos.
>>
>>
>>  On 22/3/21 12:35 μ.μ., Sanket Thakur wrote:
>>
>> Hello,
>> I am writing this to express my interest to work on '* Extend deepbots
>> to support stable-baselines and implement gym-style default RL
>> environments *' as a part of Gsoc 2021.
>> I am attaching my proposal for the project and relevant contributions.
>> It'd be great to hear your reviews on it.
>>
>> Thanks.
>>
>> Best,
>> Sanket
>>
>> ----
>> Λαμβάνετε αυτό το μήνυμα απο την λίστα: Λίστα αλληλογραφίας και συζητήσεων που απευθύνεται σε φοιτητές developers \& mentors έργων του Google Summer of Code - A discussion list for student developers and mentors of Google Summer of Code projects.,https://lists.ellak.gr/gsoc-developers/listinfo.html
>> Μπορείτε να απεγγραφείτε από τη λίστα στέλνοντας κενό μήνυμα ηλ. ταχυδρομείου στη διεύθυνση <gsoc-developers+unsubscribe [ at ] ellak [ dot ] gr> <gsoc-developers+unsubscribe [ at ] ellak [ dot ] gr>.
>>
>>

Attachment: GSOC Application - GFOSS.pdf
Description: Adobe PDF document

----
Λαμβάνετε αυτό το μήνυμα απο την λίστα: Λίστα αλληλογραφίας και συζητήσεων που απευθύνεται σε φοιτητές developers \& mentors έργων του Google Summer of Code - A discussion list for student developers and mentors of Google Summer of Code projects.,
https://lists.ellak.gr/gsoc-developers/listinfo.html
Μπορείτε να απεγγραφείτε από τη λίστα στέλνοντας κενό μήνυμα ηλ. ταχυδρομείου στη διεύθυνση <gsoc-developers+unsubscribe [ at ] ellak [ dot ] gr>.

απαντήσεις

πλοήγηση μηνυμάτων