RLOS-2021-Microsoft

Contains updates for my work on Parallel parsing improvements on Vowpal Wabbit.

View on GitHub

drawing drawing

Presentation link

Video link

About the program (from website):

The Reinforcement Learning (RL) Open Source Fest is a global online program focused on introducing students to open source reinforcement learning programs and software development while working alongside researchers, data scientists, and engineers on the Real World Reinforcement Learning team at Microsoft Research NYC. Students will work on a four-month research programming project during their break from university (May-August 2021). Accepted students will receive a $10,000 USD stipend.

Our goal is to bring together a diverse group of students from around the world to collectively solve open source reinforcement learning problems and advance the state-of-the-art research and development alongside the RL community while providing open source code written and released to benefit all.

At the end of the program, students will present each of their projects to the Microsoft Research Real World Reinforcement Learning team online.

Project Abstract:

Vowpal Wabbit is known for its blazing-fast performance. However, VW’s parsers can be a bottleneck for most operations, so an effective way to multithread the parsers is required to unleash their true potential. Last year, parallel parsing support for text input format was provided. This project builds upon that by providing a better and more efficient way to read and write cache, support for multiple passes, multiline examples, and JSON/DsJSON input formats.

Work done:

All the work can be found here in the form of pull requests. link. Look at how to run for more details.

Pages:

Resources:

These resources could be helpful for getting started with multithreading in c++.

Lessons learnt the hard way:

Contribute?

Please feel free to contribute by making pull requests or opening issues on the repo. I would also love to hear your thoughts about this project. Please use any form of contact mentioned on my website or github bio.

Acknowledgement:

This summer (the summer of 2021) was a great time for me. Along with all the technical learnings, the most important skill that I learnt was time management.

I would like to thank my mentors @peterychang and @jackgerrits, for their constant support and guidance. Also, thanks to @shivanshs9 for his feedback on the proposal.

Lastly, thanks to Microsoft for this amazing oppotunity and the generous funding.

This has been a summer worth remembering!