RLOS-2021-Microsoft

Contains updates on my work on parallel parsing improvements in Vowpal Wabbit.


Benchmark results:

This page includes screenshots from the presentation that show the benchmark results.

Nomenclature:

Other details:

Graphs:

[Figures: three benchmark graphs]

Callgrind results:

The following image shows results from a callgrind run on ST for the three datasets. Callgrind is a Valgrind profiling tool that records a program's call graph and counts the instructions executed in each function, which serves as a proxy for the CPU cost of each function.

[Figure: callgrind results on ST for the three datasets]

From the results, we see that the cache parser is the fastest, followed by the Text parser, with the JSON parser close behind.

This explains why JSON shows the highest performance improvement over ST (68%), compared with Text (60%) and cache (25%): the more expensive a format is to parse, the more it stands to gain from parallel parsing.
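One way to see why a slower parser benefits more from parallel parsing is a simplified Amdahl's-law model. The sketch below is only an illustration under stated assumptions: it treats "improvement" as a reduction in total runtime, assumes parsing splits evenly across threads while all other work stays serial, and uses made-up parsing shares and a made-up thread count. The function name `time_reduction` and every number in it are hypothetical and are not taken from the benchmarks.

```python
def time_reduction(parse_fraction: float, threads: int) -> float:
    """Fractional reduction in total runtime when only parsing is parallelised.

    Simplified Amdahl's-law model: the parsing share of the single-threaded
    runtime is divided evenly across `threads`; all other work is unchanged.
    """
    if not 0.0 <= parse_fraction <= 1.0:
        raise ValueError("parse_fraction must be between 0 and 1")
    if threads < 1:
        raise ValueError("threads must be at least 1")
    # Runtime left after parallelising, relative to a single-threaded run of 1.0.
    remaining = (1.0 - parse_fraction) + parse_fraction / threads
    return 1.0 - remaining


# Hypothetical parsing shares, for illustration only (not measured values).
hypothetical_shares = {"cache": 0.3, "text": 0.7, "json": 0.8}

for name, share in hypothetical_shares.items():
    reduction = time_reduction(share, threads=4)
    print(f"{name}: {reduction:.1%} less runtime with 4 parser threads")
```

In this model the reduction grows with the parsing share, so the format with the most expensive parser gains the most, which matches the ordering of the reported improvements.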