https://bbengfort.github.io/2016/08/parallel-nlp-preprocessing/