from smnp.token.tools import regexPatternTokenizer

# Tokenizer built from the regex \s, which matches a single whitespace character.
whitespacesTokenizer = regexPatternTokenizer(None, r'\s')