converts UTF-8 input by remapping its characters to different code points
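
The description does not say which mapping is applied, so the following is only a minimal sketch of the general idea: decode the UTF-8 bytes, replace each code point through a lookup table, and re-encode. The function name `remap_codepoints` and the choice of table (printable ASCII to the Unicode fullwidth forms, a fixed offset of 0xFEE0) are assumptions made for illustration, not the tool's actual behaviour.

```python
# Hypothetical mapping, chosen only for illustration: printable ASCII
# (U+0021..U+007E) is shifted to the fullwidth forms (U+FF01..U+FF5E).
# The mapping used by the real tool is not stated in the source.
FULLWIDTH_OFFSET = 0xFEE0
FULLWIDTH_TABLE = {cp: cp + FULLWIDTH_OFFSET for cp in range(0x21, 0x7F)}


def remap_codepoints(data: bytes, table=FULLWIDTH_TABLE) -> bytes:
    """Decode UTF-8, substitute code points via `table`, re-encode as UTF-8.

    Code points that are not keys of `table` pass through unchanged.
    """
    text = data.decode("utf-8")
    return text.translate(table).encode("utf-8")
```

With this assumed table, `remap_codepoints(b"Hi!")` returns the UTF-8 encoding of U+FF28 U+FF49 U+FF01, while input such as spaces or non-ASCII letters is left as it was. Any other mapping can be passed as a dictionary from integer code points to integer code points.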
