I am still learning Chinese, so I typically take a text, use the Stanford segmenter, and then convert to pinyin.
The conversion adds an unnecessary space before and after each character, which means words identified in segmentation are broken up.
I would like it to not add spaces, but also to not change spaces already in the input. If the package contains a way to do this, I couldn't find it in the documentation.
I am still learning Chinese, so I typically take a text, use the Stanford segmenter, and then convert to pinyin.
The conversion adds an unnecessary space before and after each character, which means words identified in segmentation are broken up.
I would like it to not add spaces, but also to not change spaces already in the input. If the package contains a way to do this, I couldn't find it in the documentation.