Khmer Script Punctuation Line Breaking Is Broken
Regression Bug (used to work in InDesign CS 5.5)
Adobe InDesign CC 15.03 currently will break lines between Khmer script and its punctuation even though no space is present (Khmer Unicode Punctuation: https://en.wikipedia.org/wiki/Khmer_script#Spacing_and_punctuation ) as pictured in the attachment.
Desired Result: Khmer punctuation should stay with the Khmer script preceding it (just like an English period, comma, question mark, etc. does). The whole word should line break with the punctuation (currently Khmer uses zero-width spaces [U+200B] to indicate a word-boundary).
The issue has been fixed in the latest release of InDesign – InDesign 2021.
Please update to this new version of InDesign to get the fix.
—
Adobe InDesign team
-
Nathan Wells commented
And another
-
Nathan Wells commented
Here's a screenshot.
-
Nathan Wells commented
This is still broken @Ravi Kiran
As Eric Pawley comments.
ៗ gets put on a new line (which should never happen).
Also, some words are being broken in the middle of a syllable instead of only being broken on the zero-width space character U+200B
Words like ហ្វេសបុក
គ្រីស្ទ
យេស៊ូវThis all worked in InDesign CS 5.5 and I still have to use CS 5.5 when we publish books using justified text.
-
Eric Pawley commented
I've tested the release, 16.0.0.77, and find that the bugs I listed in 16.0.0.60 are still present.
The most obvious error is the line break before ៗ "repetition sign (U+17D7), which should never be permitted.
But the word ហ្វេសបុក "Facebook" is still breaking in the middle of a syllable between បុ andក.Line breaking for "Khmer" words appears to be acceptable, but line breaking for non-Khmer words is very bad. In addition to borrowed English words in Khmer texts, the Khmer script is used to write many other languages including Bunong, Kuy, Jarai, Tampuan, Kreung, and Brao.
Up until InDesign 2019, line breaking worked very well for these languages. With InDesign 2020 and 2021, about 10-20% of lines are broken in the middle of words. The only workaround of which I am aware is to apply "NoBreak" to all text and then use a Grep Style to allow breaks only at white space characters or zero width space.
-
Eric Pawley commented
InDesign 16.0.0.60 x64 (Mainline CL #1084417) is much improved, but still has issues. In the attached file (copy-paste RFA Khmer) the text is typed with Zero width space between words:
-- ទំព័រហ្វេសបុក "facebook page" is broken before the final letter in the word.
-- The name អាឡិកហាន់ដ្រូ "Alejandro" is broken between syllables.
-- The punctuation mark ៗ "repetition sign (U+17D7)" should be attached to the word before it (ផ្សេង)
-
Nathan Wells commented
Thanks @amaarora
Unfortunately, I am still on High Sierra because I still have to use InDesign CS 5.5 in order for Khmer to work properly. I looked again and saw there is a possible workaround to use it in Mojave so I will do my best to try that so I can load InDesign 16.0
Thanks for your work on getting the Khmer language working in InDesign again. -
Manith commented
I am Khmer. I have my own history, the Khmer Empire.
Thanks for your developer team for support our Khmer language.
-
Eric Pawley commented
I confirmed that this bug still exists in ID 2021 PR (16.0.0.36)