"Stephen J. Turnbull" <turnbull(a)sk.tsukuba.ac.jp> writes:
> Not really relevant to the list anymore.
It seems like very relevant info to me, especially the discussion of the
right-justification issue below.  Pardon me if I send this to the lists.
> >>>>> "Dan" == Dan Harkless <dan-xemacs(a)dilvish.speed.net> writes:
> 
>     Dan> "Stephen J. Turnbull" <turnbull(a)sk.tsukuba.ac.jp> MIS-attributes!
> 
>     Dan> Uh, no, I did not write the above.  That was Hrvoje as well.
> 
> Very sorry.  My bad.
> 
>     >> OK.  Since Dan has provided an algorithm to replicate the
>     >> problem, it should get fixed soon enough.
> 
>     Dan> Actually, I provided an algorithm to replicate the problem
>     Dan> back in April of '98, but I guess my bug just got recorded
>     Dan> then forgotten about, rather than becoming a topic of
>     Dan> discussion on xemacs-beta.
> 
> I didn't see it, or didn't notice it (I'm mostly interested in Mule).
I just posted my bug to comp.emacs.xemacs.  Steven Baur then replied that he
had recorded it.
> I'm not referring to your usage, anyway.  I don't like it when threads
> get introduced into xemacs-beta in the middle just so that people can
> bitch about Mule.  
xemacs-beta or xemacs-mule?
> (That probably wasn't the intent, but since whoever
> diverted the thread - Hrvoje, Jan - didn't explain anything about the
> mechanics of the bug, that was the effect.)
My recent comp.emacs.xemacs post (which was apparently what was forwarded)
explained it in brief and included a URL to my original comp.emacs.xemacs
bug report, which explained it in more detail.
>     >> This would not be acceptable in Japanese.
> 
>     Dan> Why not?  If a space is so important, wouldn't it be better
>     Dan> to _not_ have it at the end of a line, where it's invisible?
> 
> Because you want it to reappear when refilling takes place.  Using ^
> and $ to mark line beginning and end of lines:
> 
>     ^NormalJapaneseTextLooksLikeThisAllRunTog$
>     ^etherWithNotEnoughPunctuationAndLinesAre$
>     ^BrokenWillyNillyExceptThatALineShouldNot$
>     ^StartWithPunctuationIncludingExplicitWhi$
>     ^teSpace.$
> 
> Using spaces for emphasis:
> 
>     ^NormalJapaneseTextLooksLikeThisAllRunTog$
>     ^etherWithNotEnoughPunctuationAndLinesAre$
>     ^BrokenWillyNillyExceptThatALineShouldNot$
>     ^StartWithPunctuation, Including Explicit $
>     ^WhiteSpace.$
> 
> Inserting,
> 
>     ^NormalJapaneseTextUSUALLYLooksLikeThisAl$
>     ^lRunTogetherWithNotEnoughPunctuationAndL$
>     ^inesAreBrokenWillyNillyExceptThatALineSh$
>     ^ouldNotStartWithPunctuation, Including E$
>     ^xplicit WhiteSpace.$
> 
> See how the space between "explicit" and "whitespace" comes back?
I'm not advocating removing it.  I was suggesting:
     ^NormalJapaneseTextLooksLikeThisAllRunTog$
     ^etherWithNotEnoughPunctuationAndLinesAre$
     ^BrokenWillyNillyExceptThatALineShouldNot$
     ^StartWithPunctuation, Including $
     ^Explicit WhiteSpace.$
And if "Including Explicit WhiteSpace" is in romaaji, which I take it is the
most common usage of explicit whitespace in Japanese writing, you can't
always avoid that ragged right side (in fixed-width text), so the above
should be acceptable.
If "Including Explicit WhiteSpace" is Japanese, however, I finally see why
one might want to put the space past fill-column.  Nobody had brought up
this right-justification issue until now (or if they did, I missed it).
However, if you're using explicit white space in Japanese, you can't have a
clean right side in all cases -- what about:
     ^NormalJapaneseTextLooksLikeThisAllRunTog$
     ^etherWithNotEnoughPunctuationAndLinesAre$
     ^BrokenWillyNillyExceptThatALineShouldNot$
     ^StartWithPunctuation Including Explicit $
     ^WhiteSpace.BlahBlahBlahBlahBlahBlahBlah.$
>     >> I think it would be pretty unacceptable in English too (people
>     >> will go nuts with three-letter words that don't fit into four
>     >> letter spaces at the end of the line).
> 
>     Dan> Yeah, I know some people would find it unacceptable --
>     Dan> that's why I was proposing it be controlled by a variable
>     Dan> that you can set either way.  But I think you may be
>     Dan> overestimating the number of people who'd find it
>     Dan> unaccepable anyway -- look how long this extra space bug has
>     Dan> persisted...
> 
> I think you're underestimating the number of people who can visually
> estimate the length of the word "the" but can't see empty space.
In all my non-email, non-newsgroup-post buffers, I set the fill-column to 80
(and have my XEmacs window 81 columns wide -- an annoying necessity), so
anytime there's an extra space at the end of the line, I see the line-wrap
"curly arrow", not just "empty space".
I'd think most people would want to use all of the columns they're entitled
to... 
> Only people who are writing to strict coding standards with automatic
> checking software are likely to find the bug "highly" annoying.  (A
> few people will naturally find it a pain in the neck, of course, but
> they will be few.)
No, like I say, anyone who has fill-column set to the width of the window
minus 1.
>     Dan> That's a fundamental failure in what auto-fill
>     Dan> claims to do, and it's been broken ever since 20.0.
> 
> Documentation?  Info says "break at appropriate places".
> "Appropriate" is culturally determined, eg, by company coding
> standards.
Okay, you got me there.  I didn't re-read the documentation before saying
that, and I was remembering it as being defined mechanically in terms of
fill-column. 
>     >> Unfortunately, people who write in Japanese often need both
>     >> behaviors in the same buffer (as Hrvoje points out).
> 
>     Dan> Hmmm.  Do they *need* both behaviors?  I still fail to see
>     Dan> what's so bad about breaking at the previous boundary if
>     Dan> you've specifically told xemacs you want to preserve spaces.
> 
> Yes.  The above text is how many Japanese write English embedded in
> Japanese (except that they wouldn't use capitals to indicate word
> breaks as I did).  Such authors don't need both.  That is not
> acceptable in most academic writing, however; the English should look
> like English.  Such embedding is an everyday occurance, the need is
> real.
With the right-justification issue in play, I now understand how this could
be the case.
>     Dan> So what'll happen?  In Asian languages, auto-fill will fail
>     Dan> to perform as advertised, allowing you to get lines of
>     Dan> (fill-column + 1) columns, whereas in Western languages, that
>     Dan> won't be the case?
> 
> Yes.  It's not just the magic space, it's all punctuation.  Lines may
> not start with punctuation.  There are two styles.  In one,
> punctuation at the end of the line is retained, resulting is lines
> going past the fill column.  Most Japanese texts seem to do this.  (In
> fact, I have seen proportionally filled text with exdented
> punctuation.)  
Wow.  Sounds pretty ugly.  If that's common usage, however, especially in
printed material, that really changes things.  I thought we were just
talking about two different views of "getting all the columns one is
entitled to" in XEmacs.
> This style is often seen in programming (the exdented
> "*/" terminator, right?) by Westerners, too.  
But the "*/" doesn't go past fill-column.
> I believe that some old style manuals recommend this style for English,
> too.  The alternative style is to have some short lines, as in English.
> This is pretty ugly, IMHO, and evidently most Japanese agree.
Or to have variable-width spaces (or multiple spaces in series in a
fixed-width environment).
>     Dan> Seems pretty hokey to me
> 
> Please.  If you don't understand the language, don't go there.  Shall
> we start on English spelling?
I didn't mean the Japanese usage seemed hokey, I meant XEmacs' definition of
fill-column and auto-fill seemed hokey, if it was only trying to artifically
gain Japanese users 1 extra column in certain situations.  Again, the
right-justification bit definitely changes my understanding of the issue.
>     Dan> I know it'd bug the heck out of me if I had fill-column set
>     Dan> to 80 and was working on a buffer of C code with Japanese
>     Dan> comments (I had to do this frequently at my previous job),
>     Dan> and the C code would auto-fill properly, but the Japanese
>     Dan> comment lines would occasionally be 81 columns, breaking my
>     Dan> company's coding style policy, and me with no way to stop
>     Dan> that from happening.
> 
> Well, your company's coding style policy conflicts with the relevant
> national standard, then.  (AFAIK it is in fact a Japanese national
> standard that specifies the two styles of line breaking.)  I believe
> both are supported by Mule....
> 
> ...Yes.  (setq kinsoku-extend-limit 0) does what you want.
> 
> Except that it probably doesn't in the pre-Hrvoje-reversion-code, as
> that variable only exists in Mule.  So obviously the behavior you
> reported is a bug.  (Please note that nobody bothered to say that this 
> behavior introduced to support Asian languages was occurring in
> non-Mule XEmacsen, either; even you only noted it as a "by-the-way"!)
Well, that was discussed, but yes, perhaps could have been stressed more.
However, I wouldn't want the spaces past fill-column just because I had
built my XEmacs with MULE support (but wasn't actively using it).
-----------------------------------------------------------------------
Dan Harkless                   | To prevent SPAM contamination, please 
dan-xemacs(a)dilvish.speed.net   | do not post this private email address
SpeedGate Communications, Inc. | to the USENET or WWW.  Thank you.