Table that won't sort correctly: wash-up

Simon BUCH simon.buch at m-ais.com
Mon Dec 10 01:14:16 PST 2012


Hello,

I suspect the effect that you may be observing is Microsoft's handling 
of some multibyte characters, where they get mutated to the Unicode 
replacement character '�' U+FFFD - which appears to happen very often 
when content is imported or pasted into FrameMaker from Microsoft word.

The character is not shown in FrameMaker, but if the file is exported to 
XML or HTML, it is rendered as a question mark in a rhombus diamond 
symbol: � - as shown on the web page:
http://www.fileformat.info/info/unicode/char/fffd/index.htm
Some browsers and applications display it as a hollow box character.

If you know the exact position of one of the errant character in the 
FrameMaker document, you can select the single character by using the 
cursor keys, and press the delete key. For some document I found it was 
possible to select Select Edit > Find/Change... ; specify text to find 
as: \x0d ; and then click Change All.

Regards
// Simon BUCH




On 09/12/2012 12:52, Steve Rickaby wrote:
> At 09:24 +0000 8/12/12, Steve Rickaby wrote:
>
>> Plain old MIF-ing fixes it.
> This is interesting, as it provided a specific example of an invisible (to the eye) problem that MIF-washing fixes.
>
> To recap, I had a large table whose first column contained entries of the form '[xxxx]', where 'xxxx' was a reference marker. The text had originated in Word, and the table didn't sort properly when the first column was used as the sort key.
>
> In the MIF, quite a lot of the first column entries were of the form:
>
>        <ParaLine
>          <String `['>
>          <String `xxxx]'>
>        > # end of ParaLine
>
> In other words, the '[xxxx]' entry consisted of two separate but adjacent strings. This messed up the sort. The only visible evidence of this on the screen was that it took one more right-cursor movement to advance from the start of the line to past the '[' as expected. Also, the wildcard pattern '^[' didn't get a match.
>
> Clearly one of the things that FrameMaker does when parsing MIF is to spot adjacent strings like this and concatenate them. Useful to know. Why the text came out of Word like this is another question altogether. But with Word, who knows?
>




More information about the framers mailing list