Let me preface, I am pretty sure I know why the tool is behaving the way it is, and given what I think I know, I am pretty sure there is no easy fix – but figured I would bring it up anyway just in case:
In the situations where you choose HTML -> Text import, and the original HTML has text elements set into a coloured background, the coloured background comes along for the ride, along with the text colour in the original HTML. One thing that I would like to do a lot, and I imagine a lot of people would, is highlighting and annotating various parts of some web research document. If the text is as described above, the highlight tool eradicates the existing background colour underneath the selection. In the case of dark background/light foreground sites, this means the selected text now appears to have been deleted. Actually it is there, in white (or whatever light colour the web designer chose). If I press highlight again it places the selected highlighter colour behind the text. If it is a sane site design, it is probably okay that point; if it is an inverse site with light text, then the result can be fairly unreadable.
So, solutions? On one hand, an obvious answer would be a preference to strip colour from the HTML document. But, I understand that might be well nigh impossible without writing your own HTML parser (ha).
What if it was done post-import: Import the HTML using WebKit, then remove all colour information from the document?
I imagine there is no way for the highlight tool itself to detect the situation, based on its behaviour. It thinks the text is already highlighted to begin with?
Well, just so you know, anyway.
Work-around
Upon import:
- Select All (Cmd-A)
- Show Fonts (Cmd-T)
- Set foreground colour to black
- Remove Highlight (Cmd-Shift-H)