Why does Scrivener use &nbsp for separating sections?

Robotech_Master · May 3, 2016, 12:18am

I was in a discussion over on the Mobileread forum with Jim Chapman, developer of the Windows 10 EPUB-reading app Freda, asking him to add support to his app so that my Scrivener-compiled EPUBs would display properly in his app. (See the other thread I started in this forum.) He agreed to amend Freda to throw in a line break for the

format Scrivener uses, then he added:

Is that right? If so, why does Scrivener do it this way? It seems that keeping to XHTML standards should be the way to go.

So why doesn’t Scrivener? Should it? I’d really like to know.

Robotech_Master · May 3, 2016, 3:02pm

Some further thoughts on this issue.

AmberV · May 4, 2016, 12:52am

We are in complete agreement on whether or not using empty paragraphs with non-breaking spaces, or sequences of
s, to affect spacing is not at all a good way to make a well-formed page. It’s not exactly breaking standards to do this, to be clear, you won’t get an error result from W3C or an ePub validator—but it is a bit like using the Tab key to indent your paragraphs instead of proper ruler formatting. I see in your blog post you did represent that point of view as well. I think everyone that works with HTML feels it is ugly and when in a circumstance where CSS can be used to generate spacing, will do so.

We don’t have a lot of control over the conversion from RTF to HTML, to be clear. By and large we don’t generate HTML, we make a document look right and then ask the engine to turn what that looks like into HTML/CSS that looks as close as possible to the original. What we can do is often limited to what can be done with Find and Replace All—just searching for string patterns and changing them. As you can imagine that’s a fragile approach that must be used very carefully. I’ve added a note to see if we can search for empty paragraphs and replace them with 1em-height CSS margins, but I can’t promise anything. I mean we basically have to look for these stub lines, remove them, and then hack a class into the prior paragraph’s style attribute. That’s not impossible, but again just doing that with search and replace is risky. What if the thing above the stub paragraph is another stub paragraph, or something other than a body text paragraph or maybe the stub paragraph isn’t actually meant to be a scene separator and being generated for some other reason, etc.

I’m not even sure if using padding on the ultimate paragraph of a scene is the right approach either. I would think that a section separator should be a discrete element that could be styled centrally and referred to in the DOM as a specific semantic thing, rather than one paragraph with way more padding below it than most paragraphs.

Looking to the future (somewhat long-term) we hope to introduce an optional and more advanced approach to ePub generation that would result in extremely clean HTML code, using another engine entirely. It’s still far too early to give any details or estimations on how it will work though.

Robotech_Master · May 4, 2016, 10:57am

Thanks for the response!

But what about the breaks between separate scenes? Those aren’t represented by a blank paragraph that would get replaced by a non-breaking space; they’re not represented by anything at all but generated by Scrivener. Apparently Scrivener throws a non-breaking space paragraph in there, too. Why not

instead?

AmberV · May 4, 2016, 7:43pm

That’s what I was referring to when I said we make a document look a certain way and then pass that to the HTML converter. The software inserts an empty paragraph and the HTML converter dutifully inserts what is necessary to make an empty paragraph display in most contexts. I’m not aware of any construct in RTF that would universally be considered an

in the modern sense of the element.

Have you considered using MultiMarkdown with Scrivener? A lot of what I’m saying here is owing to the limitation of being an RTF based editor and trying to generate clean HTML out of that. MMD works by ignoring all of the rich text stuff and using Scrivener more like a plain-text editor with a simple syntax based heavily on Markdown. MMD itself does not have an ePub generator, but (a) the HTML5 it produces is super clean and semantic, and (b) there is another tool called Pandoc which can take MMD files created in Scrivener and turn them into ePubs—it does a pretty good job of it, too.

Robotech_Master · March 23, 2019, 2:45am

So, I’m curious. Since I started this thread, and wrote these articles—

http://www.teleread.com/how-epubs-readers-disagree-when-non-breaking-spaces-break-standards/

http://www.teleread.com/blank-line-issue-pits-scrivener-against-epub-standards/

—has anything changed?

Does Scrivener have any better way to indicate section breaks that is honored by the majority of e-readers yet?