Page 1 of 1

What is the best way to convert from HTML to DSL?

PostPosted: Wed Oct 09, 2013 11:20 am
by veole
Using Regular Expressions in a text editor? Is there a script or some tool to help to do it? It seems that the most important step to make a DSL dictionary is to use Calibre to convert the source to HTML. And then what?

I've seen perfectly formated tables in DSL (no images), so there has to me something to get from HTML to DSL so nicely. Calibre leaves a lot of trash to use RegEx effectively.

Thank you.

Re: What is the best way to convert from HTML to DSL?

PostPosted: Mon Oct 21, 2013 6:17 pm
by C2BlEv
Yes, regex is the approach used by most DSL folks. I use emeditor on a Win machine. You need to learn the DSL language first (start with the description of DSL, then open the dsl files for the dictionaries that you like). DSL is like a very basic html, so you will need to make decisions on what to do with the html formatting that DSL does not support.

Re: What is the best way to convert from HTML to DSL?

PostPosted: Tue Oct 22, 2013 5:34 pm
by veole
I understand how DSL and RegEx work. I can edit dictionaries and fix and renew their format. But if I open a html file with Emeditor, I don't know html and it seems very confusing.

Maybe someone could try to do a script (standalone or for Calibre) to convert html to a basic DSL format, so you don't have to face a plain html code every time you want to make a new DSL dictionary.