Spell checking, yet again, multiple dictionaries
Posted: Thu Apr 24, 2014 2:24 am
My website uses a number of "languages":
1. Canadian English ( colour, travelling )
2. US English ( color, traveling ), mostly in the many quotations from Americans
3. slang. Mostly in quotations. Outside that I consider it an error.
4. smatterings of French, German, Latin, Hebrew and Arabic, along with a translation.
5. gibberish: strange text that should not be spell checked in any language, e.g. part numbers, unlinked URLs.
In each of these are embedded entities. My old workhorse SlickEdit editor
does not do UTF-8, so I can't code them directly. I find newer editors
are slow and clumsy by comparison.
What would be nice is for HTMLValidator to use an appropriate dictionary, and extras dictionary for each language and for it to treat entities as if I had coded the equivalent Unicode char.
How would the markup tell HTMLValidator which dictionary to use?
HTML5 has <span lang="en".
You could use:
ar=Arabic
de=German
en-CA=Canada
en-US=USA
en-x-slang=slang
en-x-none=gibberish
en=English
fr=French
fr-CA=French Canadian
he=Hebrew
la=Latin
see http://www.w3.org/International/article ... iew.en.php
the choice is wide.
For me, HTMLValidator is mainly for spell checking. My markup rarely fails because it is mostly computer-generated. It only fails when I am debugging changes to the generator programs.
I would like to easily toggle back and forth between checking markup, comments or both.
1. Canadian English ( colour, travelling )
2. US English ( color, traveling ), mostly in the many quotations from Americans
3. slang. Mostly in quotations. Outside that I consider it an error.
4. smatterings of French, German, Latin, Hebrew and Arabic, along with a translation.
5. gibberish: strange text that should not be spell checked in any language, e.g. part numbers, unlinked URLs.
In each of these are embedded entities. My old workhorse SlickEdit editor
does not do UTF-8, so I can't code them directly. I find newer editors
are slow and clumsy by comparison.
What would be nice is for HTMLValidator to use an appropriate dictionary, and extras dictionary for each language and for it to treat entities as if I had coded the equivalent Unicode char.
How would the markup tell HTMLValidator which dictionary to use?
HTML5 has <span lang="en".
You could use:
ar=Arabic
de=German
en-CA=Canada
en-US=USA
en-x-slang=slang
en-x-none=gibberish
en=English
fr=French
fr-CA=French Canadian
he=Hebrew
la=Latin
see http://www.w3.org/International/article ... iew.en.php
the choice is wide.
For me, HTMLValidator is mainly for spell checking. My markup rarely fails because it is mostly computer-generated. It only fails when I am debugging changes to the generator programs.
I would like to easily toggle back and forth between checking markup, comments or both.