Unicode usage level on the web

MikeGale
Rank VI - Professional
Posts: 709
Joined: Mon Dec 13, 2004 1:50 pm
Location: Tannhauser Gate

Unicode usage level on the web

Post by MikeGale » Sun Feb 05, 2012 5:42 pm

Interesting analysis of the web.

The majority of web content is now Unicode (as of January 2012).

http://j.mp/Athi9x

Albert Wiersch
Site Admin
Posts: 3417
Joined: Sat Dec 11, 2004 9:23 am
Location: Near Dallas, TX

Re: Unicode usage level on the web

Post by Albert Wiersch » Mon Feb 06, 2012 2:27 pm

Thanks Mike... good article & good news.

This prompted me to comment out the 'iso-8859-1' related entries in the HTML inserter in CSE HTML Validator. I don't see a good enough reason to keep them, but I do have a good reason to remove them (why use them when you can use UTF-8?).
Albert Wiersch

MikeGale
Rank VI - Professional

Re: Unicode usage level on the web

Post by MikeGale » Mon Feb 06, 2012 5:08 pm

Agreed, UTF-8 is the way to go.

Having said that, I encounter one problem fairly often. People who don't understand Unicode often edit a web page with an editor that isn't up to date. The result is that a competent piece of markup gets broken. Worse, the problem might be hard to spot. My answer is to say "use a validator" and officially wash my hands of any damage that might be caused.
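
For anyone who wants a quick check before reaching for a full validator, here is a minimal sketch of how this kind of breakage could be detected. It assumes the page is meant to be UTF-8 and that the out-of-date editor re-saved it as ISO-8859-1; the function name `first_invalid_utf8_offset` and the sample markup are made up for illustration and are not part of any tool mentioned in this thread.

```python
from typing import Optional


def first_invalid_utf8_offset(data: bytes) -> Optional[int]:
    """Return the byte offset of the first invalid UTF-8 sequence,
    or None if the data decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the index of the first byte that could not be decoded.
        return err.start
    return None


# Hypothetical page fragment containing one non-ASCII character (e with acute).
page = "<p>caf\u00e9</p>"

# Saved correctly as UTF-8.
good = page.encode("utf-8")

# Assumed failure mode: the same text re-saved by an editor that writes
# ISO-8859-1 bytes, while the page is still served and declared as UTF-8.
broken = page.encode("iso-8859-1")

print(first_invalid_utf8_offset(good))
print(first_invalid_utf8_offset(broken))
```

This only catches bytes that are not valid UTF-8. It will not notice text that was mangled and then re-saved as valid UTF-8 (the classic doubled-up accent characters), which is one reason the damage can be hard to spot by eye and a validator is still worth running.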
