Not a kernel guy

… in the Windows kernel team

Thursday, November 23, 2006

Забавное чтиво про Unicode BOM.

Наткнулся сегодня на забавный пост в блоге Michael Kaplan: Every character has a story #4: U+feff (alternate title: UTF-8 is the BOM, dude!). Майкл умудряется интересно писать про скушные вещи вроде Unicode BOM, распознавание кодировок и т.п. Избранные места:

Enter Microsoft.

(Yes, I know — boo, hiss, etc.)

But every 4-6 months another huge thread on the Unicode List gets started about how bad the BOM is for UTF-8 and how it breaks UNIX tools that have been around and able to support UTF-8 without change for decades2 and about how Microsoft is evil for shipping Notepad that causes all of these problems and how neither the W3C nor Unicode would have ever supported a UTF-8 BOM if Microsoft did not have Notepad doing it, and so on, and so on.

2 - Never mind that Unicode has not existed for that long, let alone UTF-8!

Posted at 10:00 pm •

Powered by WordPress