alexyz
September 15th, 2009, 09:15 PM
I have data in the Bengali language (Bangladesh) that I need to convert to Unicode.
The data is encoded using an outdated non-Unicode approach, in which the font has Bengali glyphs in place of Latin characters. Because of this, the text is only readable with this particular font, unlike Unicode text, which displays correctly in any Unicode Bengali font.
I'm not sure if I'm using precisely the right terminology here, but one way to understand the problem is to look at the font using a font viewer or character map. You see Bengali characters in the "Latin" character space where you would normally see "ABCDEF...". With a Unicode font you see "ABCDEF..." in the "Latin" character space, and the Bengali characters elsewhere in their own space. Here's another explanation: http://www.bornosoft.com/kb_interface/efont.htm
It might be possible to convert this data using the recode or iconv commands, but I'm not sure what the "from" encoding should be. I'd prefer a command-line, scriptable solution rather than a GUI.
The font being used is called SulekhaT.
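To make the goal concrete, here is a rough Python sketch of the kind of table-driven conversion I have in mind. Big caveat: the three mappings below are made-up placeholders, NOT the real SulekhaT layout, which I don't have. The names LEGACY_TO_UNICODE and convert are just my own for illustration. A real converter would probably also need to reorder some characters (legacy fonts tend to store vowel signs in visual order, while Unicode stores them after the consonant), which this sketch does not attempt.

```python
# Sketch only: the pairs below are HYPOTHETICAL placeholders, not the
# actual SulekhaT mapping. The real table would have to be built by
# inspecting the font in a character map.
LEGACY_TO_UNICODE = {
    "A": "\u0985",  # placeholder: Latin "A" slot -> BENGALI LETTER A
    "K": "\u0995",  # placeholder: Latin "K" slot -> BENGALI LETTER KA
    "L": "\u0996",  # placeholder: Latin "L" slot -> BENGALI LETTER KHA
}

# str.maketrans accepts a dict of single-character keys to replacement strings.
_TABLE = str.maketrans(LEGACY_TO_UNICODE)


def convert(text):
    """Replace each legacy character with its Unicode counterpart.

    Characters that are not in the table pass through unchanged.
    """
    return text.translate(_TABLE)
```

Something like sys.stdout.write(convert(sys.stdin.read())) would turn this into a filter that could be used in a shell pipeline, which is the sort of scriptable solution I'm after.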
I've done extensive searching with no luck. Probably because I should be searching in Bangla! Any help would be most appreciated!