Unicode support for non-unicode locales

Shun-ichi GOTO shunichi.goto at gmail.com
Mon Oct 8 16:59:52 UTC 2007


2007/10/9, Shun-ichi GOTO <shunichi.goto at gmail.com>:
> If we treat filename as raw byte data, some filename might be broken
> in path operation. So the Python code shold handle filename as unicode
> characters by decoding.

In fact, current mercurial cannot manage some filename.
For example, a filename "正規表現.txt" is the case.
4 characters "正規表現" is Japanese of "regular expression"
and 2nd byte of 3rd character is '\' (0x5c).
So, hg ci -Am "test"  fails on adding this file.

{{{
[c:\temp\test]hg ci -Am initial
adding 正規・現.txt
removing 正規・現.txt
dir1/正規・現.txt not tracked!
正規・現.txt not tracked!
nothing changed
}}}

-- 
Shun-ichi GOTO



More information about the Mercurial-devel mailing list