[PATCH 1 of 8] use UTF-8 to encode/decode log text

Andrey grooz-work at gorodok.net
Mon Nov 20 19:02:58 UTC 2006


On 21 November 2006 (Tue) 00:43, you wrote:
> On Tue, Nov 21, 2006 at 12:48:48AM +0600, Andrey wrote:
> > On 21 November 2006 (Tue) 00:14, Alexis S. L. Carvalho wrote:
> > > Thus spake Andrey:
> > > > @@ -60,6 +62,7 @@ class changelog(revlog):
> > > >          """
> > > >          if not text:
> > > >              return (nullid, "", (0, 0), [], "", {})
> > > > +        text = unicode(text, CHANGELOG_ENCODING)
> > >
> > > Should we encode/decode the whole changelog text or just the user and
> > > comment sections?
> > >
> > > I'm not sure about the extra section (branch name should be UTF-8, but
> > > I don't know if binary data is forbidden), but, at least for now, I
> > > think we don't want to encode/decode the list of files.
> >
> > I see. It seems that only the comment should be encoded for now, and
> > maybe extra.
>
> Don't forget user!

Then we have to make the config parser understand Unicode first, because the
user name is usually read from the config file.
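
To make the suggestion in this thread concrete, here is a minimal sketch of
decoding only the user and comment fields while leaving the file list as raw
bytes. It is not the patch itself: the function name decode_entry and the
assumed field layout (manifest, user, date, file names, a blank line, then the
comment) are illustrative assumptions, the extra section is ignored, and it is
written for Python 3 rather than the unicode() call in the quoted hunk.
CHANGELOG_ENCODING is taken from the hunk, and its value here is assumed to be
UTF-8 as in the subject line.

```python
# Sketch only: decode_entry and the field layout are assumptions made
# for illustration, not the actual changelog code.
CHANGELOG_ENCODING = "utf-8"


def decode_entry(text: bytes):
    """Split a raw entry and decode only the user and comment fields."""
    if not text:
        return (b"", "", b"", [], "")
    # Assumed layout: header lines, one blank line, then the comment.
    header, _, comment = text.partition(b"\n\n")
    lines = header.split(b"\n")
    manifest, user, date = lines[0], lines[1], lines[2]
    files = lines[3:]  # file names are kept as raw bytes, not decoded
    return (
        manifest,
        user.decode(CHANGELOG_ENCODING),
        date,
        files,
        comment.decode(CHANGELOG_ENCODING),
    )


if __name__ == "__main__":
    raw = (
        b"abc123\n"
        + "Андрей".encode(CHANGELOG_ENCODING)
        + b"\n0 0\nfoo\xff.txt\n\n"
        + "комментарий".encode(CHANGELOG_ENCODING)
    )
    print(decode_entry(raw))
```

Because the file names are never decoded, a name that is not valid UTF-8
(such as the one in the example) passes through unchanged instead of raising
an error.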


