Subject: BUG #3819: UTF8 can't handle \000
From: fschmidt@gmail.com ("Franklin Schmidt")
Date: 12/17/2007 7:34:39 AM
The following bug has been logged online:
Bug reference: 3819
Logged by: Franklin Schmidt
Email address: fschmidt@gmail.com
PostgreSQL version: 8.2
Operating system: XP & Linux
Description: UTF8 can't handle \000
Details:
Trying to store \000 in a text field in a UTF8-encoded database causes an
error. I assume this is because Postgres is written in C and treats a zero
byte as the end of a string, but it's still wrong, since U+0000 is a valid
Unicode code point. A solution was suggested here:
http://www.nabble.com/invalid-byte-sequence-for-encoding-%22UTF8%22%3A-0x00-tp9058998p9096326.html
"I can think of some ways the server could support it without extensive
changes .. e.g. use a modified UTF8 representation which stores \u0000 as
0xc0 0x80 internally"
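
To make the quoted suggestion concrete, here is a minimal sketch of that byte-level mapping, written in Python rather than in the server's C. It is not how Postgres works, and the function names encode_nul_safe and decode_nul_safe are made up for illustration. It covers only the U+0000 substitution from the quote. It relies on the fact that the byte 0xC0 never occurs in well-formed UTF-8, so the pair 0xC0 0x80 cannot collide with real data.

```python
def encode_nul_safe(text: str) -> bytes:
    """Encode text as UTF-8, but write U+0000 as the two bytes 0xC0 0x80.

    In well-formed UTF-8 a zero byte can only come from U+0000, so a plain
    byte-level replace is enough. The result contains no zero bytes and can
    be held in a NUL-terminated C string.
    """
    return text.encode("utf-8").replace(b"\x00", b"\xc0\x80")


def decode_nul_safe(data: bytes) -> str:
    """Reverse encode_nul_safe: map 0xC0 0x80 back to U+0000, then decode.

    0xC0 is never a valid byte in standard UTF-8, so the pair can only be
    the marker written by encode_nul_safe.
    """
    return data.replace(b"\xc0\x80", b"\x00").decode("utf-8")


if __name__ == "__main__":
    stored = encode_nul_safe("a\x00b")
    print(stored)                              # b'a\xc0\x80b'
    print(decode_nul_safe(stored) == "a\x00b")  # True
```

Note that a strict UTF-8 validator rejects 0xC0 0x80 as an overlong sequence, so a scheme like this would have to translate back to a real zero byte at every point where data leaves the server.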