我有一个PostgreSQL数据库,它有一个表,主键应用于三列。根据数据库,密钥上有一个索引:
Indexes:
"full_log_pkey" PRIMARY KEY, btree (server_name, line_number, log_generation)
然而,一些简单的测试表明我有重复的密钥:
select count(*) from full_log;
count
----------
60644405
select count(*) from
(select distinct server_name,
line_number,
log_generation
from full_log) as foo;
count
----------
60636564
显然,与行相比,不同的行(基于主键)更少。我的问题是,这怎么可能?
编辑:完整的表定义如下:
Table "public.full_log"
Column | Type | Modifiers
----------------+-----------------------------+-----------
activity | character(1) |
archivaldate | timestamp without time zone |
media_type | character varying(5) |
vsn | text |
archive_set | character varying(20) |
copy | smallint |
file_start | integer |
file_offset | integer |
fs_name | character varying(20) |
inode | double precision |
file_length | bigint |
file_type | character(1) |
overflow | integer |
device_number | integer |
server_name | text | not null
path | text |
line_number | integer | not null
log_generation | integer | not null
Indexes:
"full_log_pkey" PRIMARY KEY, btree (server_name, line_number, log_generation)
Foreign-key constraints:
"full_log_server_name_fkey" FOREIGN KEY (server_name) REFERENCES servers(server_name)
Rules:
insert_update_full_log AS
ON INSERT TO full_log
WHERE (EXISTS ( SELECT full_log.activity, full_log.archivaldate, full_log.media_type, full_log.vsn, full_log.archive_set, full_log.copy, full_log.file_start, full_log.file_offset, full_log.fs_name, full_log.inode, full_log.file_length, full_log.file_type, full_log.overflow, full_log.device_number, full_log.server_name, full_log.path, full_log.line_number, full_log.log_generation
FROM full_log
WHERE full_log.server_name = new.server_name AND full_log.line_number = new.line_number AND full_log.log_generation = new.log_generation)) DO INSTEAD UPDATE full_log SET activity = new.activity, archivaldate = new.archivaldate, media_type = new.media_type, vsn = new.vsn, archive_set = new.archive_set, copy = new.copy, file_start = new.file_start, file_offset = new.file_offset, fs_name = new.fs_name, inode = new.inode, file_length = new.file_length, file_type = new.file_type, overflow = new.overflow, device_number = new.device_number, path = new.path
WHERE full_log.server_name = new.server_name AND full_log.line_number = new.line_number AND full_log.log_generation = new.log_generation
有关重复行的示例:
select * from full_log where line_number = 6332986;
activity | archivaldate | media_type | vsn | archive_set | copy | file_start | file_offset | fs_name | inode | file_length | file_type | overflow | device_number | server_name | path | line_number | log_generation
----------+---------------------+------------+--------+-------------+------+------------+-------------+---------+------------+-------------+-----------+----------+---------------+-------------+-------------------------------------------------------------------------------------------+-------------+----------------
A | 2010-10-13 10:49:49 | ti | Z00711 | lcbp_rel | 1 | 226237 | 779099 | lcbp | 21798068.3 | 31198108 | f | 0 | 8511 | redact | wdl/delivery/irishparis_2010_09/MSE2_Histoire des rois d'Angleterre/MSE2_239.TIF | 6332986 | 1
A | 2010-10-13 10:49:49 | ti | Z00711 | lcbp_rel | 1 | 226237 | 779099 | lcbp | 21798068.3 | 31198108 | f | 0 | 8511 | redact | wdl/delivery/irishparis_2010_09/MSE2_Histoire des rois d'Angleterre/MSE2_239.TIF | 6332986 | 1
(2 rows)
答案 0 :(得分:4)
此查询返回什么内容?
select server_name, line_number, log_generation
from full_log
group by server_name, line_number, log_generation
having count(*) > 1
将其与
进行比较可能会有所帮助select line_number, log_generation
from full_log
group by line_number, log_generation
having count(*) > 1
但可能没有。我认为这个条款
WHERE (EXISTS ( SELECT full_log.activity,
full_log.archivaldate,
full_log.media_type,
full_log.vsn,
full_log.archive_set,
full_log.copy,
full_log.file_start,
full_log.file_offset,
full_log.fs_name,
full_log.inode,
full_log.file_length,
full_log.file_type,
full_log.overflow,
full_log.device_number,
full_log.server_name,
full_log.path,
full_log.line_number,
full_log.log_generation
FROM full_log
WHERE full_log.server_name = new.server_name
AND full_log.line_number = new.line_number
AND full_log.log_generation = new.log_generation))
可以简化为本条款。 (虽然我认为这不会导致问题。)
WHERE (EXISTS ( SELECT full_log.server_name,
full_log.line_number,
full_log.log_generation
FROM full_log
WHERE full_log.server_name = new.server_name
AND full_log.line_number = new.line_number
AND full_log.log_generation = new.log_generation))
您说当您更改非键列的数据类型时,PostgreSQL会删除并重新创建索引。我没有看到这里发生的事情,我不确定我是否见过这种情况。我可能没有注意到更改是否成功,并且我不经常更改列的数据类型。 (现在我已经说过了,我最后一次不能告诉你。)我现在有了PostgreSQL 9.0.2。
答案 1 :(得分:1)
您使用的是表继承吗?如果是这样,PK不会跨子女强制执行(至少不是8.2)。
答案 2 :(得分:0)
我怀疑是NULL值。也许查询是否有任何。
答案 3 :(得分:0)
我已经看到当目标行的ctid由于之前的触发器而改变时会发生这种情况。无法回想起确切的步骤,但这基本上就是问题所在。
如果你在那里有触发器,我对这个相关主题的回答可能就是你遇到的:
What are PostgreSQL RULEs good for?
要点是:如果在发出插入/更新/删除语句时,postgres没有返回正确数量的受影响的行,则可能会遇到奇怪的问题。然后,规则(或使用后触发器)可以帮助摆脱它们。