我需要做一些非常奇怪的事情,即在视图中创建假记录 以填补产品价格的发布日期之间的差距。
实际上,我的情况比这复杂一点,但我已经简化为产品/日期/价格。
假设我们有这张表:
create table PRICES_TEST
(
PRICE_DATE date not null,
PRODUCT varchar2(13) not null,
PRICE number
);
alter table PRICES_TEST
add constraint PRICES_TEST_PK
primary key (PRICE_DATE, PRODUCT);
有了这些记录:
insert into PRICES_TEST values (date'2012-04-15', 'Screw Driver', 13);
insert into PRICES_TEST values (date'2012-04-18', 'Screw Driver', 15);
insert into PRICES_TEST values (date'2012-04-13', 'Hammer', 10);
insert into PRICES_TEST values (date'2012-04-16', 'Hammer', 15);
insert into PRICES_TEST values (date'2012-04-19', 'Hammer', 17);
选择记录将返回给我:
PRICE_DATE PRODUCT PRICE
------------------------- ------------- ----------------------
13-Apr-2012 00:00:00 Hammer 10
16-Apr-2012 00:00:00 Hammer 15
19-Apr-2012 00:00:00 Hammer 17
15-Apr-2012 00:00:00 Screw Driver 13
18-Apr-2012 00:00:00 Screw Driver 15
假设今天是2012年4月21日,我需要一个视图,每天都要重复每个价格,直到新价格发布。像这样:
PRICE_DATE PRODUCT PRICE
------------------------- ------------- ----------------------
13-Apr-2012 00:00:00 Hammer 10
14-Apr-2012 00:00:00 Hammer 10
15-Apr-2012 00:00:00 Hammer 10
16-Apr-2012 00:00:00 Hammer 15
17-Apr-2012 00:00:00 Hammer 15
18-Apr-2012 00:00:00 Hammer 15
19-Apr-2012 00:00:00 Hammer 17
20-Apr-2012 00:00:00 Hammer 17
21-Apr-2012 00:00:00 Hammer 17
15-Apr-2012 00:00:00 Screw Driver 13
16-Apr-2012 00:00:00 Screw Driver 13
17-Apr-2012 00:00:00 Screw Driver 13
18-Apr-2012 00:00:00 Screw Driver 15
19-Apr-2012 00:00:00 Screw Driver 15
20-Apr-2012 00:00:00 Screw Driver 15
21-Apr-2012 00:00:00 Screw Driver 15
任何想法如何做到这一点?我不能真正使用其他辅助表,触发器或PL / SQL编程,我真的需要使用一个视图来做到这一点。
我认为这可以使用oracle分析完成,但我不熟悉。我试着读这个http://www.club-oracle.com/articles/analytic-functions-i-introduction-164/,但我根本没有得到它。
答案 0 :(得分:6)
您可以使用CONNECT BY LEVEL
语法创建行生成器语句,与表中的不同产品交叉连接,然后将外部连接到您的价格表。最后一步是使用LAST_VALUE
函数和IGNORE NULLS
重复价格,直到遇到新值,并且因为您想要一个带有CREATE VIEW
语句的视图:
create view dense_prices_test as
select
dp.price_date
, dp.product
, last_value(pt.price ignore nulls) over (order by dp.product, dp.price_date) price
from (
-- Cross join with the distinct product set in prices_test
select d.price_date, p.product
from (
-- Row generator to list all dates from first date in prices_test to today
with dates as (select min(price_date) beg_date, sysdate end_date from prices_test)
select dates.beg_date + level - 1 price_date
from dual
cross join dates
connect by level <= dates.end_date - dates.beg_date + 1
) d
cross join (select distinct product from prices_test) p
) dp
left outer join prices_test pt on pt.price_date = dp.price_date and pt.product = dp.product;
答案 1 :(得分:4)
我认为我有一个解决方案,使用CTE的最终结果的增量方法:
with mindate as
(
select min(price_date) as mindate from PRICES_TEST
)
,dates as
(
select mindate.mindate + row_number() over (order by 1) - 1 as thedate from mindate,
dual d connect by level <= floor(SYSDATE - mindate.mindate) + 1
)
,productdates as
(
select p.product, d.thedate
from (select distinct product from PRICES_TEST) p, dates d
)
,ranges as
(
select
pd.product,
pd.thedate,
(select max(PRICE_DATE) from PRICES_TEST p2
where p2.product = pd.product and p2.PRICE_DATE <= pd.thedate) as mindate
from productdates pd
)
select
r.thedate,
r.product,
p.price
from ranges r
inner join PRICES_TEST p on r.mindate = p.price_date and r.product = p.product
order by r.product, r.thedate
mindate
检索数据集中最早的可能日期dates
生成从最早可能日期到今天的日期日历。productdates
cross加入所有可能日期的产品ranges
确定在每个日期应用的价格日期inner join
条件过滤掉没有相关价格日期的日期答案 2 :(得分:4)
我对沃尔夫的出色回答做了一些修改。
我将子查询因子(WITH
)替换为connect by
中的常规子查询。这使代码更简单一些。 (虽然这种类型的代码起初看起来很奇怪,所以这里可能没有大的收获。)
最重要的是,我使用了分区外连接而不是交叉连接和外连接。分区外连接也有点奇怪,但它们适用于这种情况。这使代码更简单,并且应该提高性能。
select
price_dates.price_date
,product
,last_value(price ignore nulls) over (order by product, price_dates.price_date) price
from
(
select trunc(sysdate) - level + 1 price_date
from dual
connect by level <= trunc(sysdate) -
(select min(trunc(price_date)) from prices_test) + 1
) price_dates
left outer join prices_test
partition by (prices_test.product)
on price_dates.price_date = prices_test.price_date;
答案 3 :(得分:1)
我刚刚意识到@Wolf和@jonearles的改进并没有返回我需要的确切结果,因为列出所有日期的行生成器不会按产品生成范围。如果产品A的第一个价格晚于产品B的任何价格,则产品A的第一个列出日期仍然必须相同。但他们确实帮助我进一步努力并获得了预期的结果:
我开始改变@ wolf的日期范围选择器:
select min(price_date) beg_date, sysdate end_date from prices_test
到此:
select min(PRICE_DATE) START_DATE, sysdate as END_DATE, PRODUCT
from PRICES_TEST group by sysdate, PRODUCT
但是,不知何故,每个产品的行数在每个级别都呈指数级增长。我刚刚在outter查询中添加了一个独特的内容。最后的选择是:
select
DP.PRICE_DATE,
DP.PRODUCT,
LAST_VALUE(PT.PRICE ignore nulls) over (order by DP.PRODUCT, DP.PRICE_DATE) PRICE
from (
select distinct START_DATE + DAYS as PRICE_DATE, PRODUCT
from
(
-- Row generator to list all dates from first date of each product to today
with DATES as (select min(PRICE_DATE) START_DATE, sysdate as END_DATE, PRODUCT from PRICES_TEST group by sysdate, PRODUCT)
select START_DATE, level - 1 as DAYS, PRODUCT
from DATES
connect by level < END_DATE - START_DATE + 1
order by 3, 2
) d order by 2, 1
) DP
left outer join prices_test pt on pt.price_date = dp.price_date and pt.product = dp.product;
@Mellamokb解决方案实际上是我真正需要的,当然比我的noobie解决方案更好。
感谢大家不仅帮助我,而且还提供了诸如“with”和“connect by”等功能。