Regexp_substr将字符串解析成碎片

时间:2014-02-25 19:00:02

标签: regex oracle plsql

我整个上午都被困住了,希望得到一些帮助。我一直在阅读我能找到的内容,但在我的情况下我遇到了麻烦。

我的记录与此类似:

A123-700
A123-700 / WORD-8
A123 / A456
WORD-8 / A456-800

我需要将它们分解为“类型”和“系列”并忽略“WORD-8”

例如

A123-300 would be type=A123, series=300
A123-300 / WORD-8 would be type=A123, series=300
A123 / A456 would be type=A123, type=A456
WORD-8 / A456-200 would be type=A456, series=200

到目前为止,我有这样的事情:

WITH gen AS
  ( select 'A123-700' x from dual
  UNION ALL
  select 'A123-700 / WORD-8' x from dual
  union all
  select 'A123 / A456' x from dual
  union all
  select 'WORD-8 / A456-800' x from dual
  )
SELECT x ,
  regexp_substr(x, '[^/]+')              as first_slash,
  regexp_substr(x, '[^-]+')              as first_type,
  regexp_substr(x, '-\w*')               as first_series,
  regexp_substr(x, '[^/][^DASH]+', 1, 2) as second_slash,
  regexp_substr(x, '[^/]+', 1, 2)        as second_type,
  regexp_substr(x, '-\w+', 1, 2)         as second_series
FROM gen;

但结果并不是我所希望的。 我想没有 - ,而且我的“第二”信息也没有出来。

X                 FIRST_SLASH FIRST_TYPE  FIRST_SERIES  SECOND_SLASH  SECOND_TYPE SECOND_SERIES
A123-700          A123-700    A123        -700          (null)        (null)      (null)
A123-700 / WORD-8 A123-700    A123        -700          D-8           WORD-8      -8
A123 / A456       A123        A123 / A456 (null)        A456          A456        (null)
WORD-8 / A456-800 WORD-8      WORD        -8            D-8 /         A456-800    -800

有人可以帮我指出正确的方向吗?

谢谢!

2 个答案:

答案 0 :(得分:1)

WITH 
  gen AS ( 
    select 'A123-700' x from dual
    UNION ALL
    select 'A123-700 / WORD-8' x from dual
    union all
    select 'A123 / A456' x from dual
    union all
    select 'WORD-8 / A456-800' x from dual
  ),
  t_slash as (
    SELECT x ,
      nullif(regexp_replace(x, '\s*/.*$'),'WORD-8') as first_slash,
      nullif(regexp_replace(x, '^[^/]*/?\s*'),'WORD-8') as second_slash
    FROM gen
  )
select x, first_slash, 
  regexp_substr(first_slash, '^[^-]*') as first_type,
  regexp_replace(first_slash, '^[^-]*-?') as first_series,
  second_slash,
  regexp_substr(second_slash, '^[^-]*') as second_type,
  regexp_replace(second_slash, '^[^-]*-?') as second_series
from t_slash

fiddle

答案 1 :(得分:1)

WITH gen AS
  ( select 'A123-700' x from dual
  UNION ALL
  select 'A123-700 / WORD-8' x from dual
  union all
  select 'A123 / A456' x from dual
  union all
  select 'WORD-8 / A456-800' x from dual
  )
SELECT x ,
  regexp_substr(x,'A[[:alnum:]]+',1) as first_type,
  NULLIF( regexp_substr(x,'A[[:alnum:]]+',2),
          regexp_substr(x,'A[[:alnum:]]+',1))  as second_type,
  regexp_substr(x,'(A[[:alnum:]]+)-([[:digit:]]+)',1) as full,
  regexp_substr(regexp_substr(x,'(A[[:alnum:]]+)-([[:digit:]]+)',1),
                '-([[:digit:]]+)',
                1) as first_series
FROM gen;