Regular expression to extract this semi-formatted data

Time: 2010-04-01 20:48:50

Tags: ruby regex

Okay, I can't figure out how to do this.

Given the following text:

Roland AX-1:


/start
Roland's AX-1 strap-on remote MIDI controller has a very impressive 45-note velocity sensitive keyboard, and has switchable velocity curves, goes octave up/down, transpose, split/layering zones, and has fun tempo control for sequencers and more. Roland's AX-1 comes with a built-in GS control for total MIDI control of GM/GS synths. Its "Expression Bar" can control pitch and mod via an almost ribbon-like controller. It's also the newest and most advanced remote controller for your synths or midi modules.
/end

Roland AX-7:

/start
Roland's AX-7 builds on the infamous Roland AX-1 design. You just strap it on and put it to the front of the stage. Offering several controllers, such as: a D-Beam, then you can open the door to amazing live performance. 7-segment LED display, larger patch memory (Around 128 patches with MIDI data backup), and comes with GM2/GS compatibility make it extra easy to use. The 45-note, velocity-sensitive keyboard. 5 realtime controllers including a data entry knob, touch controller knob, opression bar, a hold button, and D-Beam. 128 patches with MIDI data backup. 2 MIDI zones.
/end

I am trying to use the following:

/^([\w\d \-]*):\s\s\s\s^\/start([^\:]*)\/end$/im

You can see it on Rubular here:

http://rubular.com/r/BVRRHsnWdp

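Thanks for your help. I think what I am trying to do is match each block of text until I hit the next heading, which always ends with :$

2 answers:

Answer 0: (score: 1)

The problem with your solution is that [^\:]* does not allow any colons inside the paragraph, but the paragraphs do contain some. Try this instead, which uses a non-greedy match for the paragraph: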

/^([\w\d \-]*):\s+^\/start(.*?)\/end$/im
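As a rough sketch of how that pattern might be applied, String#scan can collect every heading/paragraph pair in one pass. The sample text below is abbreviated from the question, and the variable names (text, pattern, entries) are only illustrative:

```ruby
# Abbreviated sample in the same shape as the question's input (assumed layout).
text = <<~TEXT
  Roland AX-1:


  /start
  Roland's AX-1 strap-on remote MIDI controller. Its "Expression Bar" can control pitch and mod.
  /end

  Roland AX-7:

  /start
  Offering several controllers, such as: a D-Beam. 2 MIDI zones.
  /end
TEXT

# The pattern from this answer: /m lets . match newlines, and .*? stops at the first /end.
pattern = /^([\w\d \-]*):\s+^\/start(.*?)\/end$/im

# scan returns one [heading, paragraph] pair per match; strip trims the surrounding newlines.
entries = text.scan(pattern).map { |title, body| [title, body.strip] }.to_h

entries.each { |title, body| puts "#{title} => #{body}" }
```

Because the paragraph group is non-greedy, the colon in "such as: a D-Beam" stays inside the AX-7 paragraph instead of breaking the match.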

Answer 1: (score: 0)

Take a look at Ruby's "flip-flop" operator. It is very useful for this kind of problem.

"When would a Ruby flip-flop be useful?" covers this.
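
For illustration only, here is one way the flip-flop idea could look for this input. This is a sketch, not taken from the linked question; it assumes every heading is a single line ending in a colon and that /start and /end each sit on their own line. The names text, entries, and title are made up for the example:

```ruby
# Abbreviated sample in the same shape as the question's input (assumed layout).
text = <<~TEXT
  Roland AX-1:


  /start
  Roland's AX-1 strap-on remote MIDI controller.
  /end

  Roland AX-7:

  /start
  Offering several controllers, such as: a D-Beam.
  /end
TEXT

entries = {}
title = nil

text.each_line do |raw|
  line = raw.chomp
  # Flip-flop: turns on at the /start line and stays on through the /end line.
  if (line == "/start")..(line == "/end")
    next if line == "/start" || line == "/end" # skip the marker lines themselves
    (entries[title] ||= []) << line
  elsif line.end_with?(":")
    # Outside a block, a line ending in a colon is treated as the next heading.
    title = line.chomp(":")
  end
end

entries.each { |heading, lines| puts "#{heading} => #{lines.join(' ')}" }
```

Since the heading check only runs outside the /start../end range, colons inside a paragraph are never mistaken for a heading.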