我正在编写一个小型ruby程序,需要从给定的字符串中提取时间和日期...它最终会猜测它是什么类型的约会/会议并使用它来最好地为用户服务。不幸的是,下面的正则表达式在我的崇高文本正则表达式搜索中起作用,但在ruby中不起作用。
email = "
Med Check
From Google Calendar
This invitation is out of date. This event has been updated.
View updated information on Google Calendar
more details »
Med Check
When
Mon Jan 10, 2013 9:30am – 10:30am Eastern Time
Calendar
Going? Yes - Maybe - No more options »".downcase;
require 'time'
#January February March April May June July August September October November December
r = /(jan(|uary)|feb(|uary)|mar(|ch)|apr(|il)|may|jun(|e)|jul(|y)|aug(|ust)|sept(|tember)|oct(|ober|)|nov(|ember)|dec(|ember)|(\b([1-9]|[12][0-9]|3[01])\b))( |/|,|)(([0-3]|)[0-9]|)(, |\/| )\b2[0-9]{3}\b/
if email[r]
puts email[r]
date =Date.parse(email[r])
puts " We found a date.. Let's see if we can find a time: #{date}"
if date< Date.today
puts "Why do we need to worry about this?"
else
r = /([0-9]|)[0-9]:[0-9][0-9][ |pm|am|](am|pm)/
if email[r]
time = Time.parse("#{email[r]} #{date}")
puts "Found time #{time}"
if time<Time.now
puts "Error: Time before #{Time.now}"
else
#Great!
puts "Finished let's add it."
end
end
end
end
引发这些错误:
/Users/michael/Downloads/parse.rb:27: end pattern with unmatched parenthesis: /(jan(|uary)|feb(|uary)|mar(|ch)|apr(|il)|may|jun(|e)|jul(|y)|aug(|ust)|sept(|tember)|oct(|ober|)|nov(|ember)|dec(|ember)|(\b([1-9]|[12][0-9]|3[01])\b))( |/
/Users/michael/Downloads/parse.rb:27: syntax error, unexpected ','
...1-9]|[12][0-9]|3[01])\b))( |/|,|)(([0-3]|)[0-9]|)(, |\/| )\b...
... ^
/Users/michael/Downloads/parse.rb:27: syntax error, unexpected ')'
...-9]|3[01])\b))( |/|,|)(([0-3]|)[0-9]|)(, |\/| )\b2[0-9]{3}\b/
... ^
/Users/michael/Downloads/parse.rb:27: syntax error, unexpected ']', expecting ')'
...[01])\b))( |/|,|)(([0-3]|)[0-9]|)(, |\/| )\b2[0-9]{3}\b/
... ^
/Users/michael/Downloads/parse.rb:49: syntax error, unexpected end-of-input, expecting ')'
答案 0 :(得分:5)
你的正则表达式中有一个未转义的正斜杠字符。您可以使用Rubular来测试表达式,如http://rubular.com/r/TvoKOsSImt
答案 1 :(得分:1)
将/
更改为\/
。
意思是,在你的正则表达式中写正向或反斜杠之前,在它们之前加一个反斜杠(\
)。
答案 2 :(得分:1)
将此更改:(, |/| )
更改为此(, |\/| )
未转义的斜杠导致ruby正则表达式编译器认为您已提前终止了正则表达式。在斜杠之前添加反斜杠会使其在正则表达式中被视为正常斜杠,而不是被解释为指示正则表达式结束的分隔符。