Question

I'm trying to validate 'words' using Ruby 1.8.7.

My regex is currently:

/[a-zA-Z]\'*\-*/

This only matches English words; is there a way to capture non-English UTF-8 characters as well?

Answer 1

Even the 1.8.x regex engine supports UTF-8. You just need the right expression, and that takes more than /\w/ on its own (note the /u flag):

s = "résumé and some other words"
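puts s[/[a-z]+/u]  # ASCII-only class: stops at the first accented letter
puts s[/\w+/u]     # \w with /u: treats the accented letters as word characters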

You get:

r
résumé
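
The snippet above extracts the first word from a string; the question asks about validating a whole word. A minimal sketch of that follows, with several assumptions that are not from the original answer: it targets Ruby 1.9 or later (where \w is ASCII-only and \p{L} matches any Unicode letter, something the 1.8 engine does not offer as far as I know), the helper name valid_word? and the constant WORD_PATTERN are hypothetical, and a "word" is taken to be a letter followed by letters, apostrophes, or hyphens.

```ruby
# encoding: utf-8
# Hypothetical helper, not part of the original answer.
# Assumes Ruby 1.9+ and UTF-8 encoded strings.

# \A and \z anchor the match to the whole string.
# \p{L} matches any Unicode letter; apostrophes and hyphens are
# allowed anywhere after the first letter.
WORD_PATTERN = /\A\p{L}[\p{L}'-]*\z/

def valid_word?(text)
  # =~ returns the match index (0 here) or nil when nothing matches
  !(text =~ WORD_PATTERN).nil?
end

puts valid_word?("résumé")   # true
puts valid_word?("o'clock")  # true
puts valid_word?("abc123")   # false (digits are not letters)
puts valid_word?("")         # false (at least one letter is required)
```

On 1.8.7 itself, the same anchoring idea would presumably be combined with \w and the /u flag as in the answer, keeping in mind that \w also accepts digits and underscores.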