包含所有字符的对象或包

时间:2016-05-18 10:44:49

标签: r utf-8 escaping character special-characters

是否有一个R对象或包,其中包含例如可用的所有字符。以某种方式可以访问的UTF-8编码? 我在想类似的东西:

> object$vowels
 [1] "A" "E" "I" "O" "U" "a" "e" "i" "o" "u"
> object$umlauts
[1] "Ä" "Ö" "Ü" "ä" "ö" "ü"
> object$special
[1] "!"   "§"   "$"   "%"   "&"   "/"   ...

基本上我想节省收集每个可能角色的时间并相应地手动分类。我知道有

letters[1:26]

其他角色是否有类似的对象?

1 个答案:

答案 0 :(得分:3)

使用wikipedia的引用,我们可以创建自己的函数:

myFunc <- function(type = "special") {
  # https://en.wikipedia.org/wiki/UTF-8
  if(type == "special") myRange <- c(32:47, 58:64, 91:96, 123:126) 

  # extend as required
  #if(type == "vowels") myRange <- c(..)

  #return
  unlist(strsplit(intToUtf8(myRange), ""))
}


myFunc(type = "special")
# [1] " "  "!"  "\"" "#"  "$"  "%"  "&"  "'"  "("  ")"  "*"  "+"  ","  "-"  "." 
# [16] "/"  ":"  ";"  "<"  "="  ">"  "?"  "@"  "["  "\\" "]"  "^"  "_"  "`"  "{" 
# [31] "|"  "}"  "~"