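Is there an R object or package that contains, for example, all characters available in the UTF-8 encoding, accessible in some structured way? I am thinking of something like: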
> object$vowels
[1] "A" "E" "I" "O" "U" "a" "e" "i" "o" "u"
> object$umlauts
[1] "Ä" "Ö" "Ü" "ä" "ö" "ü"
> object$special
[1] "!" "§" "$" "%" "&" "/" ...
Basically, I want to save the time of collecting every possible character and classifying it by hand. I know there is
letters[1:26]
Is there a similar object for other characters?
Answer 0 (score: 3)
Using Wikipedia as a reference, we can create our own function:
myFunc <- function(type = "special") {
  # code points taken from https://en.wikipedia.org/wiki/UTF-8
  if (type == "special") {
    myRange <- c(32:47, 58:64, 91:96, 123:126)
  } else {
    # extend as required, e.g.
    # if (type == "vowels") myRange <- c(..)
    stop("unsupported type: ", type)
  }
  # return the individual characters
  unlist(strsplit(intToUtf8(myRange), ""))
}
myFunc(type = "special")
# [1] " " "!" "\"" "#" "$" "%" "&" "'" "(" ")" "*" "+" "," "-" "."
# [16] "/" ":" ";" "<" "=" ">" "?" "@" "[" "\\" "]" "^" "_" "`" "{"
# [31] "|" "}" "~"
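To get closer to the `object$vowels` style of access from the question, the same idea can be wrapped in a named list. This is only a sketch: `charSets` and `toChars` are hypothetical names chosen here, not an existing R object or package, and the code points for the umlauts are an assumption based on the Latin-1 block of Unicode.

```r
# Hypothetical helper: turn a vector of code points into single characters.
toChars <- function(codes) intToUtf8(codes, multiple = TRUE)

# Hypothetical named list of character groups; extend as required.
charSets <- list(
  vowels  = strsplit("AEIOUaeiou", "")[[1]],
  # U+00C4, U+00D6, U+00DC, U+00E4, U+00F6, U+00FC
  umlauts = toChars(c(196, 214, 220, 228, 246, 252)),
  # same ASCII ranges as in myFunc above (space and punctuation)
  special = toChars(c(32:47, 58:64, 91:96, 123:126))
)

charSets$vowels
charSets$umlauts
length(charSets$special)
```

Building the groups from code points rather than typed literals keeps the script independent of the source file's encoding, which is the main reason for using `intToUtf8()` here.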