Question

我想问一下以下代码是否正确，以选择唯一的Genres值并计算它们：

#Import IMDB_data.csv skipping the second row
    read_File = readLines("C:/Users/Evagoras/Downloads/edwisor/R working Directory/IMDB_data.csv")
    Df = read.csv(textConnection(read_File[-2]), header = TRUE, sep=",")

#Select Genre column 
    Df_Genre= subset(Df, select = c("Genre"))

#Select unique values of Genre variable
    Df_Genre_unique=unique(Df_Genre)


#Count unique values of Genre variable
    Df_Genre_unicount= sapply(Df_Genre, function(x) length(unique(x)))

我必须编写哪些代码才能使用索引键存储在数据框中？我对索引键感到困惑，我必须将其存储在具有索引键的新列中吗？

Answer 1

d <- table(Df$Genre)
View(d)

Answer 2

要提取独特的流派，可以使用以下代码

df = unique(IMDB_data$Genre)
View(df)

并计算唯一的流派值

df1 = length(unique(IMDB_data$Genre)

View(df1)

提取唯一类型及其计数，并在R编程中使用索引键存储在数据框中

2 个答案: